The outreach program will develop infrastructure and methods to involve a large number of researchers, faculty, and students in the analysis of the maize genome. During the outreach program we will focus on training researchers and faculty members on how to use workflows and bioinformatics programs to analyze the expected MaizeCODE data sets. Researchers, faculty, and students will be poised to participate in de novo explorations of the structure and function of the maize genome.
In collaboration with
We invite graduate students and faculty to apply to a maize genome annotation jamboree on March 10-12, 2020 in the context of the Maize Genetics Conference in Kailua-Kona, Hawai'i. Faculty will learn how they can involve students in authentic research by making meaningful contributions to genome annotation while improving reference Maize genomes at PanMaize. We will introduce tools developed by MaizeCODE and Gramene, and provide help and support for faculty bringing these tools into the classroom. Resources are available for up to $1,000 to support travel, room and board.
Marcela K. Tello-Ruiz, Cristina F. Marco, Fei-Man Hsu, Rajdeep S. Khangura, Pengfei Qiao, Sirjan Sapkota, Michelle C. Stitzer, Rachael Wasikowski, Hao Wu, Junpeng Zhan, Kapeel Chougule, Lindsay M. Barone, Cornel Ghiban, Demitri Muna, Andrew C. Olson, Liya C. Wang, Doreen C. Ware, David A. Micklos
PLoSONE 14(10): e0224086, published: October 28, 2019 https://doi.org/10.1371/journal.pone.0224086
The sophistication of gene prediction algorithms and the abundance of RNA-based evidence for the maize genome may suggest that manual curation of gene models is no longer necessary. However, quality metrics generated by the MAKER-P gene annotation pipeline identified 17,225 of 130,330 (13%) protein-coding transcripts in the B73 Reference Genome V4 gene set with models of low concordance to available biological evidence. Working with eight graduate students, we used the Apollo annotation editor to curate 86 transcript models flagged by quality metrics and a complimentary method using the Gramene gene tree visualizer. All of the triaged models had significant errors–including missing or extra exons, non-canonical splice sites, and incorrect UTRs. A correct transcript model existed for about 60% of genes (or transcripts) flagged by quality metrics; we attribute this to the convention of elevating the transcript with the longest coding sequence (CDS) to the canonical, or first, position. The remaining 40% of flagged genes resulted in novel annotations and represent a manual curation space of about 10% of the maize genome (~4,000 protein-coding genes). MAKER-P metrics have a specificity of 100%, and a sensitivity of 85%; the gene tree visualizer has a specificity of 100%. Together with the Apollo graphical editor, our double triage provides an infrastructure to support the community curation of eukaryotic genomes by scientists, students, and potentially even citizen scientists.
The second Maize Annotation Jamboree—the first designed for primarily undergraduate institution (PUI) faculty—was held January 10-11, 2019 at the Scripps Institution of Oceanography in San Diego, CA. The Jamboree was designed to train PUI faculty in the use of genome annotation tools as part of the community curation effort to improve the B73 Zea mays v4 gene models. The ultimate goal is to help faculty integrate maize annotation activities as course-based undergraduate research experiences (CUREs).
Eight applicants participated in this two-day event held adjacent to the annual International Plant and Animal Genome meeting, which half went on to attend.
This year’s participants were affiliated with eight institutions across the United States: Paul Bilinski (West Shore Community College), James Godde (Monmouth College), Chelsey McKenna (College of Southern Nevada), Selene Nikaido (University of Central Missouri), Christos Noutsos (SUNY at Old Westbury), Leocadia Paliulis (Bucknell University), Rebecca Seipelt-Thiemann (Middle Tennessee State University), and Melkamu Woldemariam (The College of New Jersey).
Jamboree instructors: Cristina Fernández-Marco (DNA Learning Center - Cold Spring Harbor Laboratory), Marcela Karey Tello-Ruiz (CSHL), and Raj Singh Khangura (Purdue University).
Three additional researchers participated as observers: Jerome Grimplet from the EU-COST grapevine project and Sally Elgin and Wilson Leung from the Genomics Education Partnership (GEP) at Washington University in St. Louis. Observers not only shared their own knowledge, but may help promote future Jamborees within their own faculty training networks.
Over the next year, we will continue supporting faculty attendees via periodic meetings and providing assistance in developing bioinformatics lessons and wet lab resources that can be implemented in the classroom. As we continue to refine the Jamboree approach we will also use the experience gained to help develop similar efforts to improve genomic annotations in other species of the maize pangenome, sorghum, grape, and other important crops.
The third Maize Annotation Jamboree—the first designed for researchers—was held on March 13-14, 2019 at the Biology Department of Washington University in St. Louis, Missouri. The objective is for the researchers to establish collaborations with PUI faculty and be part of a larger community curation effort to improve the B73 Zea mays v4 gene models.
Thirteen applicants received scholarships to participate in this two-day event followed by the 2019 Maize Genetics Conference (MGC) Conference.
Jamboree participants: Feseha Abebe-Akele (Texas A&M International University), Michael Jochum (Texas A&M University), Abi Gyawali (University of Missouri, Columbia), Waltram Ravelombola (University of Arkansas, Fayetteville), Shailesh Karre Satyanarayana Guptha (North Carolina State University), Ramesh Dhakal (U of A Rice Research and Extension Center), Ghana Challa (University of Illinois at Urbana-Champaign), Penny Kianian (U Minessota), Ying Hu (U Florida), Patrick Monnahan (U Minessota), Erin Baggs (UC Berkeley); David Carlson (Stony Brook), and Juan Antonio Baeza (Clemson University).
Jamboree instructors: Cristina Fernández-Marco (DNA Learning Center - Cold Spring Harbor Laboratory), Marcela Karey Tello-Ruiz (CSHL), and Joshua Stein (CSHL).
This effort will continue via periodic meetings to discuss progress on the partnerships where maize researchers who participated in this event will serve as consultants for PUI faculty and students participating in CUREs.
Special thanks to Sarah Elgin and Wilson Leung of the Genomics Education Partnership, and Patrick Clark of the Biology Department of Washington University in St. Louis for their support with this event.
The first genomic annotation jamboree for the current reference Maize B73 (B73, RefGen_V4) was held on December 4-5, 2017 at Cold Spring Harbor Laboratory (CSHL). Sponsored by the NSF-funded MaizeCode (IOS-1445025) and Gramene (IOS-1127112) projects, the jamboree aimed to engage graduate students in the plant research community in the improvement of the Maize gene models. This event was a proof-of-concept for similar future efforts that will help improving annotations in newly sequences maize inbreeds, sorghum, and other important crops.
This event brought together participants from seven US and one international institution (University of Tokyo), and included underrepresented minorities. We also had a participant from University of Toledo, a primarily undergraduate-serving institution (PUI).
Ten graduate students and one postdoctoral applicant were selected to participate in this two-day event: Fei-Man Hsu (University of Tokyo), Hao Wu (Iowa State University), Kokulapalan Wimalanathan (Iowa State University), Junpeng Zhan (University of Arizona), Michelle Stitzer (UC Davis), Pengfei Qiao (Cornell University), Rachel Wasikowski (University of Toledo), Rachel Wasikowski (Purdie University), Sirjan Sapkota (Clemson Univeristy), and Zach Brenton (Clemson Univeristy). Erin Baggs (UC Berkeley); David Carlson (Stony Brook), and Juan Antonio Baeza (Clemson University).
Jamboree instructor: Monica Munoz-Torres, currently a program manager for the development of scientific software at the Translational and Integrative Sciences Lab, at Oregon State University and former project manager of the Apollo project, a web-based genome annotation editor tool designed to support community-based curation. Munoz-Torres gave an introduction on the importance of community-curation efforts and an in-depth demonstration of Apollo’s capabilities. Students were paired groups and tasked with checking the accuracy of 5 distinct maize gene families: PIN, GH3, ABC, TCP and ORC. From these gene families, "suspicious" MAKER-P-generated annotations were identified based on their annotation edit distance (AED) and quality indexes (QI). Working independently on the same set of models, students described their conclusions and approaches and identified ten genes that needed improvements and will be fed back to MaizeGDB curators as updates.
An additional researcher participated as an observer: Uwe Hilgert, director of STEM training at the BIO5 Institute & Cyverse from the University of Arizona.
Jamboree participants continue to annotate their genes of interest and the group will meet periodically to discuss progress and write an official report that we plan on submitting to an educational journal. In addition, Michelle Stitzer presented a summary of the results of this exercise at the Plant and Animal Genome conference on January 15, 2018.
The Weed to Wonder website and iPad ebook tells the story of how human ingenuity transformed a common Mexican weed (teosinte) into a modern food wonder (maize). Weed to Wonder shows the continuity of research on corn – from Native American agriculturalists to agricultural breeders, corn geneticists, plant physiologists, and molecular biologists – that culminated in the Maize Genome Sequencing Project. The interactive e-book uses over 150 animations, photographs, illustrations, interviews, and a time-lapse video to provide background on the development of maize, from domestication, hybrid vigor, genome sequencing, and transposons, to genetic modification and biofortification of modern maize. The e-book revolves around footage from Mexico, interviews with prominent scientists, and animations of different approaches to sequencing the maize genome.
Researcher & Graduate Student
Training will prepare faculty from PUIs to analyze MaizeCODE with undergraduate students and provide travel awards for graduate students to attend MaizeCODE training at professional meetings.
Undergraduate Faculty & Student
The program will be promote Science, Technology, Engineering and Math (STEM) disciplines by anticipating and encouraging broad participation in primary data analysis by undergraduate and graduate students.