Medicine, agriculture, and other biology-related industries increasingly depend on the information contained in genomic DNA. Advancing our understanding of how genes are structured and regulated will eventually lead to novel therapeutics for combating cancer and other diseases, to cheaper and more nutritious food, and to less wasteful materials and energy sources. Sequencing genomes is now a relatively straightforward task. Annotating them, however, is proving to be a much more difficult one, especially for eukaryotic genomes. The gene catalogs of all but the simplest eukaryotic genomes are still incomplete, even for well-known organisms with a long history of genetics and molecular biology. Because annotations are the focal point for many kinds of research and technological applications, it is essential that they be correct. Incomplete and incorrect annotations poison every experiment that employs them. We believe the key to improving genome annotation lies in better software for the creation and quality control of genome annotations. This proposal describes our design for a system we call GenomeInvestigator. GenomeInvestigator consists of three components: MAKER, EVALUATOR and VERIFIER. MAKER creates genome annotations, EVALUATOR performs quality control analyses on extant annotations, and VERIFIER automatically designs experiments to problematic portions of annotations. By analogy to genome sequencing, MAKER and EVALUATOR produce draft annotation with quality values, and VERIFIER directs annotation finishing efforts. GenomeInvestigator is designed to be easily portable and will be freely available. Its outputs will be Sequence Ontology compliant and GMOD compatible. Medicine, agriculture, and other biology-related industries increasingly depend on the information contained in genomic DNA. Advancing our understanding of how genes are structured and regulated will eventually lead to novel therapeutics for combating cancer and other diseases, to cheaper and more nutritious food, and to less wasteful materials and energy sources.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG004694-02
Application #
7636754
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Bonazzi, Vivien
Project Start
2008-06-12
Project End
2012-04-30
Budget Start
2009-05-01
Budget End
2010-04-30
Support Year
2
Fiscal Year
2009
Total Cost
$547,378
Indirect Cost
Name
University of Utah
Department
Genetics
Type
Schools of Medicine
DUNS #
009095365
City
Salt Lake City
State
UT
Country
United States
Zip Code
84112
Domyan, Eric T; Guernsey, Michael W; Kronenberg, Zev et al. (2014) Epistatic and combinatorial effects of pigmentary gene mutations in the domestic pigeon. Curr Biol 24:459-64
Campbell, Michael S; Holt, Carson; Moore, Barry et al. (2014) Genome Annotation and Curation Using MAKER and MAKER-P. Curr Protoc Bioinformatics 48:4.11.1-39
Shapiro, Michael D; Kronenberg, Zev; Li, Cai et al. (2013) Genomic diversity and evolution of the head crest in the rock pigeon. Science 339:1063-7
Smith, Jeramiah J; Kuraku, Shigehiro; Holt, Carson et al. (2013) Sequencing of the sea lamprey (Petromyzon marinus) genome provides insights into vertebrate evolution. Nat Genet 45:415-21, 421e1-2
Kapusta, Aurélie; Kronenberg, Zev; Lynch, Vincent J et al. (2013) Transposable elements are major contributors to the origin, diversification, and regulation of vertebrate long noncoding RNAs. PLoS Genet 9:e1003470
Yandell, Mark; Ence, Daniel (2012) A beginner's guide to eukaryotic genome annotation. Nat Rev Genet 13:329-42
Vieler, Astrid; Wu, Guangxi; Tsai, Chia-Hong et al. (2012) Genome, functional gene annotation, and nuclear transformation of the heterokont oleaginous alga Nannochloropsis oceanica CCMP1779. PLoS Genet 8:e1003064
Suen, Garret; Teiling, Clotilde; Li, Lewyn et al. (2011) The genome sequence of the leaf-cutter ant Atta cephalotes reveals insights into its obligate symbiotic lifestyle. PLoS Genet 7:e1002007
Holt, Carson; Yandell, Mark (2011) MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinformatics 12:491
Smith, Christopher D; Zimin, Aleksey; Holt, Carson et al. (2011) Draft genome of the globally widespread and invasive Argentine ant (Linepithema humile). Proc Natl Acad Sci U S A 108:5673-8

Showing the most recent 10 out of 14 publications