The awesome power of microbial genetics is based on the ability of geneticists to infer the components and properties of complex biological systems from the phenotypes of mutants. The systematic analysis of microbial phenotypes in thousands of bacterial genomes and metagenomes will be strengthened by a universal system to compare phenotypes across species and strains. The overall goal of this project is to build a such a system for ontology-based annotation of microbial phenotypes that will enable the efficient mining of relevant data to facilitate, and possibly one day automate, hypothesis generation leading to experimental studies. Under the first grant for this project, the Ontology of Microbial Phenotypes (OMP) was built and has so far been used to annotate 30% of the genes in Escherichia coli K-12. The work proposed in this application will develop and deploy a relational database for annotation data storage; continue development of the OMP; continue development of terms in the Evidence Ontology as needed for OMP annotation; develop a pipeline for generation of a synteny-based database of alleles and intraspecies ortholog groups (pangenes); continue improvement of the OMP web/database infrastructure; continue and expand annotation efforts in E. coli; begin annotation of Saccharomyces cerevisiae; and engage in community outreach. OMP is being developed in the context of multiple projects to improve phenotype annotation across all domains of life. As OMP encompasses some of the best-studied model genetic systems, it is in a position to influence the direction of the entire field o phenotype analysis. This project will stimulate the development of bioinformatics tools for phenotype analysis, which will be useful across all of genetics.

Public Health Relevance

Our ability to use genetics to understand bacteria and fungi relevant to human health is limited by the need to organize and compare information from mutant phenotypes of different organisms. This project will develop controlled vocabularies and standards for describing phenotypes. This system is needed for development of analysis tools that can recognize patterns among different studies and suggest new avenues for understanding microbial contributions to disease and normal health.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM089636-05
Application #
8929255
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Ravichandran, Veerasamy
Project Start
2010-04-01
Project End
2017-06-30
Budget Start
2015-07-01
Budget End
2016-06-30
Support Year
5
Fiscal Year
2015
Total Cost
Indirect Cost
Name
Texas A&M Agrilife Research
Department
Biochemistry
Type
Earth Sciences/Resources
DUNS #
847205713
City
College Station
State
TX
Country
United States
Zip Code
77843
Gaudet, Pascale; Škunca, Nives; Hu, James C et al. (2017) Primer on the Gene Ontology. Methods Mol Biol 1446:25-37
Chibucos, Marcus C; Siegele, Deborah A; Hu, James C et al. (2017) The Evidence and Conclusion Ontology (ECO): Supporting GO Annotations. Methods Mol Biol 1446:245-259
Nehring, Ralf B; Gu, Franklin; Lin, Hsin-Yu et al. (2016) An ultra-dense library resource for rapid deconvolution of mutations that cause phenotypes in Escherichia coli. Nucleic Acids Res 44:e41
Bastian, Frederic B; Chibucos, Marcus C; Gaudet, Pascale et al. (2015) The Confidence Information Ontology: a step towards a standard for asserting confidence in annotations. Database (Oxford) 2015:bav043
Chibucos, Marcus C; Mungall, Christopher J; Balakrishnan, Rama et al. (2014) Standardized description of scientific evidence using the Evidence Ontology (ECO). Database (Oxford) 2014:
Chibucos, Marcus C; Zweifel, Adrienne E; Herrera, Jonathan C et al. (2014) An ontology for microbial phenotypes. BMC Microbiol 14:294
Brister, J Rodney; Le Mercier, Phillippe; Hu, James C (2012) Microbial virus genome annotation-mustering the troops to fight the sequence onslaught. Virology 434:175-80