The "awesome power of microbial genetics" is based on the ability of geneticists to infer the components and properties of complex biological systems from the phenotypes of mutants. The systematic analysis of microbial phenotypes in thousands of bacterial genomes and metagenomes will be strengthened by a universal system to compare phenotypes across species and strains. The overall goal of this project is to build a such a system for ontology-based annotation of microbial phenotypes that will enable the efficient mining of relevant data to facilitate, and possibly one day automate, hypothesis generation leading to experimental studies. Under the first grant for this project, the Ontology of Microbial Phenotypes (OMP) was built and has so far been used to annotate 30% of the genes in Escherichia coli K-12. The work proposed in this application will develop and deploy a relational database for annotation data storage;continue development of the OMP;continue development of terms in the Evidence Ontology as needed for OMP annotation;develop a pipeline for generation of a synteny-based database of alleles and intraspecies ortholog groups ("pangenes");continue improvement of the OMP web/database infrastructure;continue and expand annotation efforts in E. coli;begin annotation of Saccharomyces cerevisiae;and engage in community outreach. OMP is being developed in the context of multiple projects to improve phenotype annotation across all domains of life. As OMP encompasses some of the best-studied model genetic systems, it is in a position to influence the direction of the entire field o phenotype analysis. This project will stimulate the development of bioinformatics tools for phenotype analysis, which will be useful across all of genetics.

Public Health Relevance

Our ability to use genetics to understand bacteria and fungi relevant to human health is limited by the need to organize and compare information from mutant phenotypes of different organisms. This project will develop controlled vocabularies and standards for describing phenotypes. This system is needed for development of analysis tools that can recognize patterns among different studies and suggest new avenues for understanding microbial contributions to disease and normal health.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
2R01GM089636-04
Application #
8579651
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Ravichandran, Veerasamy
Project Start
2010-04-01
Project End
2017-06-30
Budget Start
2014-09-18
Budget End
2015-06-30
Support Year
4
Fiscal Year
2014
Total Cost
$297,305
Indirect Cost
$14,870
Name
Texas A&M Agrilife Research
Department
Biochemistry
Type
Schools of Earth Sciences/Natur
DUNS #
847205713
City
College Station
State
TX
Country
United States
Zip Code
77843
Chibucos, Marcus C; Mungall, Christopher J; Balakrishnan, Rama et al. (2014) Standardized description of scientific evidence using the Evidence Ontology (ECO). Database (Oxford) 2014: