Rapid advances in biotechnology and clinical studies have produced an information bottleneck in data processing that has hindered application of such critical scientific data for use in medicine and disease diagnosis. Omicia will develop a novel genetics based informatics infrastructure to overcome this bottleneck that can predict the functional outcome of gene mutations and their relationship to human disease by integrating diverse knowledge bases in a consistent and structured manner. Implementation of the technology will reduce the cost and enhance the effectiveness of genetic association studies, population profiling analyses and pharmacogenomic applications, as well as potentially identify novel gene targets for disease. Our technology will mine classification systems for large number of genes and create statistical associations as well as integrate ontologies to allow for reliable inferences about functions of genes and their relation to disease phenotypes. Importantly, the predictive capabilities of the technology will assign functions to orphan genes and genetic markers for which very little functional information is available. The IT system will consist of a classification of genes into families using shared attributes in a number of well-structured domains to create a gene inference system (GIS) by comparing gene ontologies in GO with disease ontologies in MeSH. It will also use the well-annotated HGMD database to infer novel disease-related function to unannotated polymorphisms such as those in dbSNP. These studies are designed to demonstrate the feasibility of the technology so that it can be validated through future genetic association studies in a Phase II SBIR.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Small Business Innovation Research Grants (SBIR) - Phase I (R43)
Project #
1R43HG003667-01
Application #
6934988
Study Section
Special Emphasis Panel (ZRG1-BDMA (01))
Program Officer
Bonazzi, Vivien
Project Start
2005-05-03
Project End
2005-10-31
Budget Start
2005-05-03
Budget End
2005-10-31
Support Year
1
Fiscal Year
2005
Total Cost
$99,757
Indirect Cost
Name
Omicia, Inc.
Department
Type
DUNS #
148382315
City
Emeryville
State
CA
Country
United States
Zip Code
94608
Hu, Hao; Huff, Chad D; Moore, Barry et al. (2013) VAAST 2.0: improved variant classification and disease-gene identification using a conservation-controlled amino acid substitution matrix. Genet Epidemiol 37:622-34
Yandell, Mark; Huff, Chad; Hu, Hao et al. (2011) A probabilistic disease-gene finder for personal genomes. Genome Res 21:1529-42
Moore, Barry; Hu, Hao; Singleton, Marc et al. (2011) Global analysis of disease-related DNA sequence variation in 10 healthy individuals: implications for whole genome-based clinical diagnostics. Genet Med 13:210-7
Yandell, Mark; Moore, Barry; Salas, Fidel et al. (2008) Genome-wide analysis of human disease alleles reveals that their locations are correlated in paralogous proteins. PLoS Comput Biol 4:e1000218