Computational prediction of gene function (or phenotype) can reduce the scale of an experimental problem by focusing attention on a subset of possible experiments. Current function annotation databases-e.g., the Saccharomyces Genome Database (SGD) annotation of genes with Gene Ontology (GO) functions-are critically important resources, but were not designed to host computational predictions Although SGD and several other annotation databases label predictions as such, they provide no measures of confidence. The need exists for quantitative predictions, as distinct from qualitative """"""""somebody said so"""""""" predictions. Probabilistic scoring systems, in which the score communicates the probability of veracity, are likely to be the most useful. We will generate probabilistic predictions by developing probabilistic models for predicting function and phenotype. For reasons of data availability, we use S. cerevisiae and C. elegans as model systems. We will also generate probabilistic models to predict protein and genetic interactions. We will exploit probabilistic networks of protein and genetic interaction in several ways. We will apply ideas from communication theory (2-terminal network reliability) to predict new members of protein complexes from probabilistic protein networks. We will develop computational methods to guide efficient discovery of genetic interactions in S. cerevisiae, as a model for guiding future high-throughput studies in metazoans. We will exploit probabilistic synthetic lethal interaction networks to identify drug mechanism of action. We will disseminate predictions to the broader biomedical community. We propose a distributed quantitative prediction resource inspired by the DAS system of distributed genome annotation. We will adapt previously developed interfaces for browsing, searching, and retrieving probabilistic annotations to enhance their utility.
In Aim 1. we develop, apply, and validate methods for predicting function, phenotype, physical and genetic interaction in S. cerevisiae and C. elegans.
In Aim 2. we exploit probabilistic networks of protein and genetic interaction in S. cerevisiae to elucidate network structure, to guide functional genomic experiments, and to reveal drug mechanism of action.
In Aim, 3. we disseminate probabilistic predictions within a simple, generic, distributed software framework for sharing and browsing quantitative predictions.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG003224-04
Application #
7487999
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Bonazzi, Vivien
Project Start
2005-09-01
Project End
2010-08-31
Budget Start
2008-09-01
Budget End
2009-08-31
Support Year
4
Fiscal Year
2008
Total Cost
$414,441
Indirect Cost
Name
Harvard University
Department
Biochemistry
Type
Schools of Medicine
DUNS #
047006379
City
Boston
State
MA
Country
United States
Zip Code
02115
Ta?an, Murat; Musso, Gabriel; Hao, Tong et al. (2015) Selecting causal genes from genome-wide association studies via functionally coherent subnetworks. Nat Methods 12:154-9
Musso, Gabriel; Tasan, Murat; Mosimann, Christian et al. (2014) Novel cardiovascular gene functions revealed via systematic phenotype prediction in zebrafish. Development 141:224-35
Irimia, Manuel; Weatheritt, Robert J; Ellis, Jonathan D et al. (2014) A highly conserved program of neuronal microexons is misregulated in autistic brains. Cell 159:1511-23
Suzuki, Yo; Stam, Jason; Novotny, Mark et al. (2012) The green monster process for the generation of yeast strains carrying multiple gene deletions. J Vis Exp :e4072
Ta?an, Murat; Drabkin, Harold J; Beaver, John E et al. (2012) A Resource of Quantitative Functional Annotation for Homo sapiens Genes. G3 (Bethesda) 2:223-33
Suk, Kyoungho; Choi, Jihye; Suzuki, Yo et al. (2011) Reconstitution of human RNA interference in budding yeast. Nucleic Acids Res 39:e43
Chua, Hon Nian; Roth, Frederick P (2011) Discovering the targets of drugs via computational systems biology. J Biol Chem 286:23653-8
Suzuki, Yo; St Onge, Robert P; Mani, Ramamurthy et al. (2011) Knocking out multigene redundancies via cycles of sexual assortment and fluorescence selection. Nat Methods 8:159-64
Cokol, Murat; Chua, Hon Nian; Tasan, Murat et al. (2011) Systematic exploration of synergistic drug pairs. Mol Syst Biol 7:544
Cenik, Can; Chua, Hon Nian; Zhang, Hui et al. (2011) Genome analysis reveals interplay between 5'UTR introns and nuclear mRNA export for secretory and mitochondrial genes. PLoS Genet 7:e1001366

Showing the most recent 10 out of 46 publications