Decades of experiments have produced vast amounts of data and identified a multitude of molecular processes that underlie specific biological functions directly relevant to human health. However, the potential of these data to inform about human health and disease have not yet been fully realized because publications report results in natural language that is not easily identifiable or computable. To capture and interrogate this wealth of data from the literature, we developed the BioGRID, an open repository for molecular interactions. BioGRID is a widely used resource, with on average over 6,500 unique visitors per month who explore the >360,000 interactions in the database with custom search and visualization tools. In addition, BioGRID data sets are the source of interaction information for a host of partner databases. An analogous challenge exists with the description of models of human disease. While much information is available from years of research in powerful models of human disease, including yeast, nematode, fly, zebrafish and mouse models, the relationship of these models to each other and to human disease has not been systematically organized. In this and other proposals connected through the Linking Animal Models to Human Disease Initiative (LAMHDl), we will undertake a systematic, coordinated effort to expand the BioGRID database through curation of pivotal new data compendia, application of sophisticated new methods for data integration, organization of data into predicted networks, and critically, linkage of networks between model systems and human disease processes. Our curation effort will comprehensively annotate RNAi phenotype data and chemical genetic data, which are crucial for accurate models of human disease and therapeutic intervention in disease, respectively. We will apply data analysis techniques to integrate these and other data across species to link human diseases with all relevant models to predict new features of human disease. We will also develop software tools to allow facile access of the research community to all of these results. Thus, we will enable the biomedical community to access fully comprehensive, integrated datasets across multiple models for hypothesis generation and analysis of human diseases.

Public Health Relevance

(provided by applicant): We will collect a unique and extensive set of protein and gene interactions from models that are relevant to human disease, as well as their interactions with chemicals (drugs) and their effect on specific functions. These data will allow the prediction of new disease network functions using specialized algorithms, which will lead to a better understanding of human disease and facilitate the discovery of new drugs.

Agency
National Institute of Health (NIH)
Institute
National Center for Research Resources (NCRR)
Type
Resource-Related Research Projects (R24)
Project #
1R24RR032659-01
Application #
8215398
Study Section
National Center for Research Resources Initial Review Group (RIRG)
Program Officer
Watson, Harold L
Project Start
2011-09-14
Project End
2015-07-31
Budget Start
2011-09-14
Budget End
2012-07-31
Support Year
1
Fiscal Year
2011
Total Cost
$674,775
Indirect Cost
Name
Princeton University
Department
Type
Organized Research Units
DUNS #
002484665
City
Princeton
State
NJ
Country
United States
Zip Code
08544
Chatr-Aryamontri, Andrew; Breitkreutz, Bobby-Joe; Heinicke, Sven et al. (2013) The BioGRID interaction database: 2013 update. Nucleic Acids Res 41:D816-23
Dolinski, Kara; Chatr-Aryamontri, Andrew; Tyers, Mike (2013) Systematic curation of protein and genetic interaction data for computable biology. BMC Biol 11:43
Sadowski, Ivan; Breitkreutz, Bobby-Joe; Stark, Chris et al. (2013) The PhosphoGRID Saccharomyces cerevisiae protein phosphorylation site database: version 2.0 update. Database (Oxford) 2013:bat026
Louie, Raymond J; Guo, Jingyu; Rodgers, John W et al. (2012) A yeast phenomic model for the gene interaction network modulating CFTR-?F508 protein biogenesis. Genome Med 4:103
Wong, Aaron K; Park, Christopher Y; Greene, Casey S et al. (2012) IMP: a multi-species functional genomics portal for integration, visualization and prediction of protein functions and networks. Nucleic Acids Res 40:W484-90
Krallinger, Martin; Leitner, Florian; Vazquez, Miguel et al. (2012) How to link ontologies and protein-protein interactions to literature: text-mining approaches and the BioCreative experience. Database (Oxford) 2012:bas017