Systematic data curation and integration to link models of human disease

Dolinski, Kara

Abstract

Decades of experiments have produced vast amounts of data and identified a multitude of molecular processes that underlie specific biological functions directly relevant to human health. However, the potential of these data to inform about human health and disease have not yet been fully realized because publications report results in natural language that is not easily identifiable or computable. To capture and interrogate this wealth of data from the literature, we developed the BioGRID, an open repository for molecular interactions. BioGRID is a widely used resource, with on average over 6,500 unique visitors per month who explore the >360,000 interactions in the database with custom search and visualization tools. In addition, BioGRID data sets are the source of interaction information for a host of partner databases. An analogous challenge exists with the description of models of human disease. While much information is available from years of research in powerful models of human disease, including yeast, nematode, fly, zebrafish and mouse models, the relationship of these models to each other and to human disease has not been systematically organized. In this and other proposals connected through the Linking Animal Models to Human Disease Initiative (LAMHDl), we will undertake a systematic, coordinated effort to expand the BioGRID database through curation of pivotal new data compendia, application of sophisticated new methods for data integration, organization of data into predicted networks, and critically, linkage of networks between model systems and human disease processes. Our curation effort will comprehensively annotate RNAi phenotype data and chemical genetic data, which are crucial for accurate models of human disease and therapeutic intervention in disease, respectively. We will apply data analysis techniques to integrate these and other data across species to link human diseases with all relevant models to predict new features of human disease. We will also develop software tools to allow facile access of the research community to all of these results. Thus, we will enable the biomedical community to access fully comprehensive, integrated datasets across multiple models for hypothesis generation and analysis of human diseases.

Public Health Relevance

(provided by applicant): We will collect a unique and extensive set of protein and gene interactions from models that are relevant to human disease, as well as their interactions with chemicals (drugs) and their effect on specific functions. These data will allow the prediction of new disease network functions using specialized algorithms, which will lead to a better understanding of human disease and facilitate the discovery of new drugs.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Center for Research Resources (NCRR)
Type: Resource-Related Research Projects (R24)
Project #: 1R24RR032659-01
Application #: 8215398
Study Section: National Center for Research Resources Initial Review Group (RIRG)
Program Officer: Watson, Harold L

Project Start: 2011-09-14
Project End: 2015-07-31
Budget Start: 2011-09-14
Budget End: 2012-07-31
Support Year: 1
Fiscal Year: 2011
Total Cost: $674,775
Indirect Cost

Institution

Name: Princeton University
Department
Type: Organized Research Units
DUNS #: 002484665

City: Princeton
State: NJ
Country: United States
Zip Code: 08544

Publications

Chatr-Aryamontri, Andrew; Breitkreutz, Bobby-Joe; Heinicke, Sven et al. (2013) The BioGRID interaction database: 2013 update. Nucleic Acids Res 41:D816-23

Dolinski, Kara; Chatr-Aryamontri, Andrew; Tyers, Mike (2013) Systematic curation of protein and genetic interaction data for computable biology. BMC Biol 11:43

Sadowski, Ivan; Breitkreutz, Bobby-Joe; Stark, Chris et al. (2013) The PhosphoGRID Saccharomyces cerevisiae protein phosphorylation site database: version 2.0 update. Database (Oxford) 2013:bat026

Wong, Aaron K; Park, Christopher Y; Greene, Casey S et al. (2012) IMP: a multi-species functional genomics portal for integration, visualization and prediction of protein functions and networks. Nucleic Acids Res 40:W484-90

Krallinger, Martin; Leitner, Florian; Vazquez, Miguel et al. (2012) How to link ontologies and protein-protein interactions to literature: text-mining approaches and the BioCreative experience. Database (Oxford) 2012:bas017

Louie, Raymond J; Guo, Jingyu; Rodgers, John W et al. (2012) A yeast phenomic model for the gene interaction network modulating CFTR-?F508 protein biogenesis. Genome Med 4:103

Comments

Be the first to comment on Kara Dolinski's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: