BioGRID: An Open Integrated Resource for Biological Interaction Data

Tyers, Michael

Abstract

Complex protein and genetic interaction networks determine the properties of all biological systems and underlie human development, health and disease. Decades of biochemical, genetic and molecular biological experiments have identified myriad molecular processes that underpin specific biological functions, as documented in the primary biomedical literature. Recent technological innovations combined with complete genome sequence information have enabled a host of high-throughput (HTP) methods to generate protein and genetic interaction data on an unprecedented scale. Because human interaction networks are often directly analogous to networks in tractable model organisms, it is essential that the hundreds of thousands of biological interactions discovered across the major model organisms, as well as humans, are archived in a well-annotated manner that is amenable to rigorous analysis and computation. To capture, integrate, and interrogate this wealth of data from both the literature and HTP datasets, we developed the BioGRID database as an open repository for protein and genetic interactions (www.thebiogrid.org). BioGRID is widely used by the biological and biomedical research community, with, on average, over 16,555 unique visitors per month in 2015. Using the search and visualization tools in BioGRID, these users explore the 971,027 total interactions that our curators have directly traced to experimental data in 45,603 publications. In addition, the unique datasets in BioGRID are disseminated widely by a host of partner databases, meta-databases, and applications. Here, we propose to markedly enhance the data content, the database architecture, and the user interface of BioGRID. We will expand the amount and types of data available through BioGRID, with a focus on interactions of central biological processes that are frequently perturbed in human disease.
We will use new ontologies to systematically capture new data types, including CRISPR-based genetic interactions, structured phenotypes across all species, chemical and drug interactions, and post-translational modifications. Text-mining algorithms will be incorporated into the curation pipeline to enhance curation rates, and thereby substantially expand the coverage of the database. User access to the large datasets in BioGRID will be facilitated by data-rich interfaces, user-defined search and display parameters, and multiple methods of visualization. All software will continue to be open source and engineered toward compatibility and complementarity with other academic database and software development efforts. The BioGRID will provide interaction data and software tools to model organism databases and other interested parties without restriction. The BioGRID resource will enable the biomedical research community to access validated biological interaction datasets across model organisms and humans for hypothesis generation and network analysis, and thereby further the general mission of the NIH.

Public Health Relevance

The BioGRID database is a comprehensive resource that provides protein and genetic interaction data for the major model organism species and humans, along with user-oriented tools to explore this information. The BioGRID facilitates better understanding of human disease by enabling inference of gene and protein function through network context and the computational comparison of these gene and protein networks in human health and disease to analogous networks mapped in model organisms. The large amounts of data in the BioGRID are freely provided to many other databases and users, thus facilitating both fundamental and translational research.

Funding Agency

Agency: National Institutes of Health (NIH)
Institute: Office of The Director, National Institutes of Health (OD)
Type: Research Project (R01)
Project #: 5R01OD010929-13
Application #: 9745709
Study Section: Biodata Management and Analysis Study Section (BDMA)
Program Officer: Watson, Harold L

Project Start: 2007-05-15
Project End: 2021-05-31
Budget Start: 2019-06-01
Budget End: 2020-05-31
Support Year: 13
Fiscal Year: 2019
Total Cost
Indirect Cost

Institution

Name: Sinai Health System
Department
Type
DUNS #: 208808949

City: Toronto
State: ON
Country: Canada
Zip Code: M5 1X5

Related projects


NIH 2020 R01 OD	BioGRID: An Open Integrated Resource for Biological Interaction Data Tyers, Michael David / Sinai Health System
NIH 2019 R01 OD	BioGRID: An Open Integrated Resource for Biological Interaction Data Tyers, Michael David / Sinai Health System
NIH 2018 R01 OD	BioGRID: An Open Integrated Resource for Biological Interaction Data Tyers, Michael David / Sinai Health System
NIH 2017 R01 OD	BioGRID: An Open Integrated Resource for Biological Interaction Data Tyers, Michael David / Sinai Health System	$805,815
NIH 2016 R01 OD	BioGRID: An Open Integrated Resource for Biological Interaction Data Tyers, Michael David / Sinai Health System
NIH 2015 R01 OD	BioGRID: An Open Integrated Resource for Biological Interaction Data Tyers, Michael David / MT Sinai Hosp-Samuel Lunenfeld Research Institute	$860,238
NIH 2014 R01 OD	BioGRID: An Open Integrated Resource for Biological Interaction Data Tyers, Michael David / MT Sinai Hosp-Samuel Lunenfeld Research Institute	$877,795
NIH 2013 R01 OD	BioGRID: An Open Integrated Resource for Biological Interaction Data Tyers, Michael David / MT Sinai Hosp-Samuel Lunenfeld Research Institute	$834,783
NIH 2012 R01 OD	BioGRID: An Open Integrated Resource for Biological Interaction Data Tyers, Michael David / MT Sinai Hosp-Samuel Lunenfeld Research Institute	$877,795

Publications

Bertomeu, Thierry; Coulombe-Huntington, Jasmin; Chatr-Aryamontri, Andrew et al. (2018) A High-Resolution Genome-Wide CRISPR/Cas9 Viability Screen Reveals Structural Features and Contextual Diversity of the Human Cell-Essential Proteome. Mol Cell Biol 38:

Chatr-Aryamontri, Andrew; Oughtred, Rose; Boucher, Lorrie et al. (2017) The BioGRID interaction database: 2017 update. Nucleic Acids Res 45:D369-D379

Courcelles, Mathieu; Coulombe-Huntington, Jasmin; Cossette, Émilie et al. (2017) CLMSVault: A Software Suite for Protein Cross-Linking Mass-Spectrometry Data Analysis and Visualization. J Proteome Res 16:2645-2652

Schapira, Matthieu; Tyers, Mike; Torrent, Maricel et al. (2017) WD40 repeat domain proteins: a novel target class? Nat Rev Drug Discov 16:773-786

Kanshin, Evgeny; Giguère, Sébastien; Jing, Cheng et al. (2017) Machine Learning of Global Phosphoproteomic Profiles Enables Discrimination of Direct versus Indirect Kinase Substrates. Mol Cell Proteomics 16:786-798

Islamaj Dogan, Rezarta; Kim, Sun; Chatr-Aryamontri, Andrew et al. (2017) The BioC-BioGRID corpus: full text articles annotated for curation of protein-protein and genetic interactions. Database (Oxford) 2017:

Wildenhain, Jan; Spitzer, Michaela; Dolma, Sonam et al. (2016) Systematic chemical-genetic and chemical-chemical interaction datasets for prediction of compound synergism. Sci Data 3:160095

Dolma, Sonam; Selvadurai, Hayden J; Lan, Xiaoyang et al. (2016) Inhibition of Dopamine Receptor D4 Impedes Autophagic Flux, Proliferation, and Survival of Glioblastoma Stem Cells. Cancer Cell 29:859-873

Oughtred, Rose; Chatr-Aryamontri, Andrew; Breitkreutz, Bobby-Joe et al. (2016) Use of the BioGRID Database for Analysis of Yeast Protein and Genetic Interactions. Cold Spring Harb Protoc 2016:pdb.prot088880

Kim, Sun; Islamaj Doğan, Rezarta; Chatr-Aryamontri, Andrew et al. (2016) BioCreative V BioC track overview: collaborative biocurator assistant task for BioGRID. Database (Oxford) 2016:

Showing the most recent 10 out of 29 publications
