A Community Effort to Translate Protein Data to Knowledge: An Integrated Platform

Ping, Peipei; Lindsey, Merry; Su, Andrew; Watson, Karol

Abstract

The inception of the BD2K Initiative is a testament to the foresight of NIH and our community. Clearly, the future of biomedicine rests on our collective ability to transform Big Data into intelligible scientific facts. In line with the BD2K objectives,our goal is to revolutionize how we address the universal challenge to discern meaning from unruly data. Capitalizing on our investigators'complementary strengths in computational biology and cardiovascular medicine, we will present a fusion of cutting-edge innovations that are grounded in a cardiovascular research focus, encompassing: (i) on-the-cloud data processing, (ii) crowd sourcing and text-mining data annotation, (iii) protein spatiotemporal dynamics, (iv) multi-omic integration, and (v) multiscale clinical data modeling. Drawing from our decade of experience in creating and refining bioinformatics tools, we propose to amalgamate established Big Data resources into a generalizable model for data annotation and collaborative research, through a new query system and cloud infrastructure for accessing multiple omics repositories, and through computational-supported crowdsourcing initiatives for mining the biomedical literature. We propose to interweave diverse data types for revealing biological networks that coalesce from molecular entities at multiple scales, through machine learning methods for structuring molecular data and defining relationships with drugs and diseases, and through novel algorithms for on-the-cloud integration and pathway visualization of multi-dimensional molecular data. Moreover, we propose to innovate advanced modeling tools to resolve protein dynamics and spatiotemporal molecular mechanisms, through mechanistic modeling of protein properties and 3D protein expression maps, and through Bayesian algorithms that correlate patient phenotypes, health histories, and multi-scale molecular profiles. The utility and customizability o our tools to the broader research population is clearly demonstrated using three archetypical workflows that enable annotations of large lists of genes, transcripts, proteins, or metabolites;powerful analysis of complex protein datasets acquired over time;and seamless aQoregation of diverse molecular, textual and literature data. These workflows will be rigorously validated using data from two significant clinical cohorts, the Jackson Heart Study and the Healthy Elderly Longevity (Wellderly). In parallel, a multifaceted strategy will be implemented to educate and train biomedical investigators, and to engage the public for promoting the overall BD2K initiative. We are convinced that a community-driven BD2K initiative will best realize its scientific potential and transform the research culture in a sustainable manner, exhibiting lasting success beyond the current funding period.

Public Health Relevance

The challenges of biomedical Big Data are multifaceted. Biomedical investigators face daunting tasks of storing, analyzing, and distributing large-scale omics data, and aggregating all information to discern mechanistic insights. A coherent effort is required to harness disarrayed Big Data and transform them into intelligible scientific facts, whil engaging the global community via education and outreach programs. This Big Data Science Research proposal is designed to address these challenges by formulating a federated architecture of community-supported tools for enhancing data management, integration and analysis.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Specialized Center--Cooperative Agreements (U54)
Project #: 1U54GM114833-01
Application #: 8774362
Study Section: Special Emphasis Panel (ZRG1-BST-R (52))
Program Officer: Gregurick, Susan

Project Start: 2014-09-29
Project End: 2018-04-30
Budget Start: 2014-09-29
Budget End: 2015-04-30
Support Year: 1
Fiscal Year: 2014
Total Cost: $2,106,052
Indirect Cost: $276,592

Institution

Name: University of California Los Angeles
Department: Physiology
Type: Schools of Medicine
DUNS #: 092530369

City: Los Angeles
State: CA
Country: United States
Zip Code: 90095

Related projects

Publications

Ping, Peipei; Hermjakob, Henning; Polson, Jennifer S et al. (2018) Biomedical Informatics on the Cloud: A Treasure Hunt for Advancing Cardiovascular Medicine. Circ Res 122:1290-1301

Lindsey, Merry L; Jung, Mira; Yabluchanskiy, Andriy et al. (2018) Exogenous CXCL4 Infusion Inhibits Macrophage Phagocytosis by Limiting CD36 Signaling to Enhance Post-myocardial Infarction Cardiac Dilation and Mortality. Cardiovasc Res :

Goldfarb, Dennis; Lafferty, Michael J; Herring, Laura E et al. (2018) Approximating Isotope Distributions of Biomolecule Fragments. ACS Omega 3:11383-11391

Lindsey, Merry L (2018) Reg-ulating macrophage infiltration to alter wound healing following myocardial infarction. Cardiovasc Res 114:1571-1572

DeLeon-Pennell, Kristine Y; Mouton, Alan J; Ero, Osasere K et al. (2018) LXR/RXR signaling and neutrophil phenotype following myocardial infarction classify sex differences in remodeling. Basic Res Cardiol 113:40

Lindsey, Merry L; Kassiri, Zamaneh; Virag, Jitka A I et al. (2018) Guidelines for measuring cardiac physiology in mice. Am J Physiol Heart Circ Physiol 314:H733-H752

Bruggemann, Jacob; Lander, Gabriel C; Su, Andrew I (2018) Exploring applications of crowdsourcing to cryo-EM. J Struct Biol 203:37-45

Lindsey, Merry L; Jung, Mira; Hall, Michael E et al. (2018) Proteomic analysis of the cardiac extracellular matrix: clinical research applications. Expert Rev Proteomics 15:105-112

Mouton, Alan J; Rivera Gonzalez, Osvaldo J; Kaminski, Amanda R et al. (2018) Matrix metalloproteinase-12 as an endogenous resolution promoting factor following myocardial infarction. Pharmacol Res 137:252-258

DeLeon-Pennell, Kristine Y; Iyer, Rugmani Padmanabhan; Ma, Yonggang et al. (2018) The Mouse Heart Attack Research Tool 1.0 database. Am J Physiol Heart Circ Physiol 315:H522-H530

Showing the most recent 10 out of 118 publications

Comments

Be the first to comment on Peipei Ping's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: