Mounting amounts of diverse biomedical data have been generated. Extracting meaningful information from these datasets has relied on the efforts of informaticians, who are extensively trained in the computer science realm, with little to no training in biology. Similarly, biologists in general are not proficient to analyze, annotate, and translate their large datasets into valuable biomedical insights. In addition, there has been an overall lack of public understanding for the importance of Big Data science, hindering the enthusiasm to advance data science in the biomedical field. To bridge the gaps that exist among data generation, interpretation and awareness, our training program will provide critical data science education to current biomedical researchers, expand the data science workforce in the biomedical field, and elicit a broad public recognition of data science. Accordingly, we have engineered an integrated training program with four specific aims: 1) To empower current biomedical researchers with the ability to manage and interpret Big Data by gaining proficiency in utilizing data science software tools;2) To utilize the training component as an interactive testing field for software packages developed by the Data Science Research (DSR) component. User critiques/feedback will refine and transform software tools to a professional grade, facilitating the community to capture the full value of Big Data;3) To cultivate a new generation of developers with transdisciplinary expertise in both computational biology and biomedical informatics;and 4) To heighten public awareness of and enthusiasm for the substantial opportunities embedded within computational biology, which has the potential to transform biomedical research and medicine. To achieve these aims, we have constructed three trainee-oriented modules: Biomedical Researcher /User-Oriented Module, Big Data Science Researcher-Oriented Module, and General Public-Oriented Module. A trans-institutional collaboration has been organized (i.e., UCLA, TSRI, UMMC, and EMBL-EBI), and all components have demonstrated outstanding track records in education. This collaboration will ensure successful execution of the training component substantiated by distinguished experts and meritorious educators from a wide breadth of disciplines, spanning -omics, bioinformatics, and computational science.

Public Health Relevance

The challenges of biomedical Big Data are multifaceted. Advances in biomedical sciences using Big Data will require an adequate workforce with the appropriate data science expertise and skills, including those in computational biology, biomedical informatics, and related areas. Users of Big Data software tools and resources must be trained to use them well. This Training Component is designed to address these challenges.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Specialized Center--Cooperative Agreements (U54)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-BST-R (52))
Program Officer
Lyster, Peter
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of California Los Angeles
Los Angeles
United States
Zip Code
Bruggemann, Jacob; Lander, Gabriel C; Su, Andrew I (2018) Exploring applications of crowdsourcing to cryo-EM. J Struct Biol 203:37-45
Lindsey, Merry L; Jung, Mira; Hall, Michael E et al. (2018) Proteomic analysis of the cardiac extracellular matrix: clinical research applications. Expert Rev Proteomics 15:105-112
Mouton, Alan J; Rivera Gonzalez, Osvaldo J; Kaminski, Amanda R et al. (2018) Matrix metalloproteinase-12 as an endogenous resolution promoting factor following myocardial infarction. Pharmacol Res 137:252-258
DeLeon-Pennell, Kristine Y; Iyer, Rugmani Padmanabhan; Ma, Yonggang et al. (2018) The Mouse Heart Attack Research Tool 1.0 database. Am J Physiol Heart Circ Physiol 315:H522-H530
Bloksgaard, Maria; Lindsey, Merry L; Martinez-Lemus, Luis A (2018) Extracellular Matrix in Cardiovascular Pathophysiology. Am J Physiol Heart Circ Physiol :
McClatchy, Daniel B; Ma, Yuanhui; Liem, David A et al. (2018) Quantitative temporal analysis of protein dynamics in cardiac remodeling. J Mol Cell Cardiol 121:163-172
Lindsey, Merry L (2018) Assigning matrix metalloproteinase roles in ischaemic cardiac remodelling. Nat Rev Cardiol 15:471-479
Lindsey, Merry L; Mouton, Alan J; Ma, Yonggang (2018) Adding Reg3? to the acute coronary syndrome prognostic marker list. Int J Cardiol 258:24-25
Brooks, Heddwen L; Lindsey, Merry L (2018) Guidelines for authors and reviewers on antibody use in physiology studies. Am J Physiol Heart Circ Physiol 314:H724-H732
Yates 3rd, John R (2018) Content Is King: Databases Preserve the Collective Information of Science. J Biomol Tech 29:1-3

Showing the most recent 10 out of 118 publications