We will establish our Center for Causal Modeling and Discovery (CCMD) of Biomedical Knowledge from Big Data as a collaboration among the University of Pittsburgh (Pitt;lead institution), Carnegie Mellon University (CMU), and Yale University (Yale). The CCMD will develop, validate, and disseminate methods, tools, and software based on Causal Bayesian Networks, which will enable the broader scientific community to effectively interrogate large imaging, genomic, and clinical (phenotype) data and derive knowledge on the causality of observed phenomena. We selected three Driving Biomedical Problems (DBPs) on cancer pathways, lung diseases, and brain fMRI as a platform for algorithm and software development. Our overarching goal is to develop and enable the broad usage of, a complete suite of interoperable software'and application programming interface that will help advance BD2K research and education. The activities of the CCMD will be organized in three major components. Data Science Research (DSR), Training and Consortium. The CCMD will be co-led by 3 PIs, each of whom will co-lead an activity component, in consultation with the Center Executive Committee. The Administrative Core (AdminCore) will oversee the overall operation of the CCMD to ensure productive interaction between the teams at the participating institutions, effective integration of the DSR and training activities of the CCMD, efficient development of a computational tools for CMD that will benefit the broader research community, and training of next generation of data scientists equipped with CMD skills necessary to tackle a variety of BD2K problems. The AdminCore will also continually monitor and evaluate the progress made in the three CCMD components, making strategic plans to increase the utility and quality of research, training, and collaborative activities. We will use a detailed logic model and associated metrics as its evaluation framework and will consult with the Center Internal and External Advisory Boards. The AdminCore will be responsible for ensuring access to adequate infrastructure and resources for efficient development and dissemination of CMD tools to benefit the data science and biomedical research communities, in particular the other BD2K Centers.

Public Health Relevance

The Administrative Core (AdminCore) will oversee the overall operation of the CCMD to ensure productive interaction between the teams at the participating institutions, effective integration of the DSR and training activities of the CCMD, efficient development of a computational tools for CMD that will benefit the broader research community, and training of next generation of data scientists to tackle a variety of BD2K problems.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Specialized Center--Cooperative Agreements (U54)
Project #
1U54HG008540-01
Application #
8932080
Study Section
Special Emphasis Panel (ZRG1-BST-R (52))
Program Officer
Brooks, Lisa
Project Start
2014-09-29
Project End
2018-08-31
Budget Start
2014-07-01
Budget End
2015-06-30
Support Year
1
Fiscal Year
2014
Total Cost
$99,855
Indirect Cost
$41,447
Name
University of Pittsburgh
Department
Type
DUNS #
004514360
City
Pittsburgh
State
PA
Country
United States
Zip Code
15213
Meyer, Wynn K; Jamison, Jerrica; Richter, Rebecca et al. (2018) Ancient convergent losses of Paraoxonase 1 yield potential risks for modern marine mammals. Science 361:591-594
Naeini, Mahdi Pakdaman; Cooper, Gregory F (2018) Binary Classifier Calibration Using an Ensemble of Piecewise Linear Regression Models. Knowl Inf Syst 54:151-170
Lu, Songjian; Fan, Xiaonan; Chen, Lujia et al. (2018) A novel method of using Deep Belief Networks and genetic perturbation data to search for yeast signaling pathways. PLoS One 13:e0203871
Ponzoni, Luca; Bahar, Ivet (2018) Structural dynamics is a determinant of the functional significance of missense variants. Proc Natl Acad Sci U S A 115:4164-4169
Ding, Michael Q; Chen, Lujia; Cooper, Gregory F et al. (2018) Precision Oncology beyond Targeted Therapy: Combining Omics Data with Machine Learning Matches the Majority of Cancer Cells to Effective Therapeutics. Mol Cancer Res 16:269-278
Sedgewick, Andrew J; Buschur, Kristina; Shi, Ivy et al. (2018) Mixed Graphical Models for Integrative Causal Analysis with Application to Chronic Lung Disease Diagnosis and Prognosis. Bioinformatics :
Andrews, Bryan; Ramsey, Joseph; Cooper, Gregory F (2018) Scoring Bayesian Networks of Mixed Variables. Int J Data Sci Anal 6:3-18
Raghu, Vineet K; Ramsey, Joseph D; Morris, Alison et al. (2018) Comparison of strategies for scalable causal discovery of latent variable models from mixed data. Int J Data Sci Anal 6:33-45
Huang, Biwei; Zhang, Kun; Lin, Yizhu et al. (2018) Generalized Score Functions for Causal Discovery. KDD 2018:1551-1560
Zhang, Kun; Schölkopf, Bernhard; Spirtes, Peter et al. (2018) Learning causality and causality-related learning: some recent progress. Natl Sci Rev 5:26-29

Showing the most recent 10 out of 61 publications