Much of science consists of discovering and modeling causal relationships that occur in nature. Increasingly big data are being used to drive such discoveries. There is a pressing need for methods that can efficiently infer causal networks from large and diverse types of biomedical data and background knowledge. This center of excellence will develop, implement, and evaluate an integrated set of tools that support causal modeling and discovery (CMD) of biomedical knowledge from very large and complex biomedical data. We also plan to actively share our knowledge, methods, and tools with others, through an innovative set of training and consortium activities. In the past 25 years, there has been tremendous progress in developing general computational methods for representing and discovering causal knowledge from data, based on a representation called causal Bayesian networks (CBNs). These methods have been applied successfully in a wide range of fields, including medicine and biology. While much progress has been made in the development of these computational methods, they are not readily available, sufficiently efficient, nor easy to use by biomedical scientists, and they have not been reconfigured to exploit the increasingly Big Data available for analysis. This Center will make these methods widely available, highly efficient when applied to big datasets, and easy to use. The proposed Center will provide a powerful set of concepts and tools that accelerate the discovery and sharing of causal knowledge derived from very large and complex biomedical datasets. The approaches and products emanating from this center of excellence are likely to have a significant positive impact on our understanding of health and disease, and thereby on the improvement of human health.

Public Health Relevance

This center of excellence will develop, implement, and evaluate an integrated set of tools that support causal modeling and discovery (CMD) of biomedical knowledge from very large and complex biomedical data. The approaches and products emanating from this center of excellence are likely to have a significant positive impact on our understanding of health and disease, and thereby on the improvement of human health.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Specialized Center--Cooperative Agreements (U54)
Project #
1U54HG008540-01
Application #
8775019
Study Section
Special Emphasis Panel ()
Program Officer
Brooks, Lisa
Project Start
2014-09-29
Project End
2018-08-31
Budget Start
2014-09-29
Budget End
2015-04-30
Support Year
1
Fiscal Year
2014
Total Cost
$1,990,213
Indirect Cost
$519,444
Name
University of Pittsburgh
Department
Miscellaneous
Type
Schools of Medicine
DUNS #
004514360
City
Pittsburgh
State
PA
Country
United States
Zip Code
15213
Meyer, Wynn K; Jamison, Jerrica; Richter, Rebecca et al. (2018) Ancient convergent losses of Paraoxonase 1 yield potential risks for modern marine mammals. Science 361:591-594
Naeini, Mahdi Pakdaman; Cooper, Gregory F (2018) Binary Classifier Calibration Using an Ensemble of Piecewise Linear Regression Models. Knowl Inf Syst 54:151-170
Lu, Songjian; Fan, Xiaonan; Chen, Lujia et al. (2018) A novel method of using Deep Belief Networks and genetic perturbation data to search for yeast signaling pathways. PLoS One 13:e0203871
Ponzoni, Luca; Bahar, Ivet (2018) Structural dynamics is a determinant of the functional significance of missense variants. Proc Natl Acad Sci U S A 115:4164-4169
Ding, Michael Q; Chen, Lujia; Cooper, Gregory F et al. (2018) Precision Oncology beyond Targeted Therapy: Combining Omics Data with Machine Learning Matches the Majority of Cancer Cells to Effective Therapeutics. Mol Cancer Res 16:269-278
Sedgewick, Andrew J; Buschur, Kristina; Shi, Ivy et al. (2018) Mixed Graphical Models for Integrative Causal Analysis with Application to Chronic Lung Disease Diagnosis and Prognosis. Bioinformatics :
Andrews, Bryan; Ramsey, Joseph; Cooper, Gregory F (2018) Scoring Bayesian Networks of Mixed Variables. Int J Data Sci Anal 6:3-18
Raghu, Vineet K; Ramsey, Joseph D; Morris, Alison et al. (2018) Comparison of strategies for scalable causal discovery of latent variable models from mixed data. Int J Data Sci Anal 6:33-45
Huang, Biwei; Zhang, Kun; Lin, Yizhu et al. (2018) Generalized Score Functions for Causal Discovery. KDD 2018:1551-1560
Zhang, Kun; Schölkopf, Bernhard; Spirtes, Peter et al. (2018) Learning causality and causality-related learning: some recent progress. Natl Sci Rev 5:26-29

Showing the most recent 10 out of 61 publications