The Consortium Activities Component (CAC) of the Center for Causal Modeling and Discovery (CCMD) of Biomedical Knowledge from Big Data will ensure the Pittsburgh Center's participation and integration into all of the BD2K Center Consortium Activities. With highly collaborative leaders in the fields biomedical informatics, computational and systems biology, philosophy, computer science, supercomputing, statistical genetics, cancer research, fMRI, and lung disease, our CCMD team will serve as a broad interface into many disciplines in data science, biomedicine, and beyond. As part of the Consortium, CCMD will contribute to the achievement of the following BD2K goals: 1) Disseminate data and tools developed by the CCMD through the Consortium;2) Extend data- and software-sharing policies and practices through collaboration with other Consortium sites;3) Develop new methods to analyze big data that integrate with other Consortium sites, such as those proposed in Intra-Consortium Project 1 with Harvard;and 4) Use standards-based metadata to describe the data consumed by the tools of CCMD, such as those proposed in Intra-Consortium Project 2 with Stanford. Part of our approach to achieving these goals will be the deployment of a Technical Catalyst whose main responsibility will be to spend time learning with and from the other funded consortium sites. A key responsibility for this individual will be the production of CCMD technical updates describing how CCMD integration and interoperability will be accomplished after careful study during quarterly, rotating site visits of the other funded BD2K Centers. As part of our commitment to the BD2K Consortium, the CCMD will participate in all Consortium meetings and in all of the Consortium subcommittees, such as data sharing, publication, regulatory, evaluation, and others created by the Steering Committee. We will participate in the development of and abide by all policies set by these committees. Through this CCMD component, we will contribute to the change in research culture that BD2K has as its central goal.

Public Health Relevance

CAC will help accomplish the BD2K goals by integrating CCMD tools with other Consortium Centers through participation in Annual Consortium meetings, our Technical Catalyst Program, our Scientific Catalyst Programs and innovative Intra-Consortium projects focused on standards based metadata to promote interoperability and facilitate integration of novel CCMD algorithms ot analyze complex biomedical data sets.

National Institute of Health (NIH)
National Human Genome Research Institute (NHGRI)
Specialized Center--Cooperative Agreements (U54)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-BST-R (52))
Program Officer
Brooks, Lisa
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Pittsburgh
United States
Zip Code
Raghu, Vineet K; Ramsey, Joseph D; Morris, Alison et al. (2018) Comparison of strategies for scalable causal discovery of latent variable models from mixed data. Int J Data Sci Anal 6:33-45
Huang, Biwei; Zhang, Kun; Lin, Yizhu et al. (2018) Generalized Score Functions for Causal Discovery. KDD 2018:1551-1560
Zhang, Kun; Schölkopf, Bernhard; Spirtes, Peter et al. (2018) Learning causality and causality-related learning: some recent progress. Natl Sci Rev 5:26-29
Meyer, Wynn K; Jamison, Jerrica; Richter, Rebecca et al. (2018) Ancient convergent losses of Paraoxonase 1 yield potential risks for modern marine mammals. Science 361:591-594
Naeini, Mahdi Pakdaman; Cooper, Gregory F (2018) Binary Classifier Calibration Using an Ensemble of Piecewise Linear Regression Models. Knowl Inf Syst 54:151-170
Lu, Songjian; Fan, Xiaonan; Chen, Lujia et al. (2018) A novel method of using Deep Belief Networks and genetic perturbation data to search for yeast signaling pathways. PLoS One 13:e0203871
Ponzoni, Luca; Bahar, Ivet (2018) Structural dynamics is a determinant of the functional significance of missense variants. Proc Natl Acad Sci U S A 115:4164-4169
Ding, Michael Q; Chen, Lujia; Cooper, Gregory F et al. (2018) Precision Oncology beyond Targeted Therapy: Combining Omics Data with Machine Learning Matches the Majority of Cancer Cells to Effective Therapeutics. Mol Cancer Res 16:269-278
Sedgewick, Andrew J; Buschur, Kristina; Shi, Ivy et al. (2018) Mixed Graphical Models for Integrative Causal Analysis with Application to Chronic Lung Disease Diagnosis and Prognosis. Bioinformatics :
Andrews, Bryan; Ramsey, Joseph; Cooper, Gregory F (2018) Scoring Bayesian Networks of Mixed Variables. Int J Data Sci Anal 6:3-18

Showing the most recent 10 out of 61 publications