Much of science, including biomedical science, consists of discovering and modeling causal relationships. Increasingly, biomedical scientists have available multiple complex data types and a very large number of samples, each of which has an enormous number of measurements recorded.. There is a pressing need for algorithms that can efficiently discover causal relationships from large and diverse types of biomedical data and background knowledge. In the past 25 years, tremendous progress has been made in developing general computational methods for representing and discovering causal knowledge from data. However, these methods are not readily available, nor easy to use by biomedical scientists, and they have not been designed to exploit the increasingly Big Data available for analysis. The proposed Center will create a computer ecosystem through which to implement and apply an integrated set of tools, new and repurposed, that support the representation and discovery of causal knowledge from large and complex biomedical data. These computational approaches will be accessible to a wide variety of biomedical researchers, data analysts, and data scientists who might not otherwise take advantage of them. Three very different biomedical problems will drive the development of the methods, tools, and interactive system architecture. While we anticipate that new biomedical discoveries will be made in each of these problem areas using the methods developed by the Center, the longer-term impact will result from the development of the computational technology itself, which will be generalizable to the full spectrum of biomedical research. The Center will be very active in the sharing of these knowledge, methods, and tools through a rich offering of training activities and through engagement with other Centers in the consortium.

Public Health Relevance

There is a pressing need for new computational methods that can assist biomedical scientists in discovering causal knowledge from large and complex biomedical datasets. The proposed Center will develop and make freely available a suite of such methods for use by biomedical scientists, data analysts, and data scientists. The Center will also provide training about the methods and engage actively with other Centers.

National Institute of Health (NIH)
National Human Genome Research Institute (NHGRI)
Specialized Center--Cooperative Agreements (U54)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-BST-R (52))
Program Officer
Brooks, Lisa
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Pittsburgh
United States
Zip Code
Huang, Tianzhi; Alvarez, Angel A; Pangeni, Rajendra P et al. (2016) A regulatory circuit of miR-125b/miR-20b and Wnt signalling controls glioblastoma phenotypes through FZD6-modulated pathways. Nat Commun 7:12885
Sedgewick, Andrew J; Shi, Ivy; Donovan, Rory M et al. (2016) Learning mixed graphical models with separate sparsity parameters and stability-based model selection. BMC Bioinformatics 17 Suppl 5:175
Spagnolo, Daniel M; Gyanchandani, Rekha; Al-Kofahi, Yousef et al. (2016) Pointwise mutual information quantifies intratumor heterogeneity in tissue sections labeled with multiple fluorescent biomarkers. J Pathol Inform 7:47
Lu, Songjian; Cai, Chunhui; Yan, Gonghong et al. (2016) Signal-Oriented Pathway Analyses Reveal a Signaling Complex as a Synthetic Lethal Target for p53 Mutations. Cancer Res 76:6785-6794
Spirtes, Peter; Zhang, Kun (2016) Causal discovery and inference: concepts and recent methodological advances. Appl Inform (Berl) 3:3
Böhm, Stefanie; Szakal, Barnabas; Herken, Benjamin W et al. (2016) The Budding Yeast Ubiquitin Protease Ubp7 Is a Novel Component Involved in S Phase Progression. J Biol Chem 291:4442-52
Strobl, Eric V; Visweswaran, Shyam (2016) Markov Boundary Discovery with Ridge Regularized Linear Models. J Causal Inference 4:31-48
Kummerfeld, Erich; Ramsey, Joseph (2016) Causal Clustering for 1-Factor Measurement Models. KDD 2016:1655-1664
Lu, Songjian; Mandava, Gunasheil; Yan, Gaibo et al. (2016) An exact algorithm for finding cancer driver somatic genome alterations: the weighted mutually exclusive maximum set cover problem. Algorithms Mol Biol 11:11
Plis, Sergey; Danks, David; Yang, Jianyu (2015) Mesochronal Structure Learning. Uncertain Artif Intell 31:

Showing the most recent 10 out of 23 publications