The practice of biomedical research has undergone dramatic changes in recent years, largely driven by new biotechnology for high-throughput data generation. These technologies include high-throughput methods for imaging, genetic sequencing, proteomics, structure determination, and numerous other tasks that now make it possible to finely characterize numerous aspects of living systems from the molecular to the organismal levels. These advances in biotechnology and the vast amounts of data they are producing have revolutionized biomedical research. They have also, however, created a pressing need for scientists capable of working in a field that is increasingly data-driven and dependent on advanced computational methods. In particular, modern biomedical research depends on a new breed of computationally and mathematically sophisticated researchers who can understand new biotechnologies, develop innovative mathematical models and computer algorithms needed to make sense of their data, and apply this knowledge to drive biological and medical advances. To do so, these researchers require a strong command of computational science, the biomedical applications on which they work, and the biological and physical sciences that inform them. The Carnegie Mellon University/University of Pittsburgh Ph.D. Program in Computational Biology (CPCB) was created to meet this need for training experts in computational biology. The program aims to prepare the future leaders of computational biology: research scientists with deep knowledge of computational theory, biological and physical sciences, and a growing body of specialized interdisciplinary knowledge at the intersection of these areas. To accomplish this, the program leverages the shared strengths of its two hosts institutions, collectively world leaders in computer science, engineering, and medical research with long track records of innovation in computational biology research and educational. The training program includes an innovative curriculum covering fundamentals of computational biology, broadly defined, and a large body of advanced elective coursework spanning four broad domains of computational biology research: bioimage informatics, cellular and systems modeling, computational genomics, and computational structural biology. Program students perform thesis research in any of numerous laboratories at the cutting edge of computational biology research. These primary components of coursework and thesis research are supplemented by numerous mechanisms to facilitate student success, promote professional development, encourage responsible conduct of research, and aid in recruiting and retaining underrepresented groups. The proposed program seeks to renew training support for a select subset of students in the broader CPCB graduate program. It will provide the most promising students with two years of research support, providing them added resources and flexibility to pursue the most innovative research directions and to aid in their development into future leaders of computational biology and biomedical research as a whole.

Public Health Relevance

Biomedical research has become a data-intensive field that depends on researchers with sophisticated knowledge of both computational and biomedical sciences. By training a core of exceptionally talented students in these skills, the proposed work will help advance numerous directions in improving medical treatment that now critically depend on computational innovation, such as medical image analysis, personalized and genomic medicine, and modern drug design.

National Institute of Health (NIH)
Institutional National Research Service Award (T32)
Project #
Application #
Study Section
Special Emphasis Panel (ZEB1)
Program Officer
Baird, Richard A
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Carnegie-Mellon University
Schools of Arts and Sciences
United States
Zip Code
Poon, Leo L M; Song, Timothy; Rosenfeld, Roni et al. (2016) Quantifying influenza virus diversity and transmission in humans. Nat Genet 48:195-200
Naik, Armaghan W; Kangas, Joshua D; Sullivan, Devin P et al. (2016) Active machine learning-driven experimentation to determine compound effects on protein patterns. Elife 5:e10047
Donovan, Rory M; Tapia, Jose-Juan; Sullivan, Devin P et al. (2016) Unbiased Rare Event Sampling in Spatial Stochastic Systems Biology Models Using a Weighted Ensemble of Trajectories. PLoS Comput Biol 12:e1004611
Harris, Leonard A; Hogg, Justin S; Tapia, José-Juan et al. (2016) BioNetGen 2.2: advances in rule-based modeling. Bioinformatics 32:3366-3368
Spagnolo, Daniel M; Gyanchandani, Rekha; Al-Kofahi, Yousef et al. (2016) Pointwise mutual information quantifies intratumor heterogeneity in tissue sections labeled with multiple fluorescent biomarkers. J Pathol Inform 7:47
Solomon, Brad; Kingsford, Carl (2016) Fast search of thousands of short-read sequencing experiments. Nat Biotechnol 34:300-2
Roman, Theodore; Xie, Lu; Schwartz, Russell (2016) Medoidshift clustering applied to genomic bulk tumor data. BMC Genomics 17 Suppl 1:6
Li, Ying; Majarian, Timothy D; Naik, Armaghan W et al. (2016) Point process models for localization and interdependence of punctate cellular structures. Cytometry A 89:633-43
Savol, Andrej J; Chennubhotla, Chakra S (2015) Approximating frustration scores in complex networks via perturbed Laplacian spectra. Phys Rev E Stat Nonlin Soft Matter Phys 92:062806
Malkin, Alexander D; Sheehan, Robert P; Mathew, Shibin et al. (2015) A Neutrophil Phenotype Model for Extracorporeal Treatment of Sepsis. PLoS Comput Biol 11:e1004314

Showing the most recent 10 out of 41 publications