The practice of biomedical research has undergone dramatic changes in recent years, largely driven by new biotechnology for high-throughput data generation. These technologies include high-throughput methods for imaging, genetic sequencing, proteomics, structure determination, and numerous other tasks that now make it possible to finely characterize numerous aspects of living systems from the molecular to the organismal levels. These advances in biotechnology and the vast amounts of data they are producing have revolutionized biomedical research. They have also, however, created a pressing need for scientists capable of working in a field that is increasingly data-driven and dependent on advanced computational methods. In particular, modern biomedical research depends on a new breed of computationally and mathematically sophisticated researchers who can understand new biotechnologies, develop innovative mathematical models and computer algorithms needed to make sense of their data, and apply this knowledge to drive biological and medical advances. To do so, these researchers require a strong command of computational science, the biomedical applications on which they work, and the biological and physical sciences that inform them. The Carnegie Mellon University/University of Pittsburgh Ph.D. Program in Computational Biology (CPCB) was created to meet this need for training experts in computational biology. The program aims to prepare the future leaders of computational biology: research scientists with deep knowledge of computational theory, biological and physical sciences, and a growing body of specialized interdisciplinary knowledge at the intersection of these areas. To accomplish this, the program leverages the shared strengths of its two hosts institutions, collectively world leaders in computer science, engineering, and medical research with long track records of innovation in computational biology research and educational. The training program includes an innovative curriculum covering fundamentals of computational biology, broadly defined, and a large body of advanced elective coursework spanning four broad domains of computational biology research: bioimage informatics, cellular and systems modeling, computational genomics, and computational structural biology. Program students perform thesis research in any of numerous laboratories at the cutting edge of computational biology research. These primary components of coursework and thesis research are supplemented by numerous mechanisms to facilitate student success, promote professional development, encourage responsible conduct of research, and aid in recruiting and retaining underrepresented groups. The proposed program seeks to renew training support for a select subset of students in the broader CPCB graduate program. It will provide the most promising students with two years of research support, providing them added resources and flexibility to pursue the most innovative research directions and to aid in their development into future leaders of computational biology and biomedical research as a whole.

Public Health Relevance

Biomedical research has become a data-intensive field that depends on researchers with sophisticated knowledge of both computational and biomedical sciences. By training a core of exceptionally talented students in these skills, the proposed work will help advance numerous directions in improving medical treatment that now critically depend on computational innovation, such as medical image analysis, personalized and genomic medicine, and modern drug design.

National Institute of Health (NIH)
Institutional National Research Service Award (T32)
Project #
Application #
Study Section
Special Emphasis Panel (ZEB1)
Program Officer
Baird, Richard A
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Carnegie-Mellon University
Schools of Arts and Sciences
United States
Zip Code
Kangas, Joshua D; Naik, Armaghan W; Murphy, Robert F (2014) Efficient discovery of responses of proteins to compounds using active learning. BMC Bioinformatics 15:143
Hogg, Justin S; Harris, Leonard A; Stover, Lori J et al. (2014) Exact hybrid particle/population simulation of rule-based models of biochemical systems. PLoS Comput Biol 10:e1003544
Donovan, Rory M; Sedgewick, Andrew J; Faeder, James R et al. (2013) Efficient stochastic simulation of chemical kinetics networks using a weighted ensemble of trajectories. J Chem Phys 139:115105
Coelho, Luis Pedro; Kangas, Joshua D; Naik, Armaghan W et al. (2013) Determining the subcellular location of new proteins from microscope images using local features. Bioinformatics 29:2343-9
Sedgewick, Andrew J; Benz, Stephen C; Rabizadeh, Shahrooz et al. (2013) Learning subgroup-specific regulatory interactions and regulator independence with PARADIGM. Bioinformatics 29:i62-70
Travers, Timothy; Shao, Hanshuang; Wells, Alan et al. (2013) Modeling the assembly of the multiple domains of ýý-actinin-4 and its role in actin cross-linking. Biophys J 104:705-15
Savol, Andrej J; Burger, Virginia M; Agarwal, Pratul K et al. (2011) QAARM: quasi-anharmonic autoregressive model reveals molecular recognition pathways in ubiquitin. Bioinformatics 27:i52-60
Ramanathan, Arvind; Savol, Andrej J; Langmead, Christopher J et al. (2011) Discovering conformational sub-states relevant to protein function. PLoS One 6:e15827
Tsai, Ming-Chi; Blelloch, Guy; Ravi, R et al. (2011) A consensus tree approach for reconstructing human evolutionary history and detecting population substructure. IEEE/ACM Trans Comput Biol Bioinform 8:918-28