The Penn State ?Computation, Bioinformatics, and Statistics (CBIOS) Predoctoral Training Program? will train a new generation of scientists with strong computational, statistical, biological, and science communication skills to enable them to develop and lead interdisciplinary collaborative research efforts that promote advances in the biomedical and life sciences. The goals of CBIOS are consistent with the ?Bioinformatics and Computational Biology? program at National Institute of General Medicine.
The aim i s to train a new class of scientists with a primary identity as computational biologists/bioinformaticians, and whose disciplinary core draws from an emerging set of principles on how to compute, analyze, and apply biological data. The goals of the first five years of the CBIOS program were to train students ?that can think statistically, use computational and statistical tools, and generate computational and statistical innovation to keep pace with the quickly evolving landscape of high-throughput genomics technologies.? The second cycle of CBIOS will continue to provide a strong foundation in quantitative and life sciences, and incorporate a new science communication curriculum aimed at developing fluency in communicating the purpose, significance, and outcomes of research to diverse stakeholders in the science, medicine, policy, and business arenas. CBIOS builds upon six established graduate programs including four departmental and two intercollegiate programs. Training faculty participating in the program belong to five different colleges and two campuses ? providing a broad and interdisciplinary research spectrum. Most of the training faculty are affiliated with the intercollegiate Bioinformatics and Genomics graduate program and belong to centers of excellence under the auspices of the Genome Sciences Institute, which provides a common platform for interactions. Our faculty interact closely ? as evidenced by joint authorship on research publications, co-funded grants, co-teaching, and co-advising of graduate students. They have a combined annual research funding base of $75,265,600 direct cost, offering a solid foundation for trainees? research experience and opportunities. Combining NIH and Penn State support, CBIOS plans to train a minimum of 18 predoctoral trainees over a period of five years. Each trainee will be supported for two years (year 2 and 3 of their graduate career) while receiving a foundational curriculum through the CBIOS program. Trainees will gain a thorough understanding of the scientific process, responsible conduct of research, fluency in research methodologies, ability to utilize computational, bioinformatics and statistical tools in large-scale genomic data analysis, excellence in communication to diverse audiences, leadership in cross-disciplinary research teams, and excellent professional developmental activities. After the two years of support, trainees will continue to participate in monthly trainee meetings and remain connected with program activities throughout their graduate career.

Public Health Relevance

The exploration of large, high dimensional data on genomes, transcriptomes, proteomes, and metabolomes is providing critical information linking myriad of inter-individual differences, or inter-cellular differences within individuals, to health or diseases. Analysis and interpretation of the comprehensive datasets generated by contemporary genomics research requires skills cutting across the disciplines of computer science, bioinformatics, statistics and the biological sciences. Our predoctoral training program prepares a cadre of young scientists to excel in cross-disciplinary skills encompassing computation, statistics, biology and science communication, and to be innovative leaders in a research community that strives to harvest insights from ?omics? data to improve human health.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Institutional National Research Service Award (T32)
Project #
2T32GM102057-06
Application #
9491167
Study Section
NIGMS Initial Review Group (TWD)
Program Officer
Resat, Haluk
Project Start
2013-07-01
Project End
2023-06-30
Budget Start
2018-07-01
Budget End
2019-06-30
Support Year
6
Fiscal Year
2018
Total Cost
Indirect Cost
Name
Pennsylvania State University
Department
Biology
Type
Schools of Arts and Sciences
DUNS #
003403953
City
University Park
State
PA
Country
United States
Zip Code
16802
Vinayachandran, Vinesh; Reja, Rohit; Rossi, Matthew J et al. (2018) Widespread and precise reprogramming of yeast protein-genome interactions in response to heat shock. Genome Res :
Zhang, Yan; An, Lin; Xu, Jie et al. (2018) Enhancing Hi-C data resolution with deep convolutional neural network HiCPlus. Nat Commun 9:750
Iyer, Janani; Singh, Mayanglambam Dhruba; Jensen, Matthew et al. (2018) Pervasive genetic interactions modulate neurodevelopmental defects of the autism-associated 16p11.2 deletion in Drosophila melanogaster. Nat Commun 9:2548
Warris, Sven; Schijlen, Elio; van de Geest, Henri et al. (2018) Correcting palindromes in long reads after whole-genome amplification. BMC Genomics 19:798
Rangavittal, Samarth; Harris, Robert S; Cechova, Monika et al. (2018) RecoverY: k-mer-based read classification for Y-chromosome-specific sequencing and assembly. Bioinformatics 34:1125-1131
Lee, Sang Y; Zhu, Junjia; Salzberg, Anna C et al. (2017) Analysis of single nucleotide variants of HFE gene and association to survival in The Cancer Genome Atlas GBM data. PLoS One 12:e0174778
Yang, Tao; Zhang, Feipeng; Yard?mc?, Galip Gürkan et al. (2017) HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient. Genome Res 27:1939-1949
Jensen, Matthew; Girirajan, Santhosh (2017) Mapping a shared genetic basis for neurodevelopmental disorders. Genome Med 9:109
Rieber, Lila; Mahony, Shaun (2017) miniMDS: 3D structural inference from high-resolution Hi-C data. Bioinformatics 33:i261-i266
Wang, Qingyu; Shashikant, Cooduvalli S; Jensen, Matthew et al. (2017) Novel metrics to measure coverage in whole exome sequencing datasets reveal local and global non-uniformity. Sci Rep 7:885

Showing the most recent 10 out of 18 publications