The proposed Program Project, Statistical Informatics for Cancer Research, will tackle a wide range of challenging statistical problems based on the computationally-intensive analysis of large and complicated data sets. Among the types of datasets to be handled are large administrative databases for disease mapping and public health surveillance subject to spatial and temporal correlation and high dimensional datasets arising from genomics or proteomics studies in cancer epidemiology. The Statistical Computing Core will be responsible for supporting the computational needs of all Program investigators. Specifically, the Core will 1. advise Program investigators on computational aspects of their work; 2. provide educational tutorials and training as appropriate; 3. provide specialized advice and support in terms of Geographic Information Systems (CIS) and bioinformatics; 4. serve as a Liaison with the Information Technology Department at the Harvard School of Public Health to ensure that all Program investigators have adequate support through the School's high performance Linux cluster; 5. create and maintain a Program website; 6. work with Program investigators to take their prototype programs and turn them into flexible, efficient, robust, well-documented, and user-friendly R libraries that can be distributed through the online R archive, as well as via the Program website. Christopher Paciorek, Assistant Professor of Biostatistics at the Harvard School of Public Health, will serve as Core Director. Dr. Paciorek is an experienced programmer and received strong training in statistical computing during his doctoral studies at Carnegie Mellon University. Dr. Paciorek's own research interests are in computational methods, especially for spatio-temporal and Bayesian modeling. The Core will subcontract professional programming support from Battelle Memorial Institute. The Battelle group providing the support (Project Manager, Mr. Warren Strauss) has an outstanding track record with respect to providing such support, and we are confident that we have identified a high quality, cost-effective solution. The overall Program PI, Dr. Ryan, has a successful ongoing collaboration with this same group.

National Institute of Health (NIH)
National Cancer Institute (NCI)
Research Program Projects (P01)
Project #
Application #
Study Section
Special Emphasis Panel (ZCA1-RPRB-7)
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Harvard University
United States
Zip Code
Bind, M-A C; Vanderweele, T J; Coull, B A et al. (2016) Causal mediation analysis for longitudinal data with exogenous exposure. Biostatistics 17:122-34
Hernán, Miguel A; Robins, James M (2016) Using Big Data to Emulate a Target Trial When a Randomized Trial Is Not Available. Am J Epidemiol :
Chen, Jun; Just, Allan C; Schwartz, Joel et al. (2016) CpGFilter: model-based CpG probe filtering with replicates for epigenome-wide association studies. Bioinformatics 32:469-71
Lin, Xinyi; Lee, Seunggeun; Wu, Michael C et al. (2016) Test for rare variants by environment interactions in sequencing association studies. Biometrics 72:156-64
Lee, Kyu Ha; Tadesse, Mahlet G; Baccarelli, Andrea A et al. (2016) Multivariate Bayesian variable selection exploiting dependence structure among outcomes: Application to air pollution effects on DNA methylation. Biometrics :
Yung, Godwin; Lin, Xihong (2016) Validity of using ad hoc methods to analyze secondary traits in case-control association studies. Genet Epidemiol 40:732-743
Arvold, Nils D; Cefalu, Matthew; Wang, Yun et al. (2016) Comparative effectiveness of radiotherapy with vs. without temozolomide in older patients with glioblastoma. J Neurooncol :
Wasfy, Jason H; Dominici, Francesca; Yeh, Robert W (2016) Letter by Wasfy et al Regarding Article, ""Facility Level Variation in Hospitalization, Mortality, and Costs in the 30 Days After Percutaneous Coronary Intervention: Insights on Short-Term Healthcare Value From the Veterans Affairs Clinical Assessment, Re Circulation 133:e376
Carere, Deanna Alexis; Kraft, Peter; Kaphingst, Kimberly A et al. (2016) Consumers report lower confidence in their genetics knowledge following direct-to-consumer personal genomic testing. Genet Med 18:65-72
Zigler, Corwin Matthew (2016) The Central Role of Bayes' Theorem for Joint Estimation of Causal Effects and Propensity Scores. Am Stat 70:47-54

Showing the most recent 10 out of 136 publications