The proposed Program Project, Statistical Informatics for Cancer Research, will tackle a wide range of challenging statistical problems based on the computationally-intensive analysis of large and complicated data sets. Among the types of datasets to be handled are large administrative databases for disease mapping and public health surveillance subject to spatial and temporal correlation and high dimensional datasets arising from genomics or proteomics studies in cancer epidemiology. The Statistical Computing Core will be responsible for supporting the computational needs of all Program investigators. Specifically, the Core will 1. advise Program investigators on computational aspects of their work; 2. provide educational tutorials and training as appropriate; 3. provide specialized advice and support in terms of Geographic Information Systems (CIS) and bioinformatics; 4. serve as a Liaison with the Information Technology Department at the Harvard School of Public Health to ensure that all Program investigators have adequate support through the School's high performance Linux cluster; 5. create and maintain a Program website; 6. work with Program investigators to take their prototype programs and turn them into flexible, efficient, robust, well-documented, and user-friendly R libraries that can be distributed through the online R archive, as well as via the Program website. Christopher Paciorek, Assistant Professor of Biostatistics at the Harvard School of Public Health, will serve as Core Director. Dr. Paciorek is an experienced programmer and received strong training in statistical computing during his doctoral studies at Carnegie Mellon University. Dr. Paciorek's own research interests are in computational methods, especially for spatio-temporal and Bayesian modeling. The Core will subcontract professional programming support from Battelle Memorial Institute. The Battelle group providing the support (Project Manager, Mr. Warren Strauss) has an outstanding track record with respect to providing such support, and we are confident that we have identified a high quality, cost-effective solution. The overall Program PI, Dr. Ryan, has a successful ongoing collaboration with this same group.

National Institute of Health (NIH)
National Cancer Institute (NCI)
Research Program Projects (P01)
Project #
Application #
Study Section
Special Emphasis Panel (ZCA1-RPRB-7)
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Harvard University
United States
Zip Code
Valeri, Linda; Reese, Sarah L; Zhao, Shanshan et al. (2017) Misclassified exposure in epigenetic mediation analyses. Does DNA methylation mediate effects of smoking on birthweight? Epigenomics 9:253-265
Chipman, J; Braun, D (2017) Simpson's paradox in the integrated discrimination improvement. Stat Med 36:4468-4481
Wilson, Ander; Chiu, Yueh-Hsiu Mathilda; Hsu, Hsiao-Hsien Leon et al. (2017) Bayesian distributed lag interaction models to identify perinatal windows of vulnerability in children's health. Biostatistics 18:537-552
García-Albéniz, Xabier; Hsu, John; Hernán, Miguel A (2017) The value of explicitly emulating a target trial when using real world evidence: an application to colorectal cancer screening. Eur J Epidemiol 32:495-500
Krieger, Nancy; Feldman, Justin M; Waterman, Pamela D et al. (2017) Local Residential Segregation Matters: Stronger Association of Census Tract Compared to Conventional City-Level Measures with Fatal and Non-Fatal Assaults (Total and Firearm Related), Using the Index of Concentration at the Extremes (ICE) for Racial, Econ J Urban Health 94:244-258
Barnett, Ian; Mukherjee, Rajarshi; Lin, Xihong (2017) The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies. J Am Stat Assoc 112:64-76
Lee, Kyu Ha; Tadesse, Mahlet G; Baccarelli, Andrea A et al. (2017) Multivariate Bayesian variable selection exploiting dependence structure among outcomes: Application to air pollution effects on DNA methylation. Biometrics 73:232-241
Asafu-Adjei, Josephine; Mahlet, G Tadesse; Coull, Brent et al. (2017) Bayesian Variable Selection Methods for Matched Case-Control Studies. Int J Biostat 13:
Di, Qian; Wang, Yan; Zanobetti, Antonella et al. (2017) Air Pollution and Mortality in the Medicare Population. N Engl J Med 376:2513-2522
Cefalu, Matthew; Dominici, Francesca; Arvold, Nils et al. (2017) Model averaged double robust estimation. Biometrics 73:410-421

Showing the most recent 10 out of 178 publications