This renewal application proposes to carry out a Program Project of statistical methods research to address gaps and barriers arising in the analysis of large and complex data from observational studies in cancer research. The ultimate goal of the Program is to use rich data sources to develop effective strategies for reducing cancer burden in the U.S. and improving longevity and quality of life. This Program Project comprises three research projects and two cores. The three integrated projects jointly address the statistical needs for three research priority areas identified by the Division of Cancer Contro and Population Science of National Cancer Institute: Health Disparities;Comparative Effectiveness Research;and Public Health Genomics. In Project 1, we will develop statistical methods to overcome common data limitations for the investigation of social and racial disparities spanning the cancer continuum. We will analyze data from the SEER database that is linked with data from the National Longitudinal Mortality Survey (NLMS). In Project 2, we will develop methods for comparative effectiveness research (CER) in cancer using large observational data. We will use the SEER-Medicare data and the CaPSURE cohort to emulate complex randomized trials to compare the effectiveness of personalized strategies for cancer diagnosis and dynamic strategies for cancer treatment. In Project 3, we will develop statistical methods for analysis of next generation sequencing data in genetic cancer epidemiological studies. The proposed research in Project 3 is motivated by and applied to the Harvard lung cancer and breast cancer exome and targeted sequencing studies as well as the affiliated Genome-Wide Association Studies. The Administrative Core will coordinate the overall scientific direction and programmatic activities of the Program, which will include regular P01 meetings, seminars, the annual retreat, the external advisory committee meeting, short courses, a visitor program, dissemination of research results. The Statistical Computing Core will allow access to Harvard largest high performance computing cluster, perform data management, and ensure the development and dissemination of open access, high quality software. The Program PIs, Professors Xihong Lin and Francesca Dominici, are renowned biostatisticians with strong track records of methodological and collaborative research and academic administration.
This research Program aims to develop innovative and practical statistical tools for the analysis of large and complex observational data to study social disparities in cancer, comparative effectiveness of cancer diagnosis and treatment, and cancer risk assessment and prediction, prevention, and progression using genetic profiles and environmental/behavior/social exposures.
|Valeri, Linda; Reese, Sarah L; Zhao, Shanshan et al. (2017) Misclassified exposure in epigenetic mediation analyses. Does DNA methylation mediate effects of smoking on birthweight? Epigenomics 9:253-265|
|Chipman, J; Braun, D (2017) Simpson's paradox in the integrated discrimination improvement. Stat Med 36:4468-4481|
|Wilson, Ander; Chiu, Yueh-Hsiu Mathilda; Hsu, Hsiao-Hsien Leon et al. (2017) Bayesian distributed lag interaction models to identify perinatal windows of vulnerability in children's health. Biostatistics 18:537-552|
|García-Albéniz, Xabier; Hsu, John; Hernán, Miguel A (2017) The value of explicitly emulating a target trial when using real world evidence: an application to colorectal cancer screening. Eur J Epidemiol 32:495-500|
|Krieger, Nancy; Feldman, Justin M; Waterman, Pamela D et al. (2017) Local Residential Segregation Matters: Stronger Association of Census Tract Compared to Conventional City-Level Measures with Fatal and Non-Fatal Assaults (Total and Firearm Related), Using the Index of Concentration at the Extremes (ICE) for Racial, Econ J Urban Health 94:244-258|
|Barnett, Ian; Mukherjee, Rajarshi; Lin, Xihong (2017) The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies. J Am Stat Assoc 112:64-76|
|Lee, Kyu Ha; Tadesse, Mahlet G; Baccarelli, Andrea A et al. (2017) Multivariate Bayesian variable selection exploiting dependence structure among outcomes: Application to air pollution effects on DNA methylation. Biometrics 73:232-241|
|Asafu-Adjei, Josephine; Mahlet, G Tadesse; Coull, Brent et al. (2017) Bayesian Variable Selection Methods for Matched Case-Control Studies. Int J Biostat 13:|
|Di, Qian; Wang, Yan; Zanobetti, Antonella et al. (2017) Air Pollution and Mortality in the Medicare Population. N Engl J Med 376:2513-2522|
|Cefalu, Matthew; Dominici, Francesca; Arvold, Nils et al. (2017) Model averaged double robust estimation. Biometrics 73:410-421|
Showing the most recent 10 out of 178 publications