Recent advances in genomic sciences have significantly changed the landscape of environmental health science research. The collection of high-throughput genomic data has become increasingly important for investigating the interplay of genes and environment in causing human diseases in environmental case-control and cohort studies. Analysis of such high-dimensional gene-environment data presents substantial statistical and computational challenges, especially in investigating gene-environment interactions. Limited statistical developments have been made in this area so far. This methodological shortage has become a bottleneck for effectively studying the roles of genes and their interactions with the environment in causing human diseases. This proposal responds to this need by developing advanced semi-parametric statistical methods to analyze high-throughput data from gene-environment studies. We plan (1) to develop semi-parametric locally efficient methods for doubly robust estimation, in a case-control study, of a model for the joint effect of a genetic factor, an environmental exposure, and multiple extraneous confounding factors; (2) to develop semi-parametric methods for multiply robust estimation, in cohort and case-control studies, of a model of interaction between a genetic factor and an environmental exposure in their effect on a binary disease outcome; (3) to develop semi-parametric methods for doubly robust inference on genetic effects, incorporating gene-environment interaction and confounding adjustment, in a Cox proportional hazards model for censored survival data; and (4) to develop efficient, open-access, user-friendly algorithms and statistical software that implement these methods, with the goal of disseminating them freely to the gene-environment research community. In addition, we will evaluate the performance of our methods in three ongoing GWAS in which we have been involved, as well as in simulation studies.

Public Health Relevance

The proposed project will develop cutting-edge methods for the discovery of novel genes and gene-environment interactions while efficiently incorporating prior knowledge. The impact of these methods on the field of public health promises to be significant, through improved methodology for robust investigation of the interplay of genes and environment in causing human diseases in environmental case-control and cohort studies.

National Institutes of Health (NIH)
National Institute of Environmental Health Sciences (NIEHS)
Research Project (R01)
Study Section
Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer
McAllister, Kimberly A
Harvard University
Public Health & Prev Medicine
Schools of Public Health
United States
Prague, Melanie; Wang, Rui; Stephens, Alisa et al. (2016) Accounting for interactions and complex inter-subject dependency in estimating treatment effect in cluster-randomized trials with missing outcomes. Biometrics 72:1066-1077
Nguyen, Thu T; Tchetgen Tchetgen, Eric J; Kawachi, Ichiro et al. (2016) Instrumental variable approaches to identifying the causal effect of educational attainment on dementia risk. Ann Epidemiol 26:71-6.e1-3
Nguyen, Thu T; Tchetgen Tchetgen, Eric J; Kawachi, Ichiro et al. (2016) Comparing Alternative Effect Decomposition Methods: The Role of Literacy in Mediating Educational Effects on Mortality. Epidemiology 27:670-6
Walter, S; Glymour, M M; Koenen, K et al. (2015) Do genetic risk scores for body mass index predict risk of phobic anxiety? Evidence for a shared genetic risk factor. Psychol Med 45:181-91
VanderWeele, Tyler J; Tchetgen Tchetgen, Eric J (2015) Alternative decompositions for attributing effects to interactions. Epidemiology 26:e32-4
Tchetgen Tchetgen, Eric J; Phiri, Kelesitse; Shapiro, Roger (2015) A Simple Regression-based Approach to Account for Survival Bias in Birth Outcomes Research. Epidemiology 26:473-80
Walter, Stefan; Kubzansky, Laura D; Koenen, Karestan C et al. (2015) Revisiting Mendelian randomization studies of the effect of body mass index on depression. Am J Med Genet B Neuropsychiatr Genet 168B:108-15
Naimi, Ashley I; Tchetgen Tchetgen, Eric J (2015) Invited commentary: Estimating population impact in the presence of competing events. Am J Epidemiol 181:571-4
Tchetgen Tchetgen, Eric J; Walter, Stefan; Vansteelandt, Stijn et al. (2015) Instrumental variable estimation in a survival context. Epidemiology 26:402-10
VanderWeele, Tyler J; Tchetgen Tchetgen, Eric J (2014) Attributing effects to interactions. Epidemiology 25:711-22

Showing the most recent 10 out of 33 publications