Recent developments in The Human Genome Project and breakthroughs in different types of high throughput technologies have changed how researchers approach complex diseases by moving toward cross- disciplinary studies, collecting data on all facets of disease. The objective of this application is to develop efficient statistica and computational approaches to integrating genetics, genomics and epidemiologic data for understanding the interplay of genetics and environment in complex diseases, with the long-term goal of devising personalized strategies to prevent and treat these diseases. Genome-wide association studies have identified thousands of trait associated genetic variants, and provided valuable insights into the genetic architecture of these traits. However, most variants identified so far confer relatively small increments in risk, and explain only a small proportion o heritability, leading many to question how the remaining 'missing' heritability can be explained. This application addresses this 'missing' heritability from several aspects: rare variant association analysis, gene-environment interaction, and heritability estimation beyond additive genetic effects. Accordingly, we propose the following specific aims.
Aim 1 is to develop methods for integrating functional information into rare variants association analysis. To achieve this goal, Aim 1 includes developing databases of tissue-specific functional annotation and constructing regulatory expression networks (eQTL) from public data generated from large collaborative projects such as the Encyclopedia of DNA Elements and the Genotype Tissue Expression. The theoretical properties of the rare variants analysis will also be studied to devise powerful tests in consideration of genomic features such as linkage disequilibrium and sparse signals.
Aim 2 is to develop methods for rare variants gene-environment interaction (GxE) that incorporates functional information. Efficient and versatile screening strategies will also be developed for genome-wide discovery of GxE. Even though this aim is focused on GxE, the methods are also applicable to gene-gene interaction (GxG).
Aim 3 is to develop methods for estimating heritability that incorporates GxE and GxG to understand the complex interplay between genetic susceptibility and environment The proposed work is motivated by a large consortium on colorectal cancer, which has over 40,000 participants from well-characterized studies with detailed data on both environmental risk factors and GWAS and whole genome sequencing data. The developed methods will be applied to the consortium to gain new insights in colorectal cancer and demonstrate the feasibility of the methods. Since the methods are applicable to other complex diseases and traits, R-based open source software will be developed and submitted to the Comprehensive R Archive Network for broad dissemination.

Public Health Relevance

Recent developments in next generation sequencing technologies and other types of high-throughput technologies have changed how researchers approach complex diseases by moving toward cross-disciplinary studies, collecting data on all facets of disease. The objective of this application is to develop efficient statistical and computational approaches to integrating genetics, genomics and epidemiologic data for understanding the interplay between genetics and environment in complex diseases, with the long-term goal of devising personalized strategies to prevent and treat these diseases.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
5R01CA189532-04
Application #
9512853
Study Section
Special Emphasis Panel (ZRG1)
Program Officer
Divi, Rao L
Project Start
2015-07-01
Project End
2019-06-30
Budget Start
2018-07-01
Budget End
2019-06-30
Support Year
4
Fiscal Year
2018
Total Cost
Indirect Cost
Name
Fred Hutchinson Cancer Research Center
Department
Type
DUNS #
078200995
City
Seattle
State
WA
Country
United States
Zip Code
98109
Conley, Christopher J; Ozbek, Umut; Wang, Pei et al. (2018) Characterizing functional consequences of DNA copy number alterations in breast and ovarian tumors by spaceMap. J Genet Genomics 45:361-371
Su, Yu-Ru; Di, Chongzhi; Bien, Stephanie et al. (2018) A Mixed-Effects Model for Powerful Association Tests in Integrative Functional Genomics. Am J Hum Genet 102:904-919
Liu, Jianyu; Sun, Wei; Liu, Yufeng (2018) Joint skeleton estimation of multiple directed acyclic graphs for heterogeneous population. Biometrics :
Neumeyer, Sonja; Banbury, Barbara L; Arndt, Volker et al. (2018) Mendelian randomisation study of age at menarche and age at menopause and the risk of colorectal cancer. Br J Cancer 118:1639-1647
Liu, Yanyan; Xiong, Sican; Sun, Wei et al. (2018) Joint Analysis of Strain and Parent-of-Origin Effects for Recombinant Inbred Intercrosses Generated from Multiparent Populations with the Collaborative Cross as an Example. G3 (Bethesda) 8:599-605
He, Qianchuan; Liu, Yang; Peters, Ulrike et al. (2018) Multivariate association analysis with somatic mutation data. Biometrics 74:176-184
Dai, James Y; Peters, Ulrike; Wang, Xiaoyu et al. (2018) Diagnostics for Pleiotropy in Mendelian Randomization Studies: Global and Individual Tests for Direct Effects. Am J Epidemiol 187:2672-2680
Sun, Wei; Bunn, Paul; Jin, Chong et al. (2018) The association between copy number aberration, DNA methylation and gene expression in tumor samples. Nucleic Acids Res 46:3009-3018
Ritchie, Marylyn D; Davis, Joe R; Aschard, Hugues et al. (2017) Incorporation of Biological Knowledge Into the Study of Gene-Environment Interactions. Am J Epidemiol 186:771-777
Gorfine, Malka; Berndt, Sonja I; Chang-Claude, Jenny et al. (2017) Heritability Estimation using a Regularized Regression Approach (HERRA): Applicable to continuous, dichotomous or age-at-onset outcome. PLoS One 12:e0181269

Showing the most recent 10 out of 19 publications