Colorectal cancer (CRC) is the second leading cause of cancer death in the US. Linkage studies and genome-wide association studies (GWAS) have successfully identified high-penetrance mutations such as those that occur in APC or DNA mismatch-repair genes, as well as low-penetrance variants such as 8q24 and SMAD7. However, these variants explain only a fraction of the heritability of CRC. This is not surprising, as contributions from large classes of genetic variation, specifically less frequent and rare singl nucleotide variants (SNV) with allele frequency of 0.1-5%, insertion/deletions (indels), and copy number variants (CNVs), have not been systematically investigated across the genome. These genetic variants are predicted to have stronger effect sizes than common low-penetrance variants and are postulated to explain a substantial proportion of the heritability of CRC. To comprehensively identify these variants across the genome, we propose to use next generation technology to sequence the whole genome with 12x coverage in 2,123 high-risk CRC cases and 2,123 controls (Aim 1.1). These cases and controls will be selected from our existing Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO;U01CA137088, PI: Peters) of 15 well-characterized prospective cohorts and case-control studies. We demonstrate that combining whole genome sequence data with imputation using existing GWAS data in large sets of case-control studies allows a powerful and efficient screen for CRC susceptibility loci. This method is particularly well suited to identifying less frequent and rare SNVs, indels, and CNVs. Accordingly, in Aim 1.2 we use the sequencing data from Aim 1.1 to impute ~20M variants in an additional 8,958 CRC cases and 10,212 controls with existing GWAS data. We will test the associations between CRC risk and variants (sequenced and imputed) in a total of 11,081 cases and 12,335 controls.
In Aim 1. 3, we will replicate the most promising loci by genotyping 3,000 variants in 8,827 independent CRC cases and 8,595 controls.
In Aim 2, we will investigate gene-environment interactions for directly sequenced and imputed variants, utilizing GECCO studies, which have detailed clinical and epidemiologic data that have already been harmonized across studies. To improve the power for Aim 1 and 2, we will apply novel statistical methods. This project brings together a highly qualified, multidisciplinary team of investigators with expertise in CRC research, biostatistics, population and statistical genetics, epidemiology, and next generation sequencing. We expect to identify several novel CRC susceptibility variants with effect sizes larger than previous GWAS findings. These results will improve our understanding of which genes are impacting CRC. Such knowledge about the underlying biology could have long term impacts on screening, treatment and disease prevention.

Public Health Relevance

This multidisciplinary effort will investigate whether different types of genetic variations, including rare variants and structural variation, influence colorectal cancer risk in humans. Specifically, we will examine genetic variants across the entire genomes of colorectal cancer cases and controls to identify new genetic risk factors for colorectal cancer. Findings from this study will improve our knowledge of the full spectrum of genes that affect the risk of this severe disease.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project--Cooperative Agreements (U01)
Project #
1U01CA164930-01A1
Application #
8370804
Study Section
Special Emphasis Panel (ZRG1-PSE-Q (02))
Program Officer
Mechanic, Leah E
Project Start
2012-09-26
Project End
2016-08-31
Budget Start
2012-09-26
Budget End
2013-08-31
Support Year
1
Fiscal Year
2012
Total Cost
$3,333,442
Indirect Cost
$303,685
Name
Fred Hutchinson Cancer Research Center
Department
Type
DUNS #
078200995
City
Seattle
State
WA
Country
United States
Zip Code
98109
Su, Yu-Ru; Di, Chongzhi; Bien, Stephanie et al. (2018) A Mixed-Effects Model for Powerful Association Tests in Integrative Functional Genomics. Am J Hum Genet 102:904-919
Jeon, Jihyoun; Du, Mengmeng; Schoen, Robert E et al. (2018) Determining Risk of Colorectal Cancer and Starting Age of Screening Based on Lifestyle, Environmental, and Genetic Factors. Gastroenterology 154:2152-2164.e19
Rashkin, Sara; Jun, Goo; Chen, Sai et al. (2017) Optimal sequencing strategies for identifying disease-associated singletons. PLoS Genet 13:e1006811
Bien, Stephanie A; Auer, Paul L; Harrison, Tabitha A et al. (2017) Enrichment of colorectal cancer associations in functional regions: Insight for using epigenomics data in the analysis of whole genome sequence-imputed GWAS data. PLoS One 12:e0186518
Dimitrakopoulou, Vasiliki I; Tsilidis, Konstantinos K; Haycock, Philip C et al. (2017) Circulating vitamin D concentration and risk of seven cancers: Mendelian randomisation study. BMJ 359:j4761
Zhao, Wei; Chen, Ying Qing; Hsu, Li (2017) On estimation of time-dependent attributable fraction from population-based case-control studies. Biometrics 73:866-875
Su, Yu-Ru; Di, Chong-Zhi; Hsu, Li et al. (2017) A unified powerful set-based test for sequencing data analysis of GxE interactions. Biostatistics 18:119-131
Lindström, Sara; Finucane, Hilary; Bulik-Sullivan, Brendan et al. (2017) Quantifying the Genetic Correlation between Multiple Cancer Types. Cancer Epidemiol Biomarkers Prev 26:1427-1435
McCarthy, Shane; Das, Sayantan; Kretzschmar, Warren et al. (2016) A reference panel of 64,976 haplotypes for genotype imputation. Nat Genet 48:1279-83
Kocarnik, Jonathan M; Chan, Andrew T; Slattery, Martha L et al. (2016) Relationship of prediagnostic body mass index with survival after colorectal cancer: Stage-specific associations. Int J Cancer 139:1065-72

Showing the most recent 10 out of 52 publications