Genome-wide association studies (GWAS) have been highly successful identifying loci associated with complex traits, including prostate cancer (PrCa). Although these newly discovered loci have shed some light into the underlying biology, the clinical impact for preventing or treating PrCa has been limited. Furthermore, the known variation fails to explain a majority of PrCa. GWAS primarily focus on common genetic variation (MAF>5%), however, empirical evidence suggests low-frequency variants play a role in cancer, including PrCa. Although low-frequency variants will explain a portion of the missing heritability for PrCa and could lead to actionable targets for prevention or treatment, current discovery strategies are not feasible. Whole-genome sequencing studies are too costly for the sample sizes needed to adequately evaluate low-frequency variants in PrCa. Innovative strategies are needed. Several studies have demonstrated that the accuracy of imputation for low-frequency variants improves with increasing size of the reference panel. Here, we propose the following:
AIM 1 - Assemble an enhanced reference panel of over 10,000 whole-genomes and perform imputation for 83,991 PrCa cases and 58,430 disease-free controls assembled from available GWAS data (Aim 1a). Using the above data, define operating characteristics of imputation for less common and rare variants using existing GWAS data. Clarify statistical relationship between GWAS marker frequency spectrum, target market frequency spectrum, reference panel size, and imputation success as measured by imputation R2, determine for above data a target set of imputable common, less common, and rare variants (Aim 1b);
AIM 2 - Perform a whole-genome scan of imputable common, less common, and rare variants associated with overall and aggressive PrCa susceptibility and PSA levels among disease-free controls;
AIM 3 - Develop novel statistical approaches to improve estimation and partition of heritability by incorporating genomic annotation (e.g. exonic, regulatory regions, pathways, etc.) while accounting for imputation error to improve risk prediction models. Recently, we performed a meta-analysis for PrCa utilizing GWAS data imputed to the 1000 Genomes Project for 31,663 PrCa cases and 35,870 disease-free controls identifying over 30 common novel susceptibility loci that are currently being validated. Furthermore, we will be genotyping an additional 90,000 individuals of European ancestry (2/3 PrCa cases and 1/3 controls) using a customized array with a GWAS backbone as part of the NCI Genetic Associations and Mechanisms in Oncology """"""""post-GWAS"""""""" initiative (GAME-ON). Our recent experience demonstrates our ability to implement imputation in a large data resource and our strong collaborations to assemble the world's largest PrCa case-control series with available GWAS data. We are collaborating with several groups to coordinate efforts assembling an enhanced reference panel for imputation and will provide access to the research community. This proposal not only addresses a fundamental question in PrCa but will greatly aid quantifying the role of genetics in other complex phenotypes.
Prostate cancer (PrCa) is the second most common cancer worldwide among men with very few known risk factors. Genome-wide association studies (GWAS) have successfully demonstrated common variation plays a central role in PrCa etiology but known loci fail to explain the inherited component entirely. Utilizing a cost- efficient stratey we will evaluate low-frequency variation by leveraging available resources to identify the missing heritability of PrCa and identify potential therapeutic targets.
|Sud, Amit; Thomsen, Hauke; Orlando, Giulia et al. (2018) Genome-wide association study implicates immune dysfunction in the development of Hodgkin lymphoma. Blood 132:2040-2052|
|Pande, Mala; Joon, Aron; Brewster, Abenaa M et al. (2018) Genetic susceptibility markers for a breast-colorectal cancer phenotype: Exploratory results from genome-wide association studies. PLoS One 13:e0196245|
|Went, Molly; Sud, Amit; Försti, Asta et al. (2018) Identification of multiple risk loci and regulatory mechanisms influencing susceptibility to multiple myeloma. Nat Commun 9:3707|
|Matejcic, Marco; Saunders, Edward J; Dadaev, Tokhir et al. (2018) Germline variation at 8q24 and prostate cancer risk in men of European ancestry. Nat Commun 9:4616|
|Loveday, Chey; Litchfield, Kevin; Levy, Max et al. (2018) Validation of loci at 2q14.2 and 15q21.3 as risk factors for testicular cancer. Oncotarget 9:12630-12638|
|Mancuso, Nicholas; Gayther, Simon; Gusev, Alexander et al. (2018) Large-scale transcriptome-wide association study identifies new prostate cancer risk regions. Nat Commun 9:4079|
|Loveday, Chey; Law, Philip; Litchfield, Kevin et al. (2018) Large-scale Analysis Demonstrates Familial Testicular Cancer to have Polygenic Aetiology. Eur Urol 74:248-252|
|Fejzo, Marlena S; Sazonova, Olga V; Sathirapongsasuti, J Fah et al. (2018) Placenta and appetite genes GDF15 and IGFBP7 are associated with hyperemesis gravidarum. Nat Commun 9:1178|
|Litchfield, Kevin; Levy, Max; Orlando, Giulia et al. (2017) Identification of 19 new risk loci and potential regulatory mechanisms influencing susceptibility to testicular germ cell tumor. Nat Genet 49:1133-1140|
|Gusev, Alexander; Shi, Huwenbo; Kichaev, Gleb et al. (2016) Atlas of prostate cancer heritability in European and African-American men pinpoints tissue-specific regulation. Nat Commun 7:10979|