Statistical Methods for Analysis of Tumor Heterogeneity in Genetic Epidemiology

Hsu, Li

Abstract

Cancer is a major morbidity and mortality burden throughout the world. While much progress has been made, the elimination of cancer has not yet been achieved. In the currently funded grant, we have developed statistical methods for genome-wide association analysis of cancer and studied cancer by the site of origin. However, even within a site, cancer can have distinct mutational profiles across patients. Pooling all cancer cases occurring at one site as one disease may miss important clinical and etiological insights. Recently technology advances have made it possible to characterize somatic mutations at great detail in large numbers of tumors, providing a unique opportunity to study tumor heterogeneity. The objective of this competitive renewal is to continue our statistical methods development for association analyses of tumor heterogeneity with clinical outcomes, and for studying the underlying genetic and environmental etiology. There are challenges in analyzing the somatic mutation data. First, somatic mutation may only exist in a subset of tumor cells of a patient, so called intra-tumor heterogeneity. While our application is focused on tumor heterogeneity across patients, because intra-tumor heterogeneity can also impact clinical outcomes, important insight could be missed if it were not accounted for. The goal of Aim 1 is to develop statistical methods to account for intra-tumor heterogeneity when assessing the association of somatic mutations with clinical outcomes. Second, it is of great interest to discover germline-somatic mutation link; however, despite that tumor studies are considerably larger than before due to technology advances, the power for discovering such links remains limited because of moderate genetic effects and the burden of accounting for multiple comparison from testing millions of variants. The goal of Aim 2 is to develop novel screening strategies for prioritizing genetic variants in testing genome-wide association with tumor heterogeneity. We will achieve optimal power by using the weighted hypothesis testing framework, allowing for correlated genetic variants and continuous screening statistics. Third, it is common that tumor blocks can usually only be retrieved from a subset of cases and tumor sequencing data are thus only available for this subset. Meanwhile, extensive risk factor information has already been collected for the larger study. The goal of Aim 3 is to develop a robust and efficient approach to incorporate the summary statistics information from the larger study for characterizing the effects of genetic and environmental risk factors on risk of developing cancer with specific tumor feature. The methods will be applied to the Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO, PI: Ulrike Peters; Lead Biostatistician: Li Hsu), which includes over 125,000 colorectal cancer cases and controls all with GWAS data and additionally 7,000 tumors sequencing data. As our methods are also applicable to other cancer studies, we will implement them in computationally efficient and user-friendly software packages and disseminate them to the community through R/CRAN, R/Bioconductor, or Github.

Public Health Relevance

Tumor heterogeneity can lead to different clinical outcomes and understanding the underlying etiology can lead to novel insights into cancer prevention and treatment. Recent developments in sequencing technologies have made it possible to characterize somatic mutations at great detail in large numbers of tumors, providing a unique opportunity to study tumor heterogeneity. The objective of this application is to develop statistical methods for assessing association of tumor heterogeneity with clinical outcomes and to identify genetic and environmental risk factors that lead to tumor heterogeneity.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Cancer Institute (NCI)
Type: Research Project (R01)
Project #: 2R01CA189532-05
Application #: 9817026
Study Section: Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer: Divi, Rao L

Project Start: 2015-07-01
Project End: 2024-06-30
Budget Start: 2019-07-01
Budget End: 2020-06-30
Support Year: 5
Fiscal Year: 2019
Total Cost
Indirect Cost

Institution

Name: Fred Hutchinson Cancer Research Center
Department
Type
DUNS #: 078200995

City: Seattle
State: WA
Country: United States
Zip Code: 98109

Related projects


NIH 2020 R01 CA	Statistical Methods for Analysis of Tumor Heterogeneity in Genetic Epidemiology Hsu, Li / Fred Hutchinson Cancer Research Center
NIH 2019 R01 CA	Statistical Methods for Analysis of Tumor Heterogeneity in Genetic Epidemiology Hsu, Li / Fred Hutchinson Cancer Research Center
NIH 2018 R01 CA	Methods for Integrating Functional Data into Complex Disease Genetic Analyses Hsu, Li / Fred Hutchinson Cancer Research Center
NIH 2017 R01 CA	Methods for Integrating Functional Data into Complex Disease Genetic Analyses Hsu, Li / Fred Hutchinson Cancer Research Center
NIH 2016 R01 CA	Methods for Integrating Functional Data into Complex Disease Genetic Analyses Hsu, Li / Fred Hutchinson Cancer Research Center
NIH 2015 R01 CA	Methods for Integrating Functional Data into Complex Disease Genetic Analyses Hsu, Li / Fred Hutchinson Cancer Research Center

Publications

Conley, Christopher J; Ozbek, Umut; Wang, Pei et al. (2018) Characterizing functional consequences of DNA copy number alterations in breast and ovarian tumors by spaceMap. J Genet Genomics 45:361-371

Su, Yu-Ru; Di, Chongzhi; Bien, Stephanie et al. (2018) A Mixed-Effects Model for Powerful Association Tests in Integrative Functional Genomics. Am J Hum Genet 102:904-919

Liu, Jianyu; Sun, Wei; Liu, Yufeng (2018) Joint skeleton estimation of multiple directed acyclic graphs for heterogeneous population. Biometrics :

Neumeyer, Sonja; Banbury, Barbara L; Arndt, Volker et al. (2018) Mendelian randomisation study of age at menarche and age at menopause and the risk of colorectal cancer. Br J Cancer 118:1639-1647

Liu, Yanyan; Xiong, Sican; Sun, Wei et al. (2018) Joint Analysis of Strain and Parent-of-Origin Effects for Recombinant Inbred Intercrosses Generated from Multiparent Populations with the Collaborative Cross as an Example. G3 (Bethesda) 8:599-605

He, Qianchuan; Liu, Yang; Peters, Ulrike et al. (2018) Multivariate association analysis with somatic mutation data. Biometrics 74:176-184

Dai, James Y; Peters, Ulrike; Wang, Xiaoyu et al. (2018) Diagnostics for Pleiotropy in Mendelian Randomization Studies: Global and Individual Tests for Direct Effects. Am J Epidemiol 187:2672-2680

Sun, Wei; Bunn, Paul; Jin, Chong et al. (2018) The association between copy number aberration, DNA methylation and gene expression in tumor samples. Nucleic Acids Res 46:3009-3018

Zhao, Wei; Chen, Ying Qing; Hsu, Li (2017) On estimation of time-dependent attributable fraction from population-based case-control studies. Biometrics 73:866-875

Su, Yu-Ru; Di, Chong-Zhi; Hsu, Li et al. (2017) A unified powerful set-based test for sequencing data analysis of GxE interactions. Biostatistics 18:119-131

Showing the most recent 10 out of 19 publications

Comments

Be the first to comment on this grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: