Statistical Methods for Cancer Genetic Association Studies with Hidden Population

Thornton, Timothy

Abstract

Advancements in the field of human genetics have created many new opportunities and challenges for statisticians. Cancer genetic studies are now routinely being conducted on a genome-wide basis. Some of the characteristics of the data include missing information and the need to analyze hundreds of thousands or even millions of markers in a single study, which puts a premium on computational speed of the methods. A fundamental problem of interest is to identify genetic risk factors that predispose some people to get a particular type of cancer. With a National Cancer Institute Mentored Research Scientist Development Award to Promote Diversity, the applicant plans to focus on developing statistical methods which seek genetic loci contributing to cancer genetics. The emphasis will be on methods that are computationally feasible for the analysis of genetic studies with millions of markers. The applicant's research will focus on developing new approaches for association testing with related individuals in structured populations. Case-control association testing has proven to be a valuable tool for the mapping of complex traits. Genetic association studies essentially seek genomic regions where the cases (individuals affected with the trait) and the controls (unaffected individuals) differ significantly. Case-control association methods, however, are not robust to population stratification, the presence of subgroups in the population with ancestry differences. A number of approaches have been proposed to control the false positive rate in samples with cryptic population structure, provided that the individuals in the sample are unrelated. Many cancer genetic association studies, however, contain related individuals and there has been little focus on methods that will correct for unknown population structure for samples with related individuals. Cryptic population and pedigree structure can lead to seriously spurious associations, and the applicant proposes using genome-screen data to infer both pedigree and population structure in the sample. Statistical methods that incorporate this structure will be developed to (1) better control the false positive rate and (2) improve the power to detect susceptibility variants in structured samples with related individuals. Statistical methods will also be developed to accommodate quantitative traits and the analysis of X-chromosome markers in cancer genetic studies for samples with related individuals. The methods will be applied to ongoing prostate cancer and haematological cancer genetic studies with collaborators in Australia, as well as a number of cancer genetic studies from the """"""""Gene Environment Association Studies"""""""" (GENEVA) program with collaborators at the University of Washington. Furthermore, the applicant plans to provide implementation of the methods in freely available software, which will allow for ready use by statisticians and biologists alike and insure a broad dissemination of the methods to the scientific community. One of the applicant's future research goals is to develop statistical methods that improve the power to detect causal cancer genes by incorporating relevant environmental covariates in the model. Most types of cancers are complex disorders that are influenced by complex interactions between genes and environmental factors, and an epidemiological perspective is a very important part of understanding the etiology of complex disorders. The applicant also hopes to contribute to the area of optimal study design in cancer genetic studies as well as methods to differentiate causal markers from associated markers. Ultimately, the aim of the proposed research is to facilitate a better understanding of the complicated biological processes of cancer genetics using statistical methodology.

Public Health Relevance

The biological process of many types of cancer is not well understood, and for this reason, identifying genes that cause or influence the disease is of extreme importance. Cancer genetic studies are now routinely being conducted on a genome-wide basis to identify regions of the genome that are involved with the disorder. We focus on developing novel statistical methodology for cancer genetic studies that have samples with related individuals, where the ancestry may not be completely known.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Cancer Institute (NCI)
Type: Research Scientist Development Award - Research & Training (K01)
Project #: 5K01CA148958-04
Application #: 8539309
Study Section: Subcommittee G - Education (NCI)
Program Officer: Vallejo-Estrada, Yolanda

Project Start: 2010-09-28
Project End: 2015-08-31
Budget Start: 2013-09-01
Budget End: 2014-08-31
Support Year: 4
Fiscal Year: 2013
Total Cost: $132,831
Indirect Cost: $9,839

Institution

Name: University of Washington
Department: Biostatistics & Other Math Sci
Type: Schools of Public Health
DUNS #: 605799469

City: Seattle
State: WA
Country: United States
Zip Code: 98195

Related projects


NIH 2014 K01 CA	Statistical Methods for Cancer Genetic Association Studies with Hidden Population Thornton, Timothy Alvin / University of Washington
NIH 2013 K01 CA	Statistical Methods for Cancer Genetic Association Studies with Hidden Population Thornton, Timothy Alvin / University of Washington	$132,831
NIH 2012 K01 CA	Statistical Methods for Cancer Genetic Association Studies with Hidden Population Thornton, Timothy Alvin / University of Washington	$135,312
NIH 2011 K01 CA	Statistical Methods for Cancer Genetic Association Studies with Hidden Population Thornton, Timothy Alvin / University of Washington	$138,924
NIH 2010 K01 CA	Statistical Methods for Cancer Genetic Association Studies with Hidden Population Thornton, Timothy Alvin / University of Washington	$139,599

Publications

Fohner, Alison E; Wang, Zhican; Yracheta, Joseph et al. (2016) Genetics, Diet, and Season Are Associated with Serum 25-Hydroxycholecalciferol Concentration in a Yup'ik Study Population from Southwestern Alaska. J Nutr 146:318-25

Schick, Ursula M; Jain, Deepti; Hodonsky, Chani J et al. (2016) Genome-wide Association Study of Platelet Count Identifies Ancestry-Specific Loci in Hispanic/Latino Americans. Am J Hum Genet 98:229-42

Conomos, Matthew P; Laurie, Cecelia A; Stilp, Adrienne M et al. (2016) Genetic Diversity and Association Studies in US Hispanic/Latino Populations: Applications in the Hispanic Community Health Study/Study of Latinos. Am J Hum Genet 98:165-84

Chen, Han; Wang, Chaolong; Conomos, Matthew P et al. (2016) Control for Population Structure and Relatedness for Binary Traits in Genetic Association Studies via Logistic Mixed Models. Am J Hum Genet 98:653-66

Shirasaka, Y; Chaudhry, A S; McDonald, M et al. (2016) Interindividual variability of CYP2C19-catalyzed drug metabolism due to differences in gene diplotypes and cytochrome P450 oxidoreductase content. Pharmacogenomics J 16:375-87

Morrison, Jean; Laurie, Cathy C; Marazita, Mary L et al. (2016) Genome-wide association study of dental caries in the Hispanic Communities Health Study/Study of Latinos (HCHS/SOL). Hum Mol Genet 25:807-16

Conomos, Matthew P; Reiner, Alexander P; Weir, Bruce S et al. (2016) Model-free Estimation of Recent Genetic Relatedness. Am J Hum Genet 98:127-48

McHugh, Caitlin; Brown, Lisa; Thornton, Timothy A (2016) Detecting Heterogeneity in Population Structure Across the Genome in Admixed Populations. Genetics 204:43-56

Hong, Xiumei; Hao, Ke; Ladd-Acosta, Christine et al. (2015) Genome-wide association study identifies peanut allergy-specific loci and evidence of epigenetic mediation in US children. Nat Commun 6:6304

Thornton, Timothy A (2015) Statistical methods for genome-wide and sequencing association studies of complex traits in related samples. Curr Protoc Hum Genet 84:1.28.1-9

Showing the most recent 10 out of 24 publications

Comments

Be the first to comment on Timothy Thornton's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: