Rare variants have been heralded as key to uncovering missing heritability"""""""" in complex diseases such as cancers. These variants can now be genotyped using next-generation sequencing technologies;nonetheless, rare haplotypes may also result from combination of common SNPs available from Genome-Wide Association Studies (GWAS). In this regard, there may be a great deal of treasure that are yet to be mined from the GWAS data to explore the common disease rare variant hypothesis. Recently, we have proposed an approach named Logistic Bayesian LASSO (LBL) to identify association with rare haplotypes in a case-control setting. LBL is an adaptation of the Bayesian counterpart of penalized regression approach LASSO. Our approach is able to weed out unassociated (especially common) haplotypes to achieve enough noise reduction so that the signals contained in the associated rare haplotypes can be more easily detected. Using LBL, we were able to implicate a specific rare haplotype for Age-related Macular Degeneration (AMD) in the Complement Factor H (CFH) gene for the first time. In addition to rare variants, gene-environment interaction (GXE) is believed to be another important contributor to missing heritability. LBL has a flexible framework that can incorporate non-genetic (environmental) covariates and gene- environment interactions. In this project we propose methods for exploring interactions between rare haplotypes and environmental factors in cancer epidemiology, rst in the setting of simple random sampling and then for stratified random sampling. We will develop methods both with and without the assumption of gene-environment independence. The methods will be extensively studied through simulations under a variety of settings. They will be applied to several cancer datasets available from NIH's database of Genotypes and Phenotypes (dbGaP) and the AMD data. Further, the method for stratified sampling will be used to analyze the NCI-sponsored Kidney Cancer Case-Control Study, wherein the controls were selected by stratified sampling using frequency matching with cases. We will implement the proposed methods in a well-documented user-friendly software and make it available to the larger scientific community.

Public Health Relevance

Unraveling the interplay between gene and environment is fundamental to the understanding of many cancers and other complex diseases, however, this problem is particularly challenging when the causal genetic variants occur infrequently in population. We propose to develop statistical methods to identify interactions of rare genetic variants with environmental factors in causing disease and apply them to several cancer datasets. Successful implementation and application of these methods will contribute greatly to uncovering the role and interplay of rare variants and environmental factors in the etiology of cancers and other complex diseases, which can be of potential value in aiding the prevention and treatments of those diseases.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Small Research Grants (R03)
Project #
1R03CA171011-01
Application #
8386879
Study Section
Special Emphasis Panel (ZCA1-SRLB-Q (M1))
Program Officer
Chen, Huann-Sheng
Project Start
2012-07-10
Project End
2012-08-14
Budget Start
2012-07-10
Budget End
2012-08-14
Support Year
1
Fiscal Year
2012
Total Cost
$31,988
Indirect Cost
$9,928
Name
University of North Texas
Department
Biostatistics & Other Math Sci
Type
Schools of Public Health
DUNS #
110091808
City
Fort Worth
State
TX
Country
United States
Zip Code
76107
Zhang, Yuan; Hofmann, Jonathan N; Purdue, Mark P et al. (2017) Logistic Bayesian LASSO for genetic association analysis of data from complex sampling designs. J Hum Genet 62:819-829
Zhang, Yuan; Lin, Shili; Biswas, Swati (2017) Detecting rare and common haplotype-environment interaction under uncertainty of gene-environment independence assumption. Biometrics 73:344-355
Datta, Ananda S; Zhang, Yuan; Zhang, Lei et al. (2016) Association of rare haplotypes on ULK4 and MAP4 genes with hypertension. BMC Proc 10:363-369
Zhang, Yuan; Biswas, Swati (2015) An Improved Version of Logistic Bayesian LASSO for Detecting Rare Haplotype-Environment Interactions with Application to Lung Cancer. Cancer Inform 14:11-6
Lin, Shili (2015) Kullback-Leibler divergence for detection of rare haplotype common disease association. Eur J Hum Genet 23:1558-65
Wang, Meng; Lin, Shili (2015) Detecting associations of rare variants with common diseases: collapsing or haplotyping? Brief Bioinform 16:759-68
Satten, Glen A; Biswas, Swati; Papachristou, Charalampos et al. (2014) Population-based association and gene by environment interactions in Genetic Analysis Workshop 18. Genet Epidemiol 38 Suppl 1:S49-56
Biswas, Swati; Xia, Shuang; Lin, Shili (2014) Detecting rare haplotype-environment interaction with logistic Bayesian LASSO. Genet Epidemiol 38:31-41
Wang, Meng; Lin, Shili (2014) FamLBL: detecting rare haplotype disease association based on common SNPs using case-parent triads. Bioinformatics 30:2611-8
Biswas, Swati; Papachristou, Charalampos (2014) Evaluation of logistic Bayesian LASSO for identifying association with rare haplotypes. BMC Proc 8:S54