Genome-wide association studies (GWAS) have become the primary approach for dissecting the genetic basis of complex diseases and are a powerful approach for detecting common alleles that influence disease risk. To date, hundreds of putative disease gene loci have been identified in GWAS. Despite this progress, these newly discovered loci typically account for only a small fraction of disease heritability. This raises new questions about where and how we can find the remaining genetic variation contributing to the susceptibility of complex and common diseases. Potential sources of missing heritability are (1) the contribution of rare variants, (2) gene-gene and gene-environment interaction, (3) combination of multiple SNPs, each with small genetic effect, but collectively conferring large risk, (4) structural variation. Current statistical methods for genetic analysis are well suited for detecting common variants, but new models and methods of analysis are needed for revealing the sources of missing disease heritability. To this end, the goals of this proposal are to develop novel and powerful statistical methods for studying rare variants and gene-gene interactions in the context of next-generation sequencing and GWAS data. Specifically, the methods we will develop will provide a unified analytical framework for testing associations with both common and rare alleles as well as their interaction with genetic and environmental factors. We will also develop graphical models and other statistical methods for co-association and interaction network analysis. The power of these methods will be rigorously analyzed by theoretical and simulation approaches, and will be applied to existing GWAS data sets (psoriasis and rheumatoid arthritis) and next generation sequencing data of extreme cardiovascular phenotypes funded by NIH grant 1RC2 HL02419-01.

Public Health Relevance

This project aims to develop novel and powerful statistical methods for genetic association and interaction analysis of next-generation sequencing data and finding missing heritability unexplained by the current GWAS. Application of these methods to the sequence data will facilitate to identify entire spectrum of genetic variations that influence diseases and provide potential valuable tools for the development of diagnostic and interventional strategies.

National Institute of Health (NIH)
National Heart, Lung, and Blood Institute (NHLBI)
Research Project (R01)
Project #
Application #
Study Section
Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer
Wolz, Michael
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Texas Health Science Center Houston
Biostatistics & Other Math Sci
Schools of Public Health
United States
Zip Code
Wang, Panpan; Rahman, Mohammad; Jin, Li et al. (2016) A new statistical framework for genetic pleiotropic analysis of high dimensional phenotype data. BMC Genomics 17:881
Zhao, Jinying; Zhu, Yun; Xiong, Momiao (2016) Genome-wide gene-gene interaction analysis for next-generation sequencing. Eur J Hum Genet 24:421-8
Zhang, Futao; Xie, Dan; Liang, Meimei et al. (2016) Functional Regression Models for Epistasis Analysis of Multiple Quantitative Traits. PLoS Genet 12:e1005965
Jiang, Junhai; Lin, Nan; Guo, Shicheng et al. (2015) Multiple functional linear model for association analysis of RNA-seq with imaging. Quant Biol 3:90-102
Li, Lerong; Xiong, Momiao (2015) Dynamic Model for RNA-seq Data Analysis. Biomed Res Int 2015:916352
Zhao, Jinying; Zhu, Yun; Boerwinkle, Eric et al. (2015) Pathway analysis with next-generation sequencing data. Eur J Hum Genet 23:507-15
Lin, Nan; Jiang, Junhai; Guo, Shicheng et al. (2015) Functional Principal Component Analysis and Randomized Sparse Clustering Algorithm for Medical Image Analysis. PLoS One 10:e0132945
Dong, Chengliang; Wei, Peng; Jian, Xueqiu et al. (2015) Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies. Hum Mol Genet 24:2125-37
Tang, Hongwei; Wei, Peng; Duell, Eric J et al. (2014) Genes-environment interactions in obesity- and diabetes-associated pancreatic cancer: a GWAS data analysis. Cancer Epidemiol Biomarkers Prev 23:98-106
Zhang, Futao; Boerwinkle, Eric; Xiong, Momiao (2014) Epistasis analysis for quantitative traits by functional regression model. Genome Res 24:989-98

Showing the most recent 10 out of 34 publications