Genome-wide association studies (GWAS) have become the primary approach for dissecting the genetic basis of complex diseases and are a powerful approach for detecting common alleles that influence disease risk. To date, hundreds of putative disease gene loci have been identified in GWAS. Despite this progress, these newly discovered loci typically account for only a small fraction of disease heritability. This raises new questions about where and how we can find the remaining genetic variation contributing to the susceptibility of complex and common diseases. Potential sources of missing heritability are (1) the contribution of rare variants, (2) gene-gene and gene-environment interaction, (3) combination of multiple SNPs, each with small genetic effect, but collectively conferring large risk, (4) structural variation. Current statistical methods for genetic analysis are well suited for detecting common variants, but new models and methods of analysis are needed for revealing the sources of missing disease heritability. To this end, the goals of this proposal are to develop novel and powerful statistical methods for studying rare variants and gene-gene interactions in the context of next-generation sequencing and GWAS data. Specifically, the methods we will develop will provide a unified analytical framework for testing associations with both common and rare alleles as well as their interaction with genetic and environmental factors. We will also develop graphical models and other statistical methods for co-association and interaction network analysis. The power of these methods will be rigorously analyzed by theoretical and simulation approaches, and will be applied to existing GWAS data sets (psoriasis and rheumatoid arthritis) and next generation sequencing data of extreme cardiovascular phenotypes funded by NIH grant 1RC2 HL02419-01.
This project aims to develop novel and powerful statistical methods for genetic association and interaction analysis of next-generation sequencing data and finding missing heritability unexplained by the current GWAS. Application of these methods to the sequence data will facilitate to identify entire spectrum of genetic variations that influence diseases and provide potential valuable tools for the development of diagnostic and interventional strategies.
|Xu, Kelin; Jin, Li; Xiong, Momiao (2017) Functional regression method for whole genome eQTL epistasis analysis with sequencing data. BMC Genomics 18:385|
|Guo, Shicheng; Li, Yuan; Wang, Yi et al. (2016) Copy Number Variation of HLA-DQA1 and APOBEC3A/3B Contribute to the Susceptibility of Systemic Sclerosis in the Chinese Han Population. J Rheumatol 43:880-6|
|Zhang, Futao; Xie, Dan; Liang, Meimei et al. (2016) Functional Regression Models for Epistasis Analysis of Multiple Quantitative Traits. PLoS Genet 12:e1005965|
|Wang, Panpan; Rahman, Mohammad; Jin, Li et al. (2016) A new statistical framework for genetic pleiotropic analysis of high dimensional phenotype data. BMC Genomics 17:881|
|Zhao, Jinying; Zhu, Yun; Xiong, Momiao (2016) Genome-wide gene-gene interaction analysis for next-generation sequencing. Eur J Hum Genet 24:421-8|
|Jiang, Junhai; Lin, Nan; Guo, Shicheng et al. (2015) Multiple functional linear model for association analysis of RNA-seq with imaging. Quant Biol 3:90-102|
|Lin, Nan; Jiang, Junhai; Guo, Shicheng et al. (2015) Functional Principal Component Analysis and Randomized Sparse Clustering Algorithm for Medical Image Analysis. PLoS One 10:e0132945|
|Li, Lerong; Xiong, Momiao (2015) Dynamic Model for RNA-seq Data Analysis. Biomed Res Int 2015:916352|
|Zhao, Jinying; Zhu, Yun; Boerwinkle, Eric et al. (2015) Pathway analysis with next-generation sequencing data. Eur J Hum Genet 23:507-15|
|Dong, Chengliang; Wei, Peng; Jian, Xueqiu et al. (2015) Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies. Hum Mol Genet 24:2125-37|
Showing the most recent 10 out of 36 publications