Statistical Methods for Modeling Polygenic Architecture in Association and Re-sequencing Studies

Zhou, Xiang

Abstract

Array- and sequencing-based association studies have identified many loci harboring genetic variants associated with complex traits and common diseases. Altogether, these associated variants only explain a small proportion of heritability, suggesting that most traits and diseases have a polygenic background and are influenced by many variants with small effects. Early attempts to model polygenic complex traits, notably via the linear mixed models (LMMs) and the best linear unbiased predictor (BLUP), have shown promising outcomes for estimating chip heritability, identifying causal variants, and predicting disease risks. However, statistical methods for modeling polygenic architecture remain in their infancy. In particular, existing methods rely on simple effect size assumptions, are not flexible nor adaptive to the underlying genetic architecture of a given trait or disease, and hence cannot take full advantage of the polygenic natural of most traits and diseases. To increase the power of association test and enable more precise phenotype and risk prediction, I propose to develop a suite of novel statistical methods to accurately and flexibly model the polygenic architecture. These new methods will facilitate evaluation and integration of variant functional annotations, multiple phenotype association mapping, and phenotype and risk prediction in association studies. In particular, we will (1) develop methods to evaluate and integrate variant genomic functional annotations to better understand the polygenic architecture of traits and diseases, and enable powerful association mapping; (2) develop strategies for association mapping with multiple correlated phenotypes to identify pleiotropic associations by taking advantage of the shared polygenic background among phenotypes; and (3) develop methods to flexibly model polygenic architecture and use all variants jointly to achieve accurate phenotype and risk prediction. We will develop efficient algorithms to accompany these methods and implement them in free open-source software. We will perform rigorous simulations and comparisons to evaluate our methods. Finally, we will perform in- depth analysis on several large-scale real data sets, including data from the Global Lipids Genetics Consortium, T2D-GENES and METSIM projects, to demonstrate the power of the proposed methods.

Public Health Relevance

We propose to develop new statistical methods to identify causal variants and predict disease risks for array- and sequencing-based association studies. To increase power of association test and enable more precise phenotype and risk prediction, we will take advantage of the polygenic natural of most traits and diseases by accurately and flexibly modeling the underlying polygenic architecture. These new methods will facilitate integrative analysis with variant functional annotations, multiple phenotype association mapping, and phenotype and risk prediction in association studies. Application of the methods to array-based and whole genome sequencing-based association studies will help identify new associations and facilitate the development of precision medicine.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Human Genome Research Institute (NHGRI)
Type: Research Project (R01)
Project #: 5R01HG009124-03
Application #: 9692738
Study Section: Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer: Sofia, Heidi J

Project Start: 2017-06-14
Project End: 2022-04-30
Budget Start: 2019-05-01
Budget End: 2020-04-30
Support Year: 3
Fiscal Year: 2019
Total Cost
Indirect Cost

Institution

Name: University of Michigan Ann Arbor
Department
Type: Schools of Public Health
DUNS #: 073133571

City: Ann Arbor
State: MI
Country: United States
Zip Code: 48109

Related projects


NIH 2020 R01 HG	Statistical Methods for Modeling Polygenic Architecture in Association and Re-sequencing Studies Zhou, Xiang / University of Michigan Ann Arbor
NIH 2019 R01 HG	Statistical Methods for Modeling Polygenic Architecture in Association and Re-sequencing Studies Zhou, Xiang / University of Michigan Ann Arbor
NIH 2018 R01 HG	Statistical Methods for Modeling Polygenic Architecture in Association and Re-sequencing Studies Zhou, Xiang / University of Michigan Ann Arbor
NIH 2017 R01 HG	Statistical Methods for Modeling Polygenic Architecture in Association and Re-sequencing Studies Zhou, Xiang / University of Michigan Ann Arbor	$338,886

Publications

Zeng, Ping; Hao, Xingjie; Zhou, Xiang (2018) Pleiotropic mapping and annotation selection in genome-wide association studies with penalized Gaussian mixture models. Bioinformatics 34:2797-2807

Chen, Mengjie; Zhou, Xiang (2018) VIPER: variability-preserving imputation for accurate gene expression recovery in single-cell RNA sequencing studies. Genome Biol 19:196

Hao, Xingjie; Zeng, Ping; Zhang, Shujun et al. (2018) Identifying and exploiting trait-relevant tissues with multiple functional annotations in genome-wide association studies. PLoS Genet 14:e1007186

Zeng, Ping; Zhou, Xiang (2017) Non-parametric genetic prediction of complex traits with latent Dirichlet process regression models. Nat Commun 8:456

Zhou, Xiang (2017) A UNIFIED FRAMEWORK FOR VARIANCE COMPONENT ESTIMATION WITH SUMMARY STATISTICS IN GENOME-WIDE ASSOCIATION STUDIES. Ann Appl Stat 11:2027-2051

Sun, Shiquan; Hood, Michelle; Scott, Laura et al. (2017) Differential expression analysis for RNAseq using Poisson mixed models. Nucleic Acids Res 45:e106

Crawford, Lorin; Zeng, Ping; Mukherjee, Sayan et al. (2017) Detecting epistasis with the marginal epistasis test in genetic mapping studies of quantitative traits. PLoS Genet 13:e1006869

Yang, Jingjing; Fritsche, Lars G; Zhou, Xiang et al. (2017) A Scalable Bayesian Method for Integrating Functional Information in Genome-wide Association Studies. Am J Hum Genet 101:404-416

Comments

Be the first to comment on this grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: