This project is about the development of better statistical methods to dissect complex trait variation and to predict outcome from genome-wide marker data. It anticipates that individual risk prediction for disease will become an integral part of Genomic Medicine in the USA and elsewhere. To predict an individual's risk of disease from genetic data it is not necessary to have identified the causal variant or fully understand the biology - all that is needed is a predictor that is correlated with outcome. The statistically best predictor depends on the genetic architecture of the trait: the distribution of effect sizes of causal variants, the distribution of their allele frequency, and the correlation between the two. Therefore, methods to better understand the genetic architecture of complex traits will lead to better statistical prediction methods and the performance of prediction methods will lead to new inference on genetic architecture. We will develop, test and apply statistical genetic methods that utilize whole-genome genotype or sequence data from population based samples that have also been phenotyped for one or more complex traits, estimate locus-specific, chromosome-wide and whole genome matrices of genetic covariance between all pairs of individuals, and estimate variance components associated with these. We will use the results and those from large genomewide association studies to estimate the distribution of SNP and chromosome segment effects by fitting mixture models using an EM-algorithm. We will use simulation models to calibrate the observed distribution of risk allele frequencies for disease with evolutionary models that include the mode of natural selection and pleiotropic relationships in effects on fitness and disease as parameters. We will develop and test Bayesian and non-Bayesian statistical linear mixed models that utilize all available genetic data simultaneously to predict an individual's risk of disease. We will implement prediction methods using data from the Program Grant investigators, from large international research consortia and from data in the public domain, and test their efficiency by correlating outcome with predictors in independent data sets.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Program Projects (P01)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-GGG-M)
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Washington
United States
Zip Code
Chen, Guo-Bo; Lee, Sang Hong; Brion, Marie-Jo A et al. (2014) Estimation and partitioning of (co)heritability of inflammatory bowel disease from GWAS and immunochip data. Hum Mol Genet 23:4710-20
(2014) Defining the role of common variation in the genomic and biological architecture of adult human height. Nat Genet 46:1173-86
Marigorta, Urko M; Gibson, Greg (2014) A simulation study of gene-by-environment interactions in GWAS implies ample hidden effects. Front Genet 5:225
Hemani, Gibran; Shakhbazov, Konstantin; Westra, Harm-Jan et al. (2014) Detection and replication of epistasis influencing transcription in humans. Nature 508:249-53
Robinson, Matthew R; Wray, Naomi R; Visscher, Peter M (2014) Explaining additional genetic variation in complex traits. Trends Genet 30:124-32
Zheng, Chaozhi; Kuhner, Mary K; Thompson, Elizabeth A (2014) Joint inference of identity by descent along multiple chromosomes from population samples. J Comput Biol 21:185-200
Yang, Jian; Zaitlen, Noah A; Goddard, Michael E et al. (2014) Advantages and pitfalls in the application of mixed-model association methods. Nat Genet 46:100-6
Gratten, Jacob; Wray, Naomi R; Keller, Matthew C et al. (2014) Large-scale genomics unveils the genetic architecture of psychiatric disorders. Nat Neurosci 17:782-90
Zheng, Chaozhi; Kuhner, Mary K; Thompson, Elizabeth A (2014) Bayesian inference of local trees along chromosomes by the sequential Markov coalescent. J Mol Evol 78:279-92
Preeprem, Thanawadee; Gibson, Greg (2014) SDS, a structural disruption score for assessment of missense variant deleteriousness. Front Genet 5:82

Showing the most recent 10 out of 27 publications