Previously fellow Shashaank Vattikuti and I analyzed the entire GWAS SNP marker set simultaneously with respect to the genetic contribution to obesity and type II diabetes traits. We used a mixed-effects linear model of all the SNPs to estimate the heritability of a set of traits. In addition, we also estimated the genetic contribution that is shared between traits. We showed that approximately half of the known heritability estimated using classical methods are captured collectively by the common SNPs. Previously, only a small fraction of the heritability could be explained from sets of single SNPs, which led to what has been called the problem of missing heritability. Our work showed that the heritability is not missing but merely hidden in the noise. We also showed that the heritability estimated by the SNPs increases with the number of SNPs, which also indicates that the genetic information may be spread over large segments of the genome. We are continuing the work by validating in other data sets and to look for more specific large scale patterns of in the markers for each phenotype in these data sets as was specified in our original request. With James Lee, I showed that the mixed-effects loses validity when the GWAS markers have too much linkage disequilibrium. With Stephen Hsu, Shashaank Vattikuti and James Lee, we applied the statistical theory of compressed sensing to analyzing GWAS data. This is a method that can find sparse nonzero regression coefficients in data sets where the number of parameters far exceeds the number of samples, which is not possible using classical linear regression methods. More importantly, there is a sharp transition from poor to good recovery as the sample size is increased. We use this transition in an algorithm to extract trait associated loci with high confidence. We can also estimate a lower bound on the number of loci associated with a given trait. We are now applying the method to case and control traits. I have also participated in a collaborative project to update the software tool Plink.

Project Start
Project End
Budget Start
Budget End
Support Year
4
Fiscal Year
2015
Total Cost
Indirect Cost
Name
U.S. National Inst Diabetes/Digst/Kidney
Department
Type
DUNS #
City
State
Country
Zip Code
Lee, James J; Vattikuti, Shashaank; Chow, Carson C (2016) Uncovering the Genetic Architectures of Quantitative Traits. Comput Struct Biotechnol J 14:28-34
Chang, Christopher C; Chow, Carson C; Tellier, Laurent Cam et al. (2015) Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4:7
Lee, James J; Chow, Carson C (2014) Conditions for the validity of SNP-based heritability estimation. Hum Genet 133:1011-22
Vattikuti, Shashaank; Lee, James J; Chang, Christopher C et al. (2014) Applying compressed sensing to genome-wide association studies. Gigascience 3:10
Lee, James J; Chow, Carson C (2013) The causal meaning of Fisher's average effect. Genet Res (Camb) 95:89-109
Vattikuti, Shashaank; Guo, Juen; Chow, Carson C (2012) Heritability and genetic correlations explained by common SNPs for metabolic syndrome traits. PLoS Genet 8:e1002637