This subproject is one of many research subprojects utilizing the resources provided by a Center grant funded by NIH/NCRR. The subproject and investigator (PI) may have received primary funding from another NIH source, and thus could be represented in other CRISP entries. The institution listed is for the Center, which is not necessarily the institution for the investigator. To overcome the """"""""spurious"""""""" association caused by population stratification in population-based association studies, we propose a principal component based method that can utilize both family and unrelated samples at the same time. More specifically, we adapt a multivariate logistic model, which is often used in segregation analysis and can allow for the family correlation structure, for association analysis. To correct the effect of hidden population structure, the first ten principal components calculated from the matrix of marker genotype data are incorporated as covariates in the model. To test for the association, the marker of interest is also incorporated as a covariate in the model. We applied the proposed method to the second generation, i.e. the Offspring Cohort, in the GAW16 Framingham Heart Study 50k Data Set to evaluate the performance of the method. Although there may have been difficulty in the convergence while maximizing the likelihood function, as indicated by a flat likelihood, the distribution of the empirical P-values for the test statistic does show that the method has a correct Type I error rate whenever the variance-covariance matrix of the estimates can be computed.
Showing the most recent 10 out of 922 publications