To help to analyze and understand aging-related """"""""complex"""""""" traits that are affected by many genes and environmental factors, we propose to develop three statistical algorithms for the analyses of genome-wide genotyping and high-throughput sequencing studies. To test these algorithms, we take advantage of the special features of the SardiNIA project (see Z01 AG000675-06), which has collected longitudinal data for >300 quantitative traits together with the whole-genome genetic data in the founder Sardinia population. To analyze mitochondrial DNA variation and its possible effects on aging-related traits, the genotype-calling and analytic programs developed for nuclear DNA are not adequate, because each cell has 100-10,000 mtDNA copies that can vary at any site (heteroplasmy), and can therefore have each of the 4 bases at any position in various copies. We have developed an algorithm that is specific to identify variants in mtDNA;it incorporates the sequencing error rate at each base in each sequence read and has the flexibility to allow for different allele fractions at a variant site across all individuals. It has thus far been successful in assessing homoplasmies in the mtDNA sequence from 1,000 sequenced individuals. Especially because the Sardinian cohort is highly inter-related, we have also been able to distinguish newly-arising variants in children compared to their mothers and other relatives. To take advantage of repeated visits, which can increase the accuracy of data and thereby provide more highly significant results with a given size sample, we have, instead of using the average of multiple measurements, developed an empirical Bayes shrinking estimator that uses weighted estimates from all measurements. Simulations and analysis of real data from the SardiNIA data set show that combining values from repeated visits in an association study yields an increase in GWAS signals compared to using a single visit for measures of many traits at 3 visits over a 10-year period. Furthermore, we have showed that in unbalanced data sets (that is, with different individuals having different numbers of visits), the shrinking estimator further improves GWAS signals relative to the average. This work has been accepted for publication in the European Journal of Human Genetics. To take advantage of correlations between related traits and hence to increase statistical power for genetic studies, in ongoing work, we are developing a method to search for genes/variants that have pleiotropic effects on multiple quantitative traits.
Showing the most recent 10 out of 14 publications