To help to analyze and understand aging-related "complex" traits that are affected by many genes and environmental factors, we propose to develop three statistical algorithms for the analyses of genome-wide genotyping and high-throughput sequencing studies. Our proposed new statistical methods provide means to analyze additional types of data e.g., mitochondrial DNA (mtDNA) variants from sequencing, or variants on the X chromosome for genome-wide association studies (GWAS) and data with more complicated structures (e.g., multiple related traits). To test these algorithms, we take advantage of the special features of the SardiNIA project (see Annual Report AG000675-07), which has collected longitudinal data for >300 quantitative traits together with the whole-genome genetic data in the founder Sardinia population. To analyze mitochondrial DNA variation and its possible effects on aging-related traits, the genotype-calling and analytic programs developed for nuclear DNA are not adequate, because each cell has 100-10,000 mtDNA copies that can vary at any site (heteroplasmy), and can therefore have each of the 4 bases at any position in various copies. We have developed an algorithm that is targeted to identify variants in mtDNA;it incorporates the sequencing error rate of each base in each sequence read and is flexible to allow for different allele fractions at a variant site across all individuals. Our procedure is further adapted to the circular mitochondrial genome, a key difference from the linear chromosomes assumed by most mapping algorithms. We are assessing homoplasmies and heteroplasmies in mtDNA sequences of lymphocytes from whole-genome sequencing of 2,000 SardiNIA Project participants. The results to date provide information about mtDNA haplogroups and the inheritance of homo- and heteroplasmies in Sardinia. As expected, mothers and their children share essentially all homoplasmies but a lesser proportion of heteroplasmies. The overall heteroplasmy increases with age, but the slope is small in the estimates thus far, yielding an average increase of 1 heteroplasmy between ages 20 and 80 with the minor allele fraction threshold at 4%. To take advantage of correlations between related traits and hence to increase statistical power for genetic studies, we are developing a method to search for genes/variants that have pleiotropic effects on multiple quantitative traits. Our method projects a group of related traits and a set of SNPs (defined, for example, by gene boundaries) into their respective orthogonal principal components such that we are able to jointly test the association between traits and SNPs using a new summary statistic. Because of the orthogonality, the significance of association can be efficiently evaluated using simulations under the null hypothesis instead of more computationally intensive permutations. We apply our method to the SardiNIA project data, where we have first focused on three lipid traits HDL, LDL, and Triglycerides, and use RefSeq gene boundaries to group SNPs into gene units. To show that our method is able to identify genes that are associated with more than one blood lipid trait, we use results published in Teslovich et al., Nature (2010), in which 95 loci for blood lipids were identified and 21 loci were associated with multiple lipid traits. We are able to show that our method can enrich those pleiotropic loci: for example, when the 95 loci are ranked by our method, the top 20% loci include 40% of the pleiotropic loci in the original study. Our method is able to detect joint associations between multiple traits and multiple genetic variants. It will have significant advantages when a gene has moderate effects on multiple traits that a standard GWAS is unable to detect.

National Institute of Health (NIH)
National Institute on Aging (NIA)
Investigator-Initiated Intramural Research Projects (ZIA)
Project #
Application #
Study Section
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
National Institute on Aging
Zip Code
van den Berg, Stéphanie M; de Moor, Marleen H M; Verweij, Karin J H et al. (2016) Meta-analysis of Genome-Wide Association Studies for Extraversion: Findings from the Genetics of Personality Consortium. Behav Genet 46:170-82
(2016) Genome-wide association study identifies 74 loci associated with educational attainment. Nature 533:539-42
Okbay, Aysu; Baselmans, Bart M L; De Neve, Jan-Emmanuel et al. (2016) Genetic variants associated with subjective well-being, depressive symptoms, and neuroticism identified through genome-wide analyses. Nat Genet 48:624-33
Ding, Jun; Sidore, Carlo; Butler, Thomas J et al. (2015) Assessing Mitochondrial DNA Variation and Copy Number in Lymphocytes of ~2,000 Sardinians Using Tailored Sequencing Analysis Tools. PLoS Genet 11:e1005306
Terracciano, Antonio; Strait, James; Scuteri, Angelo et al. (2014) Personality traits and circadian blood pressure patterns: a 7-year prospective study. Psychosom Med 76:237-43
Pelosi, Emanuele; Omari, Shakib; Michel, Marc et al. (2013) Constitutively active Foxo3 in oocytes preserves ovarian reserve in mice. Nat Commun 4:1843
Meirelles, Osorio D; Ding, Jun; Tanaka, Toshiko et al. (2013) SHAVE: shrinkage estimator measured for multiple visits increases power in GWAS of quantitative traits. Eur J Hum Genet 21:673-9
Hek, Karin; Demirkan, Ayse; Lahti, Jari et al. (2013) A genome-wide association study of depressive symptoms. Biol Psychiatry 73:667-78
Zhang, Mingfeng; Liang, Liming; Morar, Nilesh et al. (2012) Integrating pathway analysis and genetics of gene expression for genome-wide association study of basal cell carcinoma. Hum Genet 131:615-23
Voight, Benjamin F; Kang, Hyun Min; Ding, Jun et al. (2012) The metabochip, a custom genotyping array for genetic studies of metabolic, cardiovascular, and anthropometric traits. PLoS Genet 8:e1002793

Showing the most recent 10 out of 12 publications