The overall objective of the proposed research is to significantly improve the quality of health forecasting for the US elderly. This objective will be reached by constructing a set of new health prediction models of differing complexity, evaluating the quality of their predictions, and using the verified models to predict the future prevalence of cancer, coronary heart disease (CHD), stroke, diabetes, and Alzheimer's disease (AD) under different scenarios. The models will use information about factors affecting health and survival available in five datasets: the Framingham Heart Study (FHS), the Health and Retirement Study merged with Medicare files (HRS-M), the National Long Term Care Survey linked to Medicare records (NLTCS-M), the Surveillance, Epidemiology, and End Results data merged with Medicare records (SEER-M), and the 5% Medicare (5%-M) file. The most sophisticated models will use information about genetic and non-genetic factors and will take into account pleiotropic, polygenic, and age-specific effects of genes on health and survival, as well as dynamic mechanisms of aging-related changes. The following specific aims will be addressed:
1. Predict age patterns of prevalence for cancer, CHD, stroke, diabetes, and AD for the years 2020, 2025, 2030, and 2035, for males and females and under different scenarios, using models of differing complexity constructed using data from the SEER-M and 5%-M files, NLTCS-M, and HRS-M (without genetic data).
2. Identify sets of genetic variants showing individual and pleiotropic associations with health and survival traits in the FHS and HRS-M data using candidate genomic regions enriched for pleiotropic genetic effects on health traits. Identify genes related to the selected genetic variants and evaluate their roles in metabolic and signaling pathways and disease networks. Construct polygenic score indices and evaluate their influence on health and survival traits.
3. Predict age patterns of prevalence for the same diseases and time horizons as in Aim 1, but applying advanced modeling approaches that incorporate genetic information about pleiotropic, polygenic, and age-specific effects of genetic variants on health and survival, under different scenarios. Test the quality of health predictions using subsets of the available data. Use the verified models in health forecasting for the time horizons specified above.
4. Predict age patterns of prevalence of the diseases listed above using extended multistate health and mortality models that treat risks of health transitions as functions of genetic factors, observed covariates, and physiological variables. For this purpose, evaluate risks of transitions and their time trends for subsequent birth cohorts using FHS and HRS-M data. Test the quality of health predictions using subsets of the available data. Use the verified models in health forecasting under different scenarios. Compare the health predictions of the different models constructed in this project with each other and with models available in the literature. Make recommendations concerning the proper use of data and models in health forecasting for the time horizons specified above.

Public Health Relevance

The results of these analyses will clarify the roles of genetic mechanisms in shaping health and longevity traits, reduce uncertainty in health forecasting, and help improve the functioning of the health care system, which in turn will improve the health of the US elderly population.

National Institutes of Health (NIH)
National Institute on Aging (NIA)
Research Project (R01)
Study Section: Special Emphasis Panel (ZRG1)
Program Officer: King, Jonathan W
Duke University
Organized Research Units
United States
Zhbannikov, Ilya Y; Arbeev, Konstantin G; Yashin, Anatoliy I (2017) rqt: an R package for gene-level meta-analysis. Bioinformatics 33:3129-3130
Zhbannikov, Ilya Y; Arbeev, Konstantin G; Yashin, Anatoliy I (2017) cophesim: a comprehensive phenotype simulator for testing novel association methods. F1000Res 6:1294
Akushevich, I; Yashkin, A P; Kravchenko, J et al. (2017) Theory of partitioning of disease prevalence and mortality in observational data. Theor Popul Biol 114:117-127
Yashin, Anatoliy I; Fang, Fang; Kovtun, Mikhail et al. (2017) Hidden heterogeneity in Alzheimer's disease: Insights from genetic association studies and other analyses. Exp Gerontol :
Zhbannikov, Ilya Y; Arbeev, Konstantin; Akushevich, Igor et al. (2017) stpm: an R package for stochastic process model. BMC Bioinformatics 18:125
Yashin, Anatoliy I; Zhbannikov, Ilya; Arbeeva, Liubov et al. (2016) Pure and Confounded Effects of Causal SNPs on Longevity: Insights for Proper Interpretation of Research Findings in GWAS of Populations with Different Genetic Structures. Front Genet 7:188
Yashin, Anatoliy I; Arbeev, Konstantin G; Arbeeva, Liubov S et al. (2016) How the effects of aging and stresses of life are integrated in mortality rates: insights for genetic studies of human health and longevity. Biogerontology 17:89-107
Yashin, Anatoliy I; Arbeev, Konstantin G; Wu, Deqing et al. (2016) How Genes Modulate Patterns of Aging-Related Changes on the Way to 100: Biodemographic Models and Methods in Genetic Analyses of Longitudinal Data. N Am Actuar J 20:201-232
Ukraintseva, Svetlana; Yashin, Anatoliy; Arbeev, Konstantin et al. (2016) Puzzling role of genetic risk factors in human longevity: "risk alleles" as pro-longevity variants. Biogerontology 17:109-27
Arbeev, Konstantin G; Cohen, Alan A; Arbeeva, Liubov S et al. (2016) Optimal Versus Realized Trajectories of Physiological Dysregulation in Aging and Their Relation to Sex-Specific Mortality Risk. Front Public Health 4:3
