There has been a growing interest in investigating genetic architecture of time-varying functional traits such as blood pressure, cholesterol levels or growth rate. Few of the methods proposed in the literature, however, are sufficiently general to apply to complicated situations in a computationally feasible fashion. The goal of this research is to develop general and more powerful statistical methods to map functional quantitative genetic traits. More specifically, in the first step we propose a non-parametric permutation test for overall genetic effect of functional traits by examining familial aggregation. When there is evidence for genetic contribution, the natural second step is to estimate this overall polygenic effect. We then develop methods based on mixed effects models for estimation and use functional principal components analysis to summarize the major temporal variation of the polygenic effect. When the overall genetic effect is reasonably strong, research interest lies in locating influential genes on the genome. In the third step, we propose general functional variance components models to test and estimate quantitative trait locus (QTL) genetic effects using marker genotype data in a genome-wide linkage study. Current ad-hoc methods either uses averages of repeated measurements in a univariate analysis or specifies a parametric form of time- dependent genetic effects in a longitudinal analysis. We propose a family of basis systems to capture genetic effects and estimate age-specific QTL heritability. The flexibility of such basis systems allow for identification of temporal trends of any shape. Within this functional mapping framework, we can answer research questions such as when is a QTL effect expressed to affect a trait, how does gene affect rate of change of traits and so on. Lastly, we propose to investigate our methods using Genetic Analysis Workshop (GAW) 13 simulated data, apply them to the Framingham Heart Study data, and implement them in a software package. Framingham Heart Study is a large prospective study of cardiovascular disease which aims to investigate risk factors and genetic architecture of this disease. The GAW13 simulation data was generated closely based on the Framingham Study, which provides a realistic and valuable resource for methods evaluation and comparison. An application of the developed methods to Framingham data may enhance our understanding of the genetic architecture of cardiovascular disease related traits. The developed software will be made publicly available to all investigators free of charge.

Public Health Relevance

Dissecting genetic determinants of complex time-varying functional traits such as blood pressure, cholesterol levels or growth rate has been one of the most daunting tasks in genetic studies due to complicated nature of their etiology. This project develops new statistical methods to map genetic variants predisposing complex functional traits and applies methods to the Framingham Heart Study data. The study will offer general and more powerful analysis methods for mapping functional quantitative genetic trait and to answer research questions such as when is a gene expressed to affect a trait, how long does genetic effect last, and how does gene affect rate of change of traits.

Agency
National Institute of Health (NIH)
Institute
National Institute on Aging (NIA)
Type
Small Research Grants (R03)
Project #
1R03AG031113-01A2
Application #
7658423
Study Section
Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer
Rossi, Winifred K
Project Start
2009-06-15
Project End
2011-05-31
Budget Start
2009-06-15
Budget End
2010-05-31
Support Year
1
Fiscal Year
2009
Total Cost
$65,723
Indirect Cost
Name
Columbia University (N.Y.)
Department
Biostatistics & Other Math Sci
Type
Schools of Public Health
DUNS #
621889815
City
New York
State
NY
Country
United States
Zip Code
10032
Wang, Yuanjia; Chen, Tianle; Zeng, Donglin (2016) Support Vector Hazards Machine: A Counting Process Framework for Learning Risk Scores for Censored Outcomes. J Mach Learn Res 17:
Wang, Yuanjia; Huang, Chiahui (2012) Semiparametric variance components models for genetic studies with longitudinal phenotypes. Biostatistics 13:482-96
Yang, Qiong; Wang, Yuanjia (2012) Methods for Analyzing Multivariate Phenotypes in Genetic Association Studies. J Probab Stat 2012:652569
Wang, Yuanjia; Chen, Huaihou (2012) On testing an unspecified function through a linear mixed effects model with multiple variance components. Biometrics 68:1113-25
Wang, Yuanjia; Huang, Chiahui; Fang, Yixin et al. (2012) Flexible semiparametric analysis of longitudinal genetic studies by reduced rank smoothing. J R Stat Soc Ser C Appl Stat 61:1-24
Wang, Yuanjia; Garcia, Tanya P; Ma, Yanyuan (2012) Nonparametric estimation for censored mixture data with application to the Cooperative Huntington's Observational Research Trial. J Am Stat Assoc 107:1324-1338
Wang, Yuanjia; Chen, Yin-Hsiu; Yang, Qiong (2012) Joint rare variant association test of the average and individual effects for sequencing studies. PLoS One 7:e32485
Wang, Yuanjia; Chen, Huaihou; Li, Runze et al. (2011) Prediction-based structured variable selection through the receiver operating characteristic curves. Biometrics 67:896-905
Wang, Yuanjia; Chen, Huaihou; Schwartz, Theresa et al. (2011) Assessment of a disease screener by hierarchical all-subset selection using area under the receiver operating characteristic curves. Stat Med 30:1751-60
Chen, Huaihou; Wang, Yuanjia (2011) A penalized spline approach to functional mixed effects model analysis. Biometrics 67:861-70

Showing the most recent 10 out of 16 publications