Infection and cardiovascular disease are two main sources of mortality in the dialysis population. Even though acute infections have been associated with an increased risk of myocardial infarction and stroke in the general population, the extent to which infection is a contributing factor to increased risk of cardiovascular events longitudinally in the dialysis population is largely unknown. The largest source of research data for the dialysis population is the United States Renal Data System database, which contains hospitalization records of nearly all patients on maintenance dialysis. Our long-term goal is to study the dynamic association of cardiovascular events and various contributing risk factors, particularly infection. Towards this goal, we will develop generalized semiparametric regression models to study trends over time generally, over time (years) on dialysis and over age, specifically. Determining the age- and time-dependent association between infection and the occurrence of cardiovascular events and obtaining the predicted subject- specific risk trajectory (probability) of cardiovascular events based on predictors, for instance, from the previous one to three months (i.e., time-lagged prediction) are critical steps towards the development of targeted intervention strategies in the US dialysis population. Innovation. The main challenge towards this goal is the lack of methods able to handle the extreme/ challenging structure of the longitudinal data available for analysis, characterized by extreme- (ultra-) sparsity, unsynchronized measurements, and imprecision/measurement error. This results from data collected on patient hospitalization records, which is extremely irregular and infrequent. In addition, longitudinal clinical inflammatory markers data (available for a subset of the USRDS cohort) are at unsynchronized time points with the outcome, possibly contaminated with measurement error. Currently there are no existing methods for generalized semiparametric regression modeling of longitudinal binary outcome (e.g., occurrence of cardiovascular events) or modeling of count/rate outcome that can handle 1) irregular, 2) infrequent, 3) unsynchronized and 4) error-prone longitudinal data.
Aims. The proposed research will fill this gap, by developing new estimation &inference procedures for generalized semiparametric regression models (GSRMs) for longitudinal data under these emerging challenges using functional data analysis (FDA). This will be achieved through the following specific aims: 1) Develop a unified functional analysis framework for estimation and inference for GSRMs, including generalized and generalized partial linear varying coefficient models, for highly irregular, infrequent, unsynchronized and noise-contaminated longitudinal data;2) Develop methods to predict subject-specific response trajectories;3) Characterize the efficiency of our proposed FDA approach. Furthermore, these methods will be used to determine, for the first time, the cardiovascular-infection risk longitudinal dynamics in the dialysis population.

Public Health Relevance

burden directly related to infection and cardiovascular disease in the dialysis population is substantial. The proposal involves developing the necessary estimation and inference framework to use the United States Renal Data System database in modeling age- and time-varying dynamics of the association between cardiovascular events and various contributing risk factors including infection. Understanding this cardiovascular-infection risk dynamics in patients over time is important to the development of targeted intervention strategies in the US dialysis population.

National Institute of Health (NIH)
National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK)
Research Project (R01)
Project #
Application #
Study Section
Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer
Flessner, Michael Francis
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of California Los Angeles
Biostatistics & Other Math Sci
Schools of Public Health
Los Angeles
United States
Zip Code
Dalrymple, Lorien S; Johansen, Kirsten L; Romano, Patrick S et al. (2014) Comparison of hospitalization rates among for-profit and nonprofit dialysis facilities. Clin J Am Soc Nephrol 9:73-81
Estes, Jason P; Nguyen, Danh V; Dalrymple, Lorien S et al. (2014) Cardiovascular event risk dynamics over time in older patients on dialysis: a generalized multiple-index varying coefficient model approach. Biometrics 70:754-64
Kürüm, Esra; Li, Runze; Wang, Yang et al. (2014) Nonlinear Varying Coefficient Models with Applications to Studying Photosynthesis. J Agric Biol Environ Stat 19:57-81
Sentürk, Damla; Dalrymple, Lorien S; Nguyen, Danh V (2014) Functional linear models for zero-inflated count data with application to modeling hospitalizations in patients on dialysis. Stat Med 33:4825-40
Sentürk, Damla; Ghosh, Samiran; Nguyen, Danh V (2014) Exploratory time varying lagged regression: modeling association of cognitive and functional trajectories with expected clinic visits in older adults. Comput Stat Data Anal 73:1-15
Sentürk, Damla; Dalrymple, Lorien S; Mu, Yi et al. (2014) Weighted hurdle regression method for joint modeling of cardiovascular events likelihood and rate in the US dialysis population. Stat Med 33:4387-401
Senturk, Damla; Dalrymple, Lorien S; Mohammed, Sandra M et al. (2013) Modeling time-varying effects with generalized and unsynchronized longitudinal data. Stat Med 32:2971-87
Mohammed, Sandra M; Dalrymple, Lorien S; Senturk, Damla et al. (2013) Naive hypothesis testing for case series analysis with time-varying exposure onset measurement error: inference for infection-cardiovascular risk in patients on dialysis. Biometrics 69:520-9
Mohammed, Sandra M; Dalrymple, Lorien S; Senturk, Damla et al. (2013) Design considerations for case series models with exposure onset measurement error. Stat Med 32:772-86