Understanding the association between risk factors and chronic diseases is crucially important for improvement in treatment and patient care. Population based data registries for chronic disease are often available and provide excellent platforms to serve this purpose. A crucial aspect of traditional regression modeling is the assumption that the effect of each risk factor is constant. However, there is growing evidence that the etiology of many chronic diseases is complex and the influence of a risk factor on disease outcomes may not be constant. Through our ongoing collaborative work on Cystic Fibrosis Foundation patient registry (CFFPR), the investigators of this grant have demonstrated that varying coefficient regression, termed as dynamic regression in survival analysis (Martinussen and Scheike, 2006), provides a powerful tool for discovering important changes in associations between risk factors and CF outcomes (time to, and frequency of, major CF events). This research project is motivated by several important unsolved, open questions arising from CFFPR: (i) competing risk, double censoring and left truncation to the observation of CF events, inherited with the design of CFFPR;(ii) the high dimensional covariates, which necessitate the development of variable selection procedures that accommodate varying covariate effects, (iii) the large sample size, which demands efficient computation;(iv) the need of utilizing time-dependent follow-up information to aid in disease prognosis and address substantive scientific questions. Current dynamic regression approaches have several limitations: inability to accommodate complex features of data, difficulties in interpretation and prediction, computational issues. Moreover, there is very limited work on variable selection under survival varying coefficient models. The overall objective of this proposal is to develop a comprehensive dynamic regression framework that resolves the key limitations of the existing approaches and possesses the capacity to handle many realistic data-related issues. To accomplish this goal, we will first lay out a unified framework of survival dynamic regression by introducing sensible modeling and developing inferential procedures that account for common survival data features (Aim 1). We will tackle the challenging problem of high dimensional dynamic regression (Aim 2), where the existing methods that assume constant effects can have poor performance. We will propose a seminal dynamic regression strategy for investigating the relationship between time-dependent covariates and survival outcomes with sensible interpretations and predictions permitted (Aim 3). The proposed statistical methods will be applied to CFFPR (Aim 4) and user-friendly software will be develop and made available to general research communities (Aim 5). Methodological development proposed in this grant will have a broad impact on scientific investigations not only on CFFPR but also on other registry based chronic disease studies.

Public Health Relevance

We propose statistical methods to identify risk factors for chronic disease studies, such as Cystic Fibrosis. These methods will enhance the understanding of the mechanism and prognosis of chronic diseases that will lead to improved disease treatment and patient care.

National Institute of Health (NIH)
National Heart, Lung, and Blood Institute (NHLBI)
Research Project (R01)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-HDM-T (90))
Program Officer
Banks-Schlegel, Susan P
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Emory University
Biostatistics & Other Math Sci
Schools of Public Health
United States
Zip Code
Ma, Huijuan; Peng, Limin; Zhang, Zhumin et al. (2018) Generalized accelerated recurrence time model for multivariate recurrent event data with missing event type. Biometrics 74:954-965
Yang, Jing; Peng, Limin (2018) Estimating cross quantile residual ratio with left-truncated semi-competing risks data. Lifetime Data Anal 24:652-674
Zheng, Qi; Peng, Limin; He, Xuming (2018) HIGH DIMENSIONAL CENSORED QUANTILE REGRESSION. Ann Stat 46:308-343
Liu, Shuling; Manatunga, Amita K; Peng, Limin et al. (2017) A joint modeling approach for multivariate survival data with random length. Biometrics 73:666-677
Ong, Thida; Schechter, Michael; Yang, Jing et al. (2017) Socioeconomic Status, Smoke Exposure, and Health Outcomes in Young Children With Cystic Fibrosis. Pediatrics 139:
Zheng, Qi; Peng, Limin (2017) Consistent model identification of varying coefficient quantile regression with BIC tuning parameter selection. Commun Stat Theory Methods 46:1031-1049
Rahman, Akm Fazlur; Peng, Limin; Manatunga, Amita et al. (2017) Nonparametric Regression Method for Broad Sense Agreement. J Nonparametr Stat 29:280-300
Li, Ruosha; Peng, Limin (2017) Assessing quantile prediction with censored quantile regression models. Biometrics 73:517-528
Limoli, D H; Yang, J; Khansaheb, M K et al. (2016) Staphylococcus aureus and Pseudomonas aeruginosa co-infection is associated with cystic fibrosis-related diabetes and poor clinical outcomes. Eur J Clin Microbiol Infect Dis 35:947-53
Sun, Xiaoyan; Peng, Limin; Manatunga, Amita et al. (2016) Quantile regression analysis of censored longitudinal data with irregular outcome-dependent follow-up. Biometrics 72:64-73

Showing the most recent 10 out of 20 publications