The broad, long-term objectives of this research are the developments of semiparametric regression models and associated inferential and computational methods for the analysis of censored failure time data com- monly encountered in medical studies.
The specific aims of the extension period include:1) to assess the predictive accuracy of clinical and genetic variables in predicting time to disease occurrence or death and to quantify the impact of genetic mutations and environmental exposures on the population over time;2) to stu- dy a broad class of mixture cure models that combines a binary regression model for the cure probability with a generalized Cox model for the failure times of the uncured individuals;3) to construct kernel-based es- timation methods for outcome-dependent two-stage designs, such as case-cohort and nested case-control studies;4) to pursue variable selection strategies for generalized Cox models and accelerated failure time models;5) to extend the Cox proportional hazards model to accommodate nonproportional hazards structures by allowing the regression coefficients to vary over time or to change from one value to another at a certain time point;6) to explore empirical likelihood methods for utilizing auxiliary baseline covariate infor- mation to improve the efficiency of treatment comparisons in randomized clinical trials;and 7) to study a broad class of semiparametric regression models for spatially correlated failure time data. All these aims are built on the observations and ideas that have been generated during the MERIT award period and address the most timely and important issues in medical research. In each specific aim, valid and efficient statistical methods will be constructed and their theoretical properties be rigorously established. Efficient and reliable numerical algorithms will be devised to implement the corresponding inference procedures. The performance of the numerical and inferential procedures will be assessed through extensive simulation studies. Applica- tions to a variety of clinical, epidemiological and genetic studies will be provided. User-friendly, open-source software will developed and disseminated. This research will yield novel and powerful statistical and commputational tools that can be readily used by medical investigators.

Public Health Relevance

The ultimate goal of medical research is to prevent disease and prolong life. The times to disease occur- rence or death are not fully observed for all study subjects. The proposed research will produce novel and powerful statistical and computational tools to assess the effects of covariates (e.g., treatments, environmental exposures, and genetic variants) on such incompletely observed failure times.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Method to Extend Research in Time (MERIT) Award (R37)
Project #
Application #
Study Section
Special Emphasis Panel (NSS)
Program Officer
Marcus, Stephen
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of North Carolina Chapel Hill
Biostatistics & Other Math Sci
Schools of Public Health
Chapel Hill
United States
Zip Code
Liu, Jingchen; Ying, Zhiliang; Zhang, Stephanie (2015) A Rate Function Approach to Computerized Adaptive Testing for Cognitive Diagnosis. Psychometrika 80:468-90
Chen, Qingxia; Zeng, Donglin; Ibrahim, Joseph G et al. (2015) Quantifying the average of the time-varying hazard ratio via a class of transformations. Lifetime Data Anal 21:259-79
Zeng, Donglin; Lin, D Y (2014) Efficient Estimation of Semiparametric Transformation Models for Two-Phase Cohort Studies. J Am Stat Assoc 109:371-383
Hu, Wenrong; Cai, Jianwen; Zeng, Donglin (2014) Sample size/power calculation for stratified case-cohort design. Stat Med 33:3973-85
Xu, Gongjun; Sen, Bodhisattva; Ying, Zhiliang (2014) Bootstrapping a change-point Cox model for survival data. Electron J Stat 8:1345-1379
Lin, Dan-Yu; Tao, Ran; Kalsbeek, William D et al. (2014) Genetic association analysis under complex survey sampling: the Hispanic Community Health Study/Study of Latinos. Am J Hum Genet 95:675-88
Chen, Ming-Hui; Ibrahim, Joseph G; Zeng, Donglin et al. (2014) Bayesian design of superiority clinical trials for recurrent events data with applications to bleeding and transfusion events in myelodyplastic syndrome. Biometrics 70:1003-13
Chen, Tianle; Wang, Yuanjia; Chen, Huaihou et al. (2014) Targeted Local Support Vector Machine for Age-Dependent Classification. J Am Stat Assoc 109:1174-1187
Zhou, Haibo; Xu, Wangli; Zeng, Donglin et al. (2014) Semiparametric Inference for Data with a Continuous Outcome from a Two-Phase Probability Dependent Sampling Scheme. J R Stat Soc Series B Stat Methodol 76:197-215
Lin, D Y (2014) Survival analysis with incomplete genetic data. Lifetime Data Anal 20:16-22

Showing the most recent 10 out of 26 publications