The broad, long-term objective of this research is the development of semiparametric regression models and associated inferential and computational methods for the analysis of censored failure time data commonly encountered in medical studies.
The specific aims of the extension period are: 1) to assess the accuracy of clinical and genetic variables in predicting time to disease occurrence or death and to quantify the impact of genetic mutations and environmental exposures on the population over time; 2) to study a broad class of mixture cure models that combines a binary regression model for the cure probability with a generalized Cox model for the failure times of the uncured individuals; 3) to construct kernel-based estimation methods for outcome-dependent two-stage designs, such as case-cohort and nested case-control studies; 4) to pursue variable selection strategies for generalized Cox models and accelerated failure time models; 5) to extend the Cox proportional hazards model to accommodate nonproportional hazards structures by allowing the regression coefficients to vary over time or to change from one value to another at a certain time point; 6) to explore empirical likelihood methods for utilizing auxiliary baseline covariate information to improve the efficiency of treatment comparisons in randomized clinical trials; and 7) to study a broad class of semiparametric regression models for spatially correlated failure time data. All these aims build on the observations and ideas generated during the MERIT award period and address timely and important issues in medical research. For each specific aim, valid and efficient statistical methods will be constructed and their theoretical properties will be rigorously established. Efficient and reliable numerical algorithms will be devised to implement the corresponding inference procedures. The performance of the numerical and inferential procedures will be assessed through extensive simulation studies. Applications to a variety of clinical, epidemiological and genetic studies will be provided. User-friendly, open-source software will be developed and disseminated.
This research will yield novel and powerful statistical and computational tools that can be readily used by medical investigators.

Public Health Relevance

The ultimate goal of medical research is to prevent disease and prolong life. The times to disease occurrence or death are not fully observed for all study subjects. The proposed research will produce novel and powerful statistical and computational tools to assess the effects of covariates (e.g., treatments, environmental exposures, and genetic variants) on such incompletely observed failure times.

National Institutes of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Method to Extend Research in Time (MERIT) Award (R37)
Study Section
Special Emphasis Panel (NSS)
Program Officer
Marcus, Stephen
University of North Carolina Chapel Hill
Biostatistics & Other Math Sci
Schools of Public Health
Chapel Hill
United States
Wang, Yuanjia; Fu, Haoda; Zeng, Donglin (2018) Learning Optimal Personalized Treatment Rules in Consideration of Benefit and Risk: with an Application to Treating Type 2 Diabetes Patients with Insulin Therapies. J Am Stat Assoc 113:1-13
Li, Quefeng; Cheng, Guang; Fan, Jianqing et al. (2018) Embracing the Blessing of Dimensionality in Factor Models. J Am Stat Assoc 113:380-389
Chen, Yunxiao; Li, Xiaoou; Liu, Jingchen et al. (2018) Recommendation System for Adaptive Learning. Appl Psychol Meas 42:24-41
Li, Xiang; Xie, Shanghong; Zeng, Donglin et al. (2018) Efficient ℓ0-norm feature selection based on augmented and penalized minimization. Stat Med 37:473-486
Li, Xiaoou; Liu, Jingchen; Ying, Zhiliang (2018) Chernoff Index for Cox Test of Separate Parametric Families. Ann Stat 46:1-29
Chen, Yunxiao; Li, Xiaoou; Liu, Jingchen et al. (2017) Exploratory Item Classification Via Spectral Graph Clustering. Appl Psychol Meas 41:579-599
Kim, Sehee; Zeng, Donglin; Taylor, Jeremy M G (2017) Joint partially linear model for longitudinal data with informative drop-outs. Biometrics 73:72-82
Sit, Tony; Liu, Mengling; Shnaidman, Michael et al. (2016) Design and analysis of clinical trials in the presence of delayed treatment effect. Stat Med 35:1774-9
Chen, Yunxiao; Li, Xiaoou; Liu, Jingchen et al. (2016) Regularized Latent Class Analysis with Application in Cognitive Diagnosis. Psychometrika :
He, Qianchuan; Zhang, Hao Helen; Avery, Christy L et al. (2016) Sparse meta-analysis with high-dimensional data. Biostatistics 17:205-20

Showing the most recent 10 out of 73 publications