Randomized clinical trials are, and will continue to be, the key vehicle for evaluating new and existing cancer therapies. This revolutionary era of advances in the biological sciences is leading to the discovery of novel biomarkers and complex genetic and genomic information that may be highly associated with various clinical outcomes, offering the tantalizing opportunity to exploit this information both to improve the precision of the analyses of trials and to develop models of longitudinal disease progression that may reveal important insights. A recurrent challenge is that missing data and subject drop-out are commonplace, presenting complications for analyses of these trials. Through a series of aims addressing these issues, this project proposes research that will have a significant impact on the quality and strength of inferences possible from current cancer clinical trials. It is well known that the efficiency of primary analyses of clinical trials can be improved by exploiting prognostic baseline auxiliary information; however, such analyses are controversial because of the temptation to choose the analysis that leads to the most dramatic treatment effect. In the first aim, new methods for such "covariate adjustment" will be studied that circumvent this issue and can improve over existing approaches. In the second aim, these methods will be extended so that they may be used in the common case where outcomes are missing due to drop-out. Efficient methods for longitudinal analysis of measures such as quality of life and biomarkers in the presence of drop-out will also be developed. Understanding the relationship between such longitudinal measures and clinical outcomes such as time to recurrence or survival time is of key importance.
The third aim focuses on development of methods for assessing the correctness of so-called joint statistical models used for this purpose and for assessing the influence of particular observations on the fit of the model, where the data used to develop the model may be missing. Finally, taking appropriate account of missing data sometimes requires unverifiable assumptions about why the data are missing; these assumptions are incorporated in models that therefore cannot be checked against the data.
The fourth aim is devoted to development of a new statistical framework for assessing how sensitive conclusions are to the modeling assumptions made.

Public Health Relevance

Randomized clinical trials in cancer research are the most important mechanism for the evaluation of new and existing therapies. Statistical methods will be developed that will improve the precision of the analyses of these trials and provide tools for drawing valid conclusions when some of the data intended to be collected are missing, e.g., if some subjects drop out of the trial. These methods will offer cancer researchers an expanded set of tools that will greatly improve the quality and strength of analyses of current cancer clinical trials.

National Institutes of Health (NIH)
National Cancer Institute (NCI)
Research Program Projects (P01)
Study Section
Special Emphasis Panel (ZCA1-RPRB-7)
University of North Carolina Chapel Hill
Chapel Hill
United States
Jung, Sin-Ho; Lee, Ho Yun; Chow, Shein-Chung (2018) Statistical Methods for Conditional Survival Analysis. J Biopharm Stat 28:927-938
Jiang, Yuchao; Wang, Rujin; Urrutia, Eugene et al. (2018) CODEX2: full-spectrum copy number variation detection by high-throughput DNA sequencing. Genome Biol 19:202
Kim, Soyoung; Zeng, Donglin; Cai, Jianwen (2018) Analysis of multiple survival events in generalized case-cohort designs. Biometrics :
Chung, Yunro; Ivanova, Anastasia; Hudgens, Michael G et al. (2018) Partial likelihood estimation of isotonic proportional hazards models. Biometrika 105:133-148
Liang, Shuhan; Lu, Wenbin; Song, Rui (2018) Deep advantage learning for optimal dynamic treatment regime. Stat Theory Relat Fields 2:80-88
Chen, Xiaolin; Cai, Jianwen (2018) Reweighted estimators for additive hazard model with censoring indicators missing at random. Lifetime Data Anal 24:224-249
Yang, Yuchen; Huh, Ruth; Culpepper, Houston W et al. (2018) SAFE-clustering: Single-cell Aggregated (From Ensemble) Clustering for Single-cell RNA-seq Data. Bioinformatics :
Nasution, Marlina D; Wang, Xiaofei (2018) Statistical issues and advances in cancer precision medicine research. J Biopharm Stat 28:215-216
Wu, Yuan; Chambers, Christina D; Xu, Ronghui (2018) Semiparametric sieve maximum likelihood estimation under cure model with partly interval censored and left truncated data for application to spontaneous abortion. Lifetime Data Anal :
Ibrahim, Joseph G; Kim, Sungduk; Chen, Ming-Hui et al. (2018) Bayesian multivariate skew meta-regression models for individual patient data. Stat Methods Med Res :962280218801147