Treatment of cancer is an ongoing process during which clinicians make a series of therapeutic decisions over the course of the disease. However, while there is increasing interest in identifying the overall strategy of sequential decisions leading to the most beneficial clinical outcomes, where those decisions may be predicated on complex information on the patient up to that point, current cancer clinical trials evaluate only the therapeutic options available at a single decision point, mostly in a """"""""one-size-fits-all"""""""" manner. Attempts to synthesize information from several isolated trials conducted at different milestones in the disease are problematic, because the best treatment at any one decision point may not be best when placed in the context of the entire decision process owing to possible delayed effects of past treatments on the efficacy of future treatments. Considering cancer treatment strategies as dynamic treatment regimes, which are formal algorithms for sequential decision-making that use accrued information on the patient at each decision point in an evidence-based manner to determine the next step of treatment, along with analytical reinforcement learning methods from computer science that provide a principled framework for identifying the optimal such regime, offers the potential to revolutionize how cancer treatment is viewed and effect a paradigm shift in the design and conduct of cancer clinical trials. The four specific aims of this project seek to catalyze this advance by studying these issues for the first time in the cancer treatment context.
The first aim will evaluate various learning methods to establish the best techniques for use in developing optimal dynamic treatment regimes for cancer, and the second will focus on a specific version of this methodology when clinicians are interested in finding the best regime among a particular set of regimes.
The third aim will develop new methods for making formal statistical inference on regimes developed based on data, which have been heretofore unavailable owing to the theoretical complexity of the problem. In the fourth aim, methods for design of so-called sequentially randomized trials for the specific purpose of developing dynamic treatment regimes, including determination of sample sizes that will ensure identification ofthe best regimes from among those in the trial, will be developed. Coupling trial design with learning methods for analysis, a new model, the clinical reinforcement trial, will be developed and applied to designing studies to identify optimal regimes for non-small cell lung cancer and other cancers. Collectively, these aims will result in high-impact, new methodology that will allow individualization of the therapy to the patient over time.

Public Health Relevance

Although treatment of cancer involves a series of therapeutic decisions over time, cancer clinical trials evaluate treatments only at specific decision points, and hence the best treatment in such a trial may not be best when placed in the context of the overall decision-making process. This research will study cancer treatment formally as an overall, individualized strategy so that the entire series of decisions leading to the best outcomes can be determined, promoting a paradigm shift in the way cancer therapies are evaluated.

National Institute of Health (NIH)
National Cancer Institute (NCI)
Research Program Projects (P01)
Project #
Application #
Study Section
Special Emphasis Panel (ZCA1-RPRB-7)
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of North Carolina Chapel Hill
Chapel Hill
United States
Zip Code
Jung, Sin-Ho; Lee, Ho Yun; Chow, Shein-Chung (2018) Statistical Methods for Conditional Survival Analysis. J Biopharm Stat 28:927-938
Jiang, Yuchao; Wang, Rujin; Urrutia, Eugene et al. (2018) CODEX2: full-spectrum copy number variation detection by high-throughput DNA sequencing. Genome Biol 19:202
Kim, Soyoung; Zeng, Donglin; Cai, Jianwen (2018) Analysis of multiple survival events in generalized case-cohort designs. Biometrics :
Chung, Yunro; Ivanova, Anastasia; Hudgens, Michael G et al. (2018) Partial likelihood estimation of isotonic proportional hazards models. Biometrika 105:133-148
Liang, Shuhan; Lu, Wenbin; Song, Rui (2018) Deep advantage learning for optimal dynamic treatment regime. Stat Theory Relat Fields 2:80-88
Chen, Xiaolin; Cai, Jianwen (2018) Reweighted estimators for additive hazard model with censoring indicators missing at random. Lifetime Data Anal 24:224-249
Yang, Yuchen; Huh, Ruth; Culpepper, Houston W et al. (2018) SAFE-clustering: Single-cell Aggregated (From Ensemble) Clustering for Single-cell RNA-seq Data. Bioinformatics :
Nasution, Marlina D; Wang, Xiaofei (2018) Statistical issues and advances in cancer precision medicine research. J Biopharm Stat 28:215-216
Wu, Yuan; Chambers, Christina D; Xu, Ronghui (2018) Semiparametric sieve maximum likelihood estimation under cure model with partly interval censored and left truncated data for application to spontaneous abortion. Lifetime Data Anal :
Ibrahim, Joseph G; Kim, Sungduk; Chen, Ming-Hui et al. (2018) Bayesian multivariate skew meta-regression models for individual patient data. Stat Methods Med Res :962280218801147

Showing the most recent 10 out of 549 publications