Treatment of cancer is an ongoing process during which clinicians make a series of therapeutic decisions over the course of the disease. However, while there is increasing interest in identifying the overall strategy of sequential decisions leading to the most beneficial clinical outcomes, where those decisions may be predicated on complex information on the patient up to that point, current cancer clinical trials evaluate only the therapeutic options available at a single decision point, mostly in a """"""""one-size-fits-all"""""""" manner. Attempts to synthesize information from several isolated trials conducted at different milestones in the disease are problematic, because the best treatment at any one decision point may not be best when placed in the context of the entire decision process owing to possible delayed effects of past treatments on the efficacy of future treatments. Considering cancer treatment strategies as dynamic treatment regimes, which are formal algorithms for sequential decision-making that use accrued information on the patient at each decision point in an evidence-based manner to determine the next step of treatment, along with analytical reinforcement learning methods from computer science that provide a principled framework for identifying the optimal such regime, offers the potential to revolutionize how cancer treatment is viewed and effect a paradigm shift in the design and conduct of cancer clinical trials. The four specific aims of this project seek to catalyze this advance by studying these issues for the first time in the cancer treatment context.
The first aim will evaluate various learning methods to establish the best techniques for use in developing optimal dynamic treatment regimes for cancer, and the second will focus on a specific version of this methodology when clinicians are interested in finding the best regime among a particular set of regimes.
The third aim will develop new methods for making formal statistical inference on regimes developed based on data, which have been heretofore unavailable owing to the theoretical complexity of the problem. In the fourth aim, methods for design of so-called sequentially randomized trials for the specific purpose of developing dynamic treatment regimes, including determination of sample sizes that will ensure identification ofthe best regimes from among those in the trial, will be developed. Coupling trial design with learning methods for analysis, a new model, the clinical reinforcement trial, will be developed and applied to designing studies to identify optimal regimes for non-small cell lung cancer and other cancers. Collectively, these aims will result in high-impact, new methodology that will allow individualization of the therapy to the patient over time.

Public Health Relevance

Although treatment of cancer involves a series of therapeutic decisions over time, cancer clinical trials evaluate treatments only at specific decision points, and hence the best treatment in such a trial may not be best when placed in the context of the overall decision-making process. This research will study cancer treatment formally as an overall, individualized strategy so that the entire series of decisions leading to the best outcomes can be determined, promoting a paradigm shift in the way cancer therapies are evaluated.

National Institute of Health (NIH)
National Cancer Institute (NCI)
Research Program Projects (P01)
Project #
Application #
Study Section
Special Emphasis Panel (ZCA1)
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of North Carolina Chapel Hill
Chapel Hill
United States
Zip Code
Yang, Yuchen; Huh, Ruth; Culpepper, Houston W et al. (2018) SAFE-clustering: Single-cell Aggregated (From Ensemble) Clustering for Single-cell RNA-seq Data. Bioinformatics :
Nasution, Marlina D; Wang, Xiaofei (2018) Statistical issues and advances in cancer precision medicine research. J Biopharm Stat 28:215-216
Wu, Yuan; Chambers, Christina D; Xu, Ronghui (2018) Semiparametric sieve maximum likelihood estimation under cure model with partly interval censored and left truncated data for application to spontaneous abortion. Lifetime Data Anal :
Ibrahim, Joseph G; Kim, Sungduk; Chen, Ming-Hui et al. (2018) Bayesian multivariate skew meta-regression models for individual patient data. Stat Methods Med Res :962280218801147
Gao, Fei; Zeng, Donglin; Wei, Helen et al. (2018) Estimating Treatment Effects for Recurrent Events in the Presence of Rescue Medications: An Application to the Immune Thrombocytopenia Study. Stat Biosci 10:473-489
Zhang, Chong; Pham, Minh; Fu, Sheng et al. (2018) Robust Multicategory Support Vector Machines using Difference Convex Algorithm. Math Program 169:277-305
Chen, Jingxiang; Zhang, Chong; Kosorok, Michael R et al. (2018) Double Sparsity Kernel Learning with Automatic Variable Selection and Data Extraction. Stat Interface 11:401-420
Yang, Shu; Tsiatis, Anastasios A; Blazing, Michael (2018) Modeling survival distribution as a function of time to treatment discontinuation: A dynamic treatment regime approach. Biometrics 74:900-909
Liu, Ying; Wang, Yuanjia; Kosorok, Michael R et al. (2018) Augmented outcome-weighted learning for estimating optimal dynamic treatment regimens. Stat Med 37:3776-3788

Showing the most recent 10 out of 549 publications