Treatment of cancer is an ongoing process during which clinicians make a series of therapeutic decisions over the course of the disease. However, while there is increasing interest in identifying the overall strategy of sequential decisions leading to the most beneficial clinical outcomes, where those decisions may be predicated on complex information on the patient up to that point, current cancer clinical trials evaluate only the therapeutic options available at a single decision point, mostly in a """"""""one-size-fits-all"""""""" manner. Attempts to synthesize information from several isolated trials conducted at different milestones in the disease are problematic, because the best treatment at any one decision point may not be best when placed in the context of the entire decision process owing to possible delayed effects of past treatments on the efficacy of future treatments. Considering cancer treatment strategies as dynamic treatment regimes, which are formal algorithms for sequential decision-making that use accrued information on the patient at each decision point in an evidence-based manner to determine the next step of treatment, along with analytical reinforcement learning methods from computer science that provide a principled framework for identifying the optimal such regime, offers the potential to revolutionize how cancer treatment is viewed and effect a paradigm shift in the design and conduct of cancer clinical trials. The four specific aims of this project seek to catalyze this advance by studying these issues for the first time in the cancer treatment context.
The first aim will evaluate various learning methods to establish the best techniques for use in developing optimal dynamic treatment regimes for cancer, and the second will focus on a specific version of this methodology when clinicians are interested in finding the best regime among a particular set of regimes.
The third aim will develop new methods for making formal statistical inference on regimes developed based on data, which have been heretofore unavailable owing to the theoretical complexity of the problem. In the fourth aim, methods for design of so-called sequentially randomized trials for the specific purpose of developing dynamic treatment regimes, including determination of sample sizes that will ensure identification ofthe best regimes from among those in the trial, will be developed. Coupling trial design with learning methods for analysis, a new model, the clinical reinforcement trial, will be developed and applied to designing studies to identify optimal regimes for non-small cell lung cancer and other cancers. Collectively, these aims will result in high-impact, new methodology that will allow individualization of the therapy to the patient over time.

Public Health Relevance

Although treatment of cancer involves a series of therapeutic decisions over time, cancer clinical trials evaluate treatments only at specific decision points, and hence the best treatment in such a trial may not be best when placed in the context of the overall decision-making process. This research will study cancer treatment formally as an overall, individualized strategy so that the entire series of decisions leading to the best outcomes can be determined, promoting a paradigm shift in the way cancer therapies are evaluated.

National Institute of Health (NIH)
National Cancer Institute (NCI)
Research Program Projects (P01)
Project #
Application #
Study Section
Special Emphasis Panel (ZCA1-RPRB-7)
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of North Carolina Chapel Hill
Chapel Hill
United States
Zip Code
Acharya, Chaitanya R; McCarthy, Janice M; Owzar, Kouros et al. (2016) Exploiting expression patterns across multiple tissues to map expression quantitative trait loci. BMC Bioinformatics 17:257
Laber, Eric B; Zhao, Ying-Qi; Regh, Todd et al. (2016) Using pilot data to size a two-arm randomized trial to find a nearly optimal personalized treatment strategy. Stat Med 35:1245-56
Li, Zhiguo; Owzar, Kouros (2016) Fitting Cox Models with Doubly Censored Data Using Spline-Based Sieve Marginal Likelihood. Scand Stat Theory Appl 43:476-486
Wang, Xiaofei; Berry, Mark F (2016) Risk calculators are useful but.... J Thorac Cardiovasc Surg 151:706-7
Wang, Xuefeng; Chen, Mengjie; Yu, Xiaoqing et al. (2016) Global copy number profiling of cancer genomes. Bioinformatics 32:926-8
Ivanova, Anastasia; Wang, Yunfei; Foster, Matthew C (2016) The rapid enrollment design for Phase I clinical trials. Stat Med 35:2516-24
Zhang, Daowen; Sun, Jie Lena; Pieper, Karen (2016) Bivariate Mixed Effects Analysis of Clustered Data with Large Cluster Sizes. Stat Biosci 8:220-233
Schifano, Elizabeth D; Wu, Jing; Wang, Chun et al. (2016) Online Updating of Statistical Inference in the Big Data Setting. Technometrics 58:393-403
Lizotte, Daniel J; Laber, Eric B (2016) Multi-Objective Markov Decision Processes for Data-Driven Decision Support. J Mach Learn Res 17:
Minsker, Stanislav; Zhao, Ying-Qi; Cheng, Guang (2016) Active Clinical Trials for Personalized Medicine. J Am Stat Assoc 111:875-887

Showing the most recent 10 out of 378 publications