Biomarkers in cancer research are considered a central component of the expected improvements in prevention, detection, treatment and monitoring. There are potentially useful in many different types of studies and for many different purposes. Critical questions are whether they are valid to use, how can they be utilized in a valid and efficient way, and then if they are used how confident is one in the conclusions that are obtained. The use of biomarkers to advance understanding in cancer science has great potential, but also has some risks. Biomarkers are subject to uncertainty in their measurement, they may not be measuring exactly the quantity of interest, and since they are not explicitly measures of symptoms their use to aid in decision making or evaluation of therapies in a clinical setting is subject to uncertainty. Thus careful analysis of data from studies that involve biomarkers is crucial. There are many statistical challenges that arise in such studies. This application is concerned with developing, evaluating and applying statistical methods for data that involves biomarkers.
The first aim i s concerned with adding biomarkers to prediction models that may be used to stratify or classify patients. In this aim we develop approaches for integrating data from other sources to improve the prediction models. This research will have broad applicability. Innovative aspects involve the use of targeted ridge regression, multi-kernel machine modeling, and importance sampling to incorporate information from the literature.
The second aim i s concerned with clinical trials where the biomarker is to be used to evaluate a therapy as a surrogate endpoint. Because of the nature of the scientific question causal modeling is very natural in this context. We propose to develop both potential outcomes and structural causal models. We will investigate both single trial and multi trial settings with different endpoint types.
The third aim i s concerned with therapies that may be effective only for a subgroup of patients, and to be useful this subgroup is determined by a small number of predictive biomarkers. For data from randomized clinical trials we suggest a unified modeling approach, and will investigate the use of single index models with variable selection and multivariate partial least squares to aid in the subgroup identification. Inference following subgroup identification is challenging, we suggest an innovative scheme to simulate data under an appropriate null distribution. All 3 aims in this proposal address fundamental and significant problems in translational oncology research. Successful completion of the aims will have an impact both in understanding and utilizing biomarkers and also in developing statistical methodology that can be more broadly applicable to other fields.

Public Health Relevance

Biomarkers are considered a central component of the expected improvements in prevention, detection, treatment and monitoring in cancer. Critical questions about biomarkers are when and whether they are valid to use, how can they be utilized in a valid and efficient way, and then if they are used how confident is one in the conclusions that are obtained. This proposal is concerned with developing proper and efficient statistical methods for evaluation of biomarker data.

National Institute of Health (NIH)
National Cancer Institute (NCI)
Research Project (R01)
Project #
Application #
Study Section
Cancer Biomarkers Study Section (CBSS)
Program Officer
Feuer, Eric J
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Michigan Ann Arbor
Biostatistics & Other Math Sci
Schools of Public Health
Ann Arbor
United States
Zip Code
Rizopoulos, Dimitris; Taylor, Jeremy M G; Van Rosmalen, Joost et al. (2016) Personalized screening intervals for biomarkers using joint models for longitudinal and survival data. Biostatistics 17:149-64
Foster, Jared C; Nan, Bin; Shen, Lei et al. (2016) Permutation Testing for Treatment-Covariate Interactions and Subgroup Identification. Stat Biosci 8:77-98
Shen, Jincheng; Wang, Lu; Taylor, Jeremy M G (2016) Estimation of the optimal regime in treatment of prostate cancer recurrence from observational data using flexible weighting models. Biometrics :
Prince, Victoria; Bellile, Emily L; Sun, Yilun et al. (2016) Individualized risk prediction of outcomes for oral cavity cancer patients. Oral Oncol 63:66-73
Zhan, Xiang; Epstein, Michael P; Ghosh, Debashis (2015) An Adaptive Genetic Association Test Using Double Kernel Machines. Stat Biosci 7:262-281
Taylor, Jeremy M G; Cheng, Wenting; Foster, Jared C (2015) Reader reaction to "a robust method for estimating optimal treatment regimes" by Zhang et al. (2012). Biometrics 71:267-71
Elliott, Michael R; Conlon, Anna S C; Li, Yun et al. (2015) Surrogacy marker paradox measures in meta-analytic settings. Biostatistics 16:400-12
Taylor, Jeremy M G; Conlon, Anna S C; Elliott, Michael R (2015) Surrogacy assessment using principal stratification with multivariate normal and Gaussian copula models. Clin Trials 12:317-22
Boonstra, Philip S; Mukherjee, Bhramar; Taylor, Jeremy M G (2015) A Small-Sample Choice of the Tuning Parameter in Ridge Regression. Stat Sin 25:1185-1206
Ghosh, Debashis; Zhu, Yeying; Coffman, Donna L (2015) Penalized regression procedures for variable selection in the potential outcomes framework. Stat Med 34:1645-58

Showing the most recent 10 out of 45 publications