Risk biomarkers have become increasingly important in clinical decision making, guiding patients and their clinicians in choosing the most appropriate course of therapy or surveillance after treatment. Constructing accurate, individualized prediction rules and validating them rigorously are critical to the cancer biomarker field. Prospective cohort studies are crucial for such evaluation because time to event carries more information about a marker's value for early detection and prognosis than a simple measure of disease status. But prospective biomarker evaluation is challenging, and until now little guidance has been provided for the statistical design and analysis of these studies. We propose to extend our previously funded effort to address several new challenges in prospective marker evaluation. The proposal will emphasize three unique aspects of prospective biomarker evaluation. First, for many cancers the disease outcome may be heterogeneous due to the biological nature of the disease or the selection of treatments. Constructing and validating prognostic and treatment-selection rules based on more specific prediction of the risk of developing aggressive cancer, as opposed to indolent cancer, is of great clinical interest yet analytically challenging.
In Aim 1 we will provide statistical tools for developing and validating risk markers in a population with an unknown mixture of indolent and aggressive cancers. We propose statistical methods that facilitate the development and evaluation of prognostic markers for risk stratification. We will also consider methods for deriving and evaluating individualized treatment rules in the presence of a mixture of indolent and aggressive cancers. Second, among patients diagnosed with cancer who choose active surveillance, monitoring tools that use longitudinal biomarkers to make adaptive monitoring or intervention recommendations may alleviate overtreatment without missing signs of progression.
In Aim 2 we will consider flexible procedures to quantify the updated predictive accuracy of longitudinal markers. In addition, we will develop and evaluate decision rules on the basis of risk, incorporating both cross-sectional and longitudinal marker information. Third, the ascertainment of marker information in a large cohort requires enormous resources. Cost-effective cohort sampling is therefore highly desirable.
In Aim 3 we will develop procedures to improve the efficiency of estimating risk and accuracy parameters, rigorously evaluate and compare different choices of matching/stratification rules, and identify optimal pairings of analysis and sampling strategies. We will also develop estimation procedures for evaluating longitudinal markers in two-phase studies. Applications in cancer biomarker development provide a context for our research. Data from the Early Detection Research Network and from several large cohort studies will be analyzed. Programs and algorithms developed in this proposal will be made available to the public.

Public Health Relevance

The research proposal addresses the pressing need for strong statistical input in the cancer biomarker field, providing comprehensive statistical tools that will enable investigators to conduct valid and more powerful biomarker validation studies and to evaluate the prognostic and treatment-selection potential of novel biomarkers. Integrating our research into clinical settings will help improve survival outcomes and reduce the burden of cancer treatment.

National Institutes of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Study Section
Cancer Biomarkers Study Section (CBSS)
Program Officer
Lyster, Peter
Fred Hutchinson Cancer Research Center
United States
Zhou, Qian M; Zheng, Yingye; Cai, Tianxi (2013) Assessment of biomarkers for risk prediction with nested case-control studies. Clin Trials 10:677-9
Zheng, Yingye; Cai, Tianxi; Pepe, Margaret S (2013) Adopting nested case-control quota sampling designs for the evaluation of risk markers. Lifetime Data Anal 19:568-88
Sun, Jianping; Zheng, Yingye; Hsu, Li (2013) A unified mixed-effects model for rare-variant association in sequencing studies. Genet Epidemiol 37:334-44
Schenk, Jeannette M; Hunter-Merrill, Rachel; Zheng, Yingye et al. (2013) Should modest elevations in prostate-specific antigen, International Prostate Symptom Score, or their rates of increase over time be used as surrogate measures of incident benign prostatic hyperplasia? Am J Epidemiol 178:741-51
Zhou, Qian M; Zheng, Yingye; Cai, Tianxi (2013) Subgroup specific incremental value of new markers for risk prediction. Lifetime Data Anal 19:142-69
Zheng, Yingye; Heagerty, Patrick J; Hsu, Li et al. (2010) On combining family-based and population-based case-control data in association studies. Biometrics 66:1024-33
Zheng, Yingye; Cai, Tianxi; Stanford, Janet L et al. (2010) Semiparametric models of time-dependent predictive values of prognostic biomarkers. Biometrics 66:50-60