ROC Curve Methodology

Schisterman, Enrique

Abstract

Just as there are many markers of oxidative stress, the rapid growth of biotechnology means that researchers increasingly must consider which screening or diagnostic test to use in their research. My work with ROC curves is aimed at providing evidence-based approaches for making these choices. The ROC curve plots the proportions of abnormal and normal subjects correctly diagnosed (sensitivity and specificity, respectively) across test cutoff points. This graphical display facilitates the selection of an optimal threshold and enables easy comparison of the abilities of different tests. Increasingly, ROC curves are used in population-based settings as opposed to settings where individuals have been pre-screened to some degree. However, ROC curve methods were not developed to account for common problems such as missing data, measurement error, linear combinations of markers, confounding, referral bias, limits of detection (LODs), and other challenges.

We have proposed estimators of the mean of a K-sample U-statistic (of which the area under the ROC curve (AUC) is a special case) when data on the outcomes of interest are missing in some sampled units and auxiliary variables are available in the entire sample. The proposed estimators exploit the information available in the auxiliaries without requiring assumptions about the joint distribution of the auxiliaries and outcomes. The properties of the proposed estimators are derived from general results on efficient semi-parametric estimation of the mean of a K-sample U-statistic with missing-at-random outcomes, observed auxiliary variables, and known missingness probabilities.

Random measurement error can attenuate a biomarker's ability to discriminate between diseased and non-diseased populations. We present an approach for estimating the Youden index and its associated optimal cut-point, as well as the AUC, for a normally distributed biomarker that corrects for normally distributed random measurement error.
We also developed confidence intervals for these corrected estimates using the delta method and assessed their coverage probability by simulation across a variety of scenarios. Applying these techniques to the biomarker thiobarbituric acid reactive substances (TBARS), a measure of oxidative stress that has been proposed as a discriminating measurement for infertility, yields a 50% increase in diagnostic effectiveness at the optimal cut-point. This result may lead to biomarkers that were once naively considered ineffective becoming useful diagnostic devices.

Since multiple markers are often available, we considered combining them to improve diagnostic accuracy. The linear combinations derived by Su and Liu (1993) that maximize the AUC may have unsatisfactorily low sensitivity over a certain range of desired specificity. We considered maximization of sensitivity over a range of specificity, and presented alternative linear combinations that have higher sensitivity over a range of high (or low) specificity. Additionally, we evaluated covariate effects on this linear combination, assuming that the multiple markers, or a transformation thereof, follow a multivariate normal distribution. We estimated the covariate-adjusted ROC curve of this linear combination of markers and derived approximate confidence intervals for the corresponding AUC.

Another frequently encountered problem in studies that evaluate new diagnostic tests is that not all patients undergo disease verification, due to the expense and/or invasiveness of the verification test. In fact, the decision to subject patients to verification testing often depends on the results of the new test and other predictors of disease status. For diagnostic tests where AUC estimation is based only on patients with verified disease status, the usual estimators are biased. We developed estimators that adjust for this bias.

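To make the verification-bias problem concrete, the following is a minimal sketch of one generic correction, inverse-probability weighting, in which each verified subject is weighted by the reciprocal of the probability of being verified. It is an illustration only and is not claimed to be the estimator developed in this project. The function name `ipw_auc`, its arguments, and the assumption that the verification probabilities are known (or have been estimated separately) are all hypothetical choices made for this example.

```python
import numpy as np

def ipw_auc(marker, disease, verified, p_verify):
    """Inverse-probability-weighted AUC from verified subjects only.

    marker   : test result for every subject
    disease  : 1 = diseased, 0 = non-diseased (any value if unverified)
    verified : True if disease status was actually verified
    p_verify : assumed-known probability that each subject is verified
    """
    marker = np.asarray(marker, dtype=float)
    disease = np.asarray(disease)
    verified = np.asarray(verified, dtype=bool)
    # Weight 1/p for verified subjects, 0 for unverified ones.
    w = np.where(verified, 1.0 / np.asarray(p_verify, dtype=float), 0.0)

    cases = verified & (disease == 1)
    controls = verified & (disease == 0)
    x, wx = marker[cases], w[cases]
    y, wy = marker[controls], w[controls]

    # Mann-Whitney kernel: 1 if case > control, 0.5 for ties, 0 otherwise.
    kernel = (x[:, None] > y[None, :]) + 0.5 * (x[:, None] == y[None, :])
    pair_w = wx[:, None] * wy[None, :]
    return float((kernel * pair_w).sum() / pair_w.sum())
```

When every subject is verified with probability 1, this reduces to the usual empirical AUC; for example, `ipw_auc([1, 3, 2, 4], [0, 0, 1, 1], [True] * 4, [1, 1, 1, 1])` returns 0.75.
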
When information on disease status is missing, it is necessary to model either the missing data or the process leading to the missingness in order to obtain well-behaved estimators of the AUC. We have described a doubly robust estimator that is unbiased when either the model for disease or the model for missingness is correctly specified. This estimator does not require EM-type iterations and is easy to compute using standard software. It can accommodate both discrete and continuous markers and allows for the possibility that selection to verification is non-ignorable. In addition, the doubly robust estimator offers more protection against model misspecification than other currently available methods.

We have applied the methods described above to show that TBARS has discriminating ability beyond chance. This work has yielded 23 publications in peer-reviewed journals, including Biometrika and the Journal of the American Statistical Association.
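
The attenuation caused by random measurement error, discussed earlier in the abstract, can be illustrated under a simple binormal model. The sketch below assumes classical additive error that is independent of the true marker value, has the same known variance `err_var` in both groups, and leaves the group means unchanged. These assumptions, and the function names, are introduced here for illustration and are not taken from the project's publications.

```python
from math import sqrt
from statistics import NormalDist

def binormal_auc(mu_d, mu_n, var_d, var_n):
    """AUC for normally distributed markers: Phi((mu_d - mu_n) / sqrt(var_d + var_n))."""
    return NormalDist().cdf((mu_d - mu_n) / sqrt(var_d + var_n))

def corrected_binormal_auc(mu_d, mu_n, obs_var_d, obs_var_n, err_var):
    """AUC after removing an assumed-known additive error variance.

    Under the stated assumptions, observed variance = true variance + err_var,
    so the error variance is subtracted from each group before computing the AUC.
    """
    true_var_d = obs_var_d - err_var
    true_var_n = obs_var_n - err_var
    if true_var_d <= 0 or true_var_n <= 0:
        raise ValueError("error variance must be smaller than each observed variance")
    return binormal_auc(mu_d, mu_n, true_var_d, true_var_n)

# Hypothetical numbers: group means 1 and 0, observed variances 1 and 1,
# measurement-error variance 0.5.
naive = binormal_auc(1.0, 0.0, 1.0, 1.0)
corrected = corrected_binormal_auc(1.0, 0.0, 1.0, 1.0, 0.5)
```

With these hypothetical inputs the corrected AUC exceeds the naive one, because subtracting the error variance shrinks the denominator inside the normal distribution function.
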

Funding Agency

Agency: National Institutes of Health (NIH)
Institute: Eunice Kennedy Shriver National Institute of Child Health & Human Development (NICHD)
Type: Intramural Research (Z01)
Project #: 1Z01HD008761-05
Application #: 7594226
Support Year: 5
Fiscal Year: 2007
Total Cost: $140,939

Institution

Name: Eunice Kennedy Shriver National Institute of Child Health & Human Development
Country: United States

Related projects


NIH 2008 Z01 HD	ROC Curve Methodology Schisterman, Enrique / Eunice Kennedy Shriver National Institute of Child Health & Human Development	$38,772
NIH 2007 Z01 HD	ROC Curve Methodology Schisterman, Enrique / Eunice Kennedy Shriver National Institute of Child Health & Human Development	$140,939
NIH 2006 Z01 HD	ROC Curve Methodology Liu, Aiyi / U.S. National Inst/Child Hlth/Human Dev
NIH 2005 Z01 HD	ROC Curve Methodology Liu, Aiyi / U.S. National Inst/Child Hlth/Human Dev
NIH 2004 Z01 HD	ROC Curve Methodology Liu, Aiyi / U.S. National Inst/Child Hlth/Human Dev
NIH 2003 Z01 HD	ROC Curve Methodology Liu, Aiyi / U.S. National Inst/Child Hlth/Human Dev

Publications

Albert, Paul S; Harel, Ofer; Perkins, Neil et al. (2010) Use of multiple assays subject to detection limits with regression modeling in assessing the relationship between exposure and outcome. Epidemiology 21 Suppl 4:S35-43

Perkins, Neil J; Schisterman, Enrique F; Vexler, Albert (2009) Generalized ROC curve inference for a biomarker subject to a limit of detection and measurement error. Stat Med 28:1841-60

Schisterman, Enrique F; Whitcomb, Brian W (2009) Reply to Commentaries: Biology and methodology - the quest for parsimonious models of a complex reality. Paediatr Perinat Epidemiol 23:421-423

Vexler, Albert; Liu, Aiyi; Eliseeva, Ekaterina et al. (2008) Maximum likelihood ratio tests for comparing the discriminatory ability of biomarkers subject to limit of detection. Biometrics 64:895-903

Schisterman, Enrique F; Faraggi, David; Reiser, Benjamin et al. (2008) Youden Index and the optimal threshold for markers with mass at zero. Stat Med 27:297-315

Vexler, Albert; Schisterman, Enrique F; Liu, Aiyi (2008) Estimation of ROC curves based on stably distributed biomarkers subject to measurement error and pooling mixtures. Stat Med 27:280-96

Howards, Penelope P; Schisterman, Enrique F; Heagerty, Patrick J (2007) Potential confounding by exposure history and prior outcomes: an example from perinatal epidemiology. Epidemiology 18:544-51

Bloom, Michael S; Schisterman, Enrique F; Hediger, Mary L (2007) The use and misuse of matching in case-control studies: the example of polycystic ovary syndrome. Fertil Steril 88:707-10

Perkins, Neil J; Schisterman, Enrique F; Vexler, Albert (2007) Receiver operating characteristic curve inference from a sample with a limit of detection. Am J Epidemiol 165:325-33

Schisterman, Enrique F; Vexler, Albert; Whitcomb, Brian W et al. (2006) The limitations due to exposure detection limits for regression models. Am J Epidemiol 163:374-83

Showing the most recent 10 out of 23 publications
