Common but often overlooked threats to the validity of comparative effectiveness research (CER) studies include the misclassification or missingness of binary variables that are crucial to the ultimate analysis of the data. These variables potentially include the outcome of interest in standard or repeated measures logistic regression models, the factor (exposure) of interest, or an important confounder of the association under study. This proposal seeks to facilitate the investigation of the resulting biases to which a given CER analysis may be subject, and to provide study design-based remedial measures via which validity can be restored. The focus is upon statistical methods for conducting sensitivity analyses, as well as methods designed to make efficient use of supplemental data sources. The latter include validation data (in the case of misclassification), and so-called reassessment data (in the case of potentially informative missingness). A primary consideration throughout includes the incorporation of subject-specific covariates into the model of interest, as well as into models for the underlying misclassification or missingness process. Another primary goal is to establish a relatively consistent likelihood-based framework for all proposed analyses incorporating supplemental data, and to provide user-friendly programs utilizing common statistical software in order to make the methods broadly and readily accessible to those conducting CER. While not limited to specific applications, the proposed research draws motivation from and lends itself to illustration via two real-world studies. The first is the HIV Epidemiology Research Study (HERS), an observational cohort study in which the binary diagnosis of bacterial vaginosis was made at repeated visits via both error-prone and sophisticated assay techniques. The second is an emergency department-based ophthalmologic study in which non-dilated ocular fundus photography will be used for diagnosing serious ocular conditions, and will be compared against existing standard diagnostic methods. Both studies involve internal validation data to facilitate corrections for misclassification based on a fallible diagnostic method, and both are also subject to missing outcome and/or predictor data.

Public Health Relevance

The goal of this project is to provide statistical methods to aid comparative effectiveness research (CER) investigators with common problems encountered in data analysis. The problems upon which the project focuses come about when binary (""""""""yes/no"""""""") data are subject to being incorrectly measured (misclassified), or when they are sometimes not observed (missing) for reasons that might relate to information about subjects in the study. The intention is to provide CER investigators with methods that are relatively easy to use, yet effective and powerful for combating these challenges to valid data analysis.

Agency
National Institute of Health (NIH)
Institute
National Institute of Nursing Research (NINR)
Type
High Impact Research and Research Infrastructure Programs—Multi-Yr Funding (RC4)
Project #
1RC4NR012527-01
Application #
8037394
Study Section
Special Emphasis Panel (ZRG1-HDM-C (56))
Program Officer
Huss, Karen
Project Start
2010-09-24
Project End
2013-08-31
Budget Start
2010-09-24
Budget End
2013-08-31
Support Year
1
Fiscal Year
2010
Total Cost
$441,691
Indirect Cost
Name
Emory University
Department
Biostatistics & Other Math Sci
Type
Schools of Public Health
DUNS #
066469933
City
Atlanta
State
GA
Country
United States
Zip Code
30322
Lyles, Robert H; Mitchell, Emily M; Weinberg, Clarice R et al. (2016) An efficient design strategy for logistic regression using outcome- and covariate-dependent pooling of biospecimens prior to assay. Biometrics 72:965-75
Mitchell, Emily M; Lyles, Robert H; Manatunga, Amita K et al. (2015) Semiparametric regression models for a right-skewed outcome subject to pooling. Am J Epidemiol 181:541-8
Mitchell, Emily M; Lyles, Robert H; Schisterman, Enrique F (2015) Positing, fitting, and selecting regression models for pooled biomarker data. Stat Med 34:2544-58
Tang, Li; Lyles, Robert H; King, Caroline C et al. (2015) Regression Analysis for Differentially Misclassified Correlated Binary Outcomes. J R Stat Soc Ser C Appl Stat 64:433-449
Lin, Ji; Lyles, Robert H (2015) Accounting for informatively missing data in logistic regression by means of reassessment sampling. Stat Med 34:1925-39
Tang, Li; Lyles, Robert H; King, Caroline C et al. (2015) Binary regression with differentially misclassified response and exposure variables. Stat Med 34:1605-20
Mitchell, Emily M; Lyles, Robert H; Manatunga, Amita K et al. (2014) Regression for skewed biomarker outcomes subject to pooling. Biometrics 70:202-11
Mitchell, Emily M; Lyles, Robert H; Manatunga, Amita K et al. (2014) A highly efficient design strategy for regression with outcome pooling. Stat Med 33:5028-40
Lyles, Robert H; Kupper, Lawrence L (2013) Approximate and Pseudo-Likelihood Analysis for Logistic Regression Using External Validation Data to Model Log Exposure. J Agric Biol Environ Stat 18:22-38
Lyles, Robert H; Guo, Ying; Greenland, Sander (2012) Reducing Bias and Mean Squared Error Associated With Regression-Based Odds Ratio Estimators. J Stat Plan Inference 142:3235-3241

Showing the most recent 10 out of 13 publications