The goal of this application is to develop statistical methods for the analysis of failure time data in the presence of missing diagnoses or classification, missing observation time and progression measurement, and missing segments of the target population. These types of missingness are not individual-specific, but rather are characteristic of the entire study population. That is, all subjects are missing a histologically-based diagnosis, or all subjects are missing continuous-valued, continuous-time measurements of progression, or all subjects of a certain type are missing from the study population. These three dimensions of population- wide missingness encompass a broad range of real problems that I have encountered in studies of brain tumors, schwannomas, Multiple Sclerosis (MS), and coronary heart disease (CHD). The brain tumor and schwannoma studies measure several histologic features, with the goal of refining diagnoses to be more prognostic for clinical outcomes. The MS study features a failure endpoint that is defined by a discrete time ordinal longitudinal process. This is common in many cancer studies, which have scored endpoints such as radiologic progression and performance status. The CHD study typifies an emerging common design in which prospective cohorts are sampled mid-study for genetic analysis. This is commonly done to investigate genetic associations with various cancers, as well. An array of statistical methods, including Bayesian and frequentist latent class models, transitional models for ordinal processes, and pseudo likelihood estimation for a biased sample, are used to address these problems. Relevance:
This research aims to provide improved statistical methods for the design and analysis of clinical and laboratory studies of cancer. The methods may lead to faster discovery of cancer genes and effective treatments and to better understanding of disease progression through more efficient use of resources.

National Institute of Health (NIH)
National Cancer Institute (NCI)
Research Project (R01)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-HOP-Q (03))
Program Officer
Mariotto, Angela B
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Harvard University
Biostatistics & Other Math Sci
Schools of Public Health
United States
Zip Code
Chiou, Sy Han; Austin, Matthew D; Qian, Jing et al. (2018) Transformation model estimation of survival under dependent truncation and independent censoring. Stat Methods Med Res :962280218817573
Swanson, D M; Anderson, C D; Betensky, R A (2018) Hypothesis Tests for Neyman's Bias in Case-Control Studies. J Appl Stat 45:1956-1977
Emerson, Sarah C; Waikar, Sushrut S; Fuentes, Claudio et al. (2018) Biomarker validation with an imperfect reference: Issues and bounds. Stat Methods Med Res 27:2933-2945
Matsouaka, Roland A; Singhal, Aneesh B; Betensky, Rebecca A (2018) An optimal Wilcoxon-Mann-Whitney test of mortality and a continuous outcome. Stat Methods Med Res 27:2384-2400
Atem, Folefac D; Qian, Jing; Maye, Jacqueline E et al. (2017) Linear Regression with a Randomly Censored Covariate: Application to an Alzheimer's Study. J R Stat Soc Ser C Appl Stat 66:313-328
Rentz, Dorene M; Mormino, Elizabeth C; Papp, Kathryn V et al. (2017) Cognitive resilience in clinical and preclinical Alzheimer's disease: the Association of Amyloid and Tau Burden on cognitive performance. Brain Imaging Behav 11:383-390
Atem, Folefac D; Qian, Jing; Maye, Jacqueline E et al. (2016) Multiple Imputation of a Randomly Censored Covariate Improves Logistic Regression Analysis. J Appl Stat 43:2886-2896
Swanson, David M; Betensky, Rebecca A (2015) Research participant compensation: A matter of statistical inference as well as ethics. Contemp Clin Trials 45:265-269
Betensky, Rebecca A (2015) Measures of follow-up in time-to-event studies: Why provide them and what should they be? Clin Trials 12:403-8
Matsouaka, Roland A; Betensky, Rebecca A (2015) Power and sample size calculations for the Wilcoxon-Mann-Whitney test in the presence of death-censored observations. Stat Med 34:406-31

Showing the most recent 10 out of 39 publications