Measurement of outcomes related to the quality of physical and mental health states in population-based cancer studies has increased in recent years as more and more researchers realize the importance of such endpoints. These endpoints are measured alongside conventional clinical outcomes and for the most part rely on patient self- report. A key problem has been missing data as patients may die or may be too sick to complete the study. This loss of information represents, besides the loss of efficiency, a potentially large threat to validity of the study results. There is strong evidence that such data are not missing at random, and cannot be ignored without introducing bias. Two extreme views on how to deal with incomplete data are (1) to delete cases with incomplete information altogether and (2) to construct complicated joint models for the measurement and missingness processes. These extreme views are surrounded with problems, largely due to the untestable nature of the assumptions one has to make regarding the missingness mechanism. A more versatile methodology that embeds the treatment of incomplete data within a sensitivity analysis is then required. Developing such a methodology necessitates extensive knowledge of biology and epidemiology of cancer. The K01 mechanism will help integrate mentoring and formal basic training in cancer research with prior training in Biostatistics and Population Health by building on a solid foundation in the development of new statistical methodologies for handling missing data. The research plan integrates training and mentoring to study, for example, how baseline and time dependent characteristics impact cancer patients'functional and mental states across time with missing data adjustment. Our approach is to develop a family of non ignorable models with sensitivity parameters that can be interpretable by subject matter experts. A global sensitivity analysis for the proposed non-ignorable models will be developed and implemented in the context of estimation and hypothesis testing using the classical frequentist approach and the Bayesian posterior predictive check principle. And finally, central theoretical questions about the proposed methods will be investigated using both analytic techniques and Monte Carlo simulations. A key goal of this K01 grant mechanism is to improve our ability to help, through collaborations, design complex clinical trials and observational studies in cancer research, analyze the generated data while adjusting for dropouts and missing data, and interpret the findings to public health professionals and the public.

National Institute of Health (NIH)
National Cancer Institute (NCI)
Research Scientist Development Award - Research & Training (K01)
Project #
Application #
Study Section
Subcommittee G - Education (NCI)
Program Officer
Ojeifo, John O
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Michigan State University
Public Health & Prev Medicine
Schools of Medicine
East Lansing
United States
Zip Code
Todem, David; Hsu, Wei-Wen; Fine, Jason P (2018) A quasi-score statistic for homogeneity testing against covariate-varying heterogeneity. Scand Stat Theory Appl 45:465-481
Hsu, Wei-Wen; Todem, David; Kim, KyungMann (2016) A sup-score test for the cure fraction in mixture models for long-term survivors. Biometrics 72:1348-1357
Todem, David; Kim, KyungMann; Hsu, Wei-Wen (2016) Marginal mean models for zero-inflated count data. Biometrics 72:986-94
Cao, Guanqun; Hsu, Wei-Wen; Todem, David (2014) A score-type test for heterogeneity in zero-inflated models in a stratified population. Stat Med 33:2103-14
Cao, Guanqun; Todem, David; Yang, Lijian et al. (2013) Evaluating Statistical Hypotheses Using Weakly-Identifiable Estimating Functions. Scand Stat Theory Appl 40:256-273
Cao, Guanqun; Yang, Lijian; Todem, David (2012) Simultaneous Inference For The Mean Function Based on Dense Functional Data. J Nonparametr Stat 24:359-377
Todem, David; Hsu, Wei-Wen; Kim, KyungMann (2012) On the efficiency of score tests for homogeneity in two-component parametric models for discrete data. Biometrics 68:975-82
Cao, Guanqun; Wang, Jing; Wang, Li et al. (2012) Spline Confidence Bands for Functional Derivatives. J Stat Plan Inference 142:1557-1570
Todem, D; Fine, J; Peng, L (2010) A global sensitivity test for evaluating statistical hypotheses with nonidentifiable models. Biometrics 66:558-66
Todem, David; Kim, Kyungmann; Fine, Jason et al. (2010) Semiparametric regression models and sensitivity analysis of longitudinal data with nonrandom dropouts. Stat Neerl 64:133-156