In response to the strong demand for developing appropriate analytic techniques for use with new kinds of data and new approaches to behavioral and social science research, we propose to develop principled and parsimonious statistical measures that are applicable in studies using intensive measurement methods, such as Ecological Momentary Assessment (EMA) methods, to quantify the reliability and validity of empirical findings to nonignorable missingness. Like any study involving human subjects, missing data are common in EMA studies. For example, when studying the question Are moods just prior smoking different than moods during random background times, there can be a moderate amount of missing data because of study participants' nonresponses to those random prompts. It is often suspected that the missing data caused by such prompt nonresponses are nonrandom in that the prompt nonresponse behaviors are related to contemporaneous mood outcomes and consequently the observed data may be a selected nonrandom subset of a person's background mood even though the planned prompts are random. Such nonignorable missingness needs to be properly accounted for in the analysis of EMA data. However, unlike in more traditional studies, nonignorable missingness in intensive EMA data poses significant new analytic challenges and calls for more general, flexible and robust methods that are applicable in EMA studies to quantify and improve the reliability, validity and usability of the collected data. Thus, the aims of the proposed study are to (1) develop general, robust and tractable statistical measures and accessible software for assessing the impact of missing data on analysis of EMA data, and (2) examine the role of smoking on mood regulation in adolescents while accounting for the impact of nonrandom missingness, using data from our program project grant, Social and Emotional Contexts of Adolescent Smoking Patterns (NCI grant #PO1 2CA98262), which established a cohort of adolescents at high risk for the development of smoking and nicotine dependence. This study has the potential to make methodological and substantive contributions to EMA data analysis and understanding the relationship between mood variation and smoking dependence. The principled and simple statistical measures and accessible software to be developed will allow researchers to conveniently quantify the robustness of empirical findings from studies using EMA or other types of measurement-intensive methods to nonignorable missingness for a wide range of data types and models, missing data patterns and mechanisms. These methods can also easily generalize to a variety of cancer-relevant research areas, including studies using other types of new intensive measurements, such as mHealth (mobile heath) studies.
As the use of measurement-intensive methods, such as Ecological Momentary Assessment (EMA) methods, has become a new and vital approach to understanding health-related (e.g., smoking- and cancer- related) behaviors, there is a strong demand for developing appropriate analytic techniques for these new kinds of data and approaches to public health research. Like any study involving human subjects, missing data caused by nonresponse are ubiquitous in EMA studies and are often suspected to be nonignorable (e.g., when study participants' response behaviors to random prompts are related to contemporaneous but unobserved mood outcomes); unlike in more traditional studies, nonignorable missingness in such intensive data can pose significant new analytic challenges and calls for more general, flexible and tractable methods that are applicable in EMA studies to quantify and improve the reliability, validity and usability of the collected data. This proposal i aimed at addressing this need by developing principled and simple statistical measures and accessible software that allow researchers to conveniently quantify the robustness of empirical findings from studies using EMA or other types of measurement-intensive methods to nonignorable missingness for a wide range of data types and models, for complex missing data patterns and mechanisms and for studies that may involve a large number of analyses, and consequently facilitate more reliable understanding of the cancer- related health behaviors.
Xie, Hui; Gao, Weihua; Xing, Baodong et al. (2018) Measuring the Impact of Nonignorable Missingness Using the R Package isni. Comput Methods Programs Biomed 164:207-220 |
Gao, Weihua; Hedeker, Donald; Mermelstein, Robin et al. (2016) A scalable approach to measuring the impact of nonignorable nonresponse with an EMA application. Stat Med 35:5579-5602 |