Sequential mixed-mode surveys use a mix of modes or data collection methods such as mail, telephone, in-person, and web to increase the number of people who respond to a survey. In sequential designs, there is usually no control in assigning subgroups of respondents to modes. As a result, nonrandom assignment of modes is an inherent characteristic of sequential mixed-mode surveys. This design is important since there are usually limited funds to probe people to respond. While the goal of using mixed modes is clear, one compelling research question is how the nonrandom mix of mode impacts survey data and how these effects should be handled in estimating survey population characteristics such as mean income, and health insurance coverage. To date, since the nonrandom mix of modes poses a challenge in evaluating the mode effects, the existing inference methods assume that mode effects can be ignored in sequential mixed-mode surveys despite their unknown impact on the quality of the survey estimates. This research develops and evaluates the statistical inference methods accounting for nonrandom mode effects to test the comparability of the survey estimates from the different modes. In parallel, this project also develops statistical inference methods accounting for both nonresponse and nonrandom mode effects in the presence of nonignorable mode effects. The public-use Current Population Survey (CPS), 1973, and Social Security Records Exact Match, and the nonpublic-use American Community Survey (ACS) data will be used to conduct empirical and simulation evaluations.

This research provides federal agencies, survey organizations, research centers, and other data producers assessment and inferential methods that adjust for both nonresponse and nonrandom mode effects in the context of sequential mixed-mode surveys. Some large surveys have employed some variation of mixed-mode surveys in order to meet budget constraints. On the other hand, in the presence of nonignorable mode effects, the bias properties for the survey population characteristics are not known and the existing assessment and inferential methods do not control for the nonrandom mode effects. This research produces sequential mixed-mode assessment methods which will test the ignorability of the mode effects which can be a threat for the quality of survey data. In parallel, this research also produces methods of inference which will yield higher quality survey estimates in the presence of nonignorable mode effects.

Project Report

Work addressed two specific research questions related to multi-mode surveys: 1) Are the measurement error differences between modes ignorable? and 2) What are the properties of statistical inference methods that incorporate nonignorable measurement error differences under a mixed-mode survey design? Work was completed on three alternative mixed-mode survey mean estimators. The general steps in the estimation method were: 1) Impute counterfactual data for the alternative mode(s) under specified mode choice and response models, 2) Use both observed and actual data for the complete set of respondents to compute mode specific survey means, 3) Adjust for nonresponse, 4) Compare mode specific mean estimates, and 5) Combine mode specific means under three alternative combining rules. Step 1) created data that were counterfactual in the sense that imputations for cases that responded using one mode were made as if they had responded via another mode. For the purpose of this research, nonresponse adjustments were omitted, although future research should address it. Using 1973 Current Population Survey (CPS)-Internal Revenue Service (IRS) Match Data and 2012 CPS data, comparisons were made between the naïve and three alternative methods for Income and Health Insurance Coverage. Three alternative methods were: 1) Simple average estimator, in which mode specific means are combined by using a simple average estimator, 2) Combined estimator in which estimated means are weighted inversely according to their variances, and 3) Combined estimator in which estimated means are weighted inversely according to their mean square errors (MSEs). A fourth choice was the naïve mean estimator ignoring any mode effects, i.e., 4) Combine data together in one sample and estimate the mean ignoring any differences in modes. An empirical study and a simulation study on 1973 CPS Match Data were conducted to study the bias properties of naïve estimator and the three alternative estimators. A sensitivity analysis was conducted using the 2012 CPS data. The empirical study was conducted on a subset of 1973 CPS-IRS Match Data using wage and salary income as reported in the CPS. CPS-reported wage and salary income for 1972 were compared against the 1972 person-level IRS wage and salary income data. The average relative differences by mode were not significantly different between the CPS in-person and the telephone modes. On the other hand, the direct comparisons of individual in-person respondents’ earnings to telephone respondent earnings suggest a possible nonignorable mode choice mechanism, that is, the mode choice mechanism depends on the variable itself. On average, in-person respondents reported their earnings $1,369 less per year than telephone respondents. After controlling for personal characteristics, education, work experience, race (white vs. other), occupation type (professional, sales, craft, laborer), and industry (construction, manufacturing, transportation, trade, service) and residential (household) characteristics, central city, suburb, region, the difference remained significant. The empirical study considers this subset as the population and draws 50 simulation samples under 400/800 sample sizes of persons. In addition to sample size variation, the simulations also varied two imputation methods: 1) ignorable mode choice, and 2) nonignorable mode choice and 1) item missing included, and 2) item missing excluded conditions. The results were in the expected direction. That is, the standard procedure of ignoring potential mode differences led to biased estimators. The estimator that combined means inversely according to MSEs was generally least biased. For this estimator to be feasible in practice, one of the modes must be selected as producing the most nearly unbiased estimators. The differences between item missing excluded and included conditions showed that sensitivity analysis should also be incorporated in selecting an imputation model. In follow-up simulations, a hypothetical population was simulated using 1973 CPS Match data for household income. This simulation was used to test the code, and compare the three alternative methods for a population with known bias properties. The results confirmed that when the differential measurement error increases, the relative biases for the naïve estimator increase. Under reasonable assumptions, alternative methods yield smaller relative biases. The project also included a sensitivity analysis based on the 2012 CPS data for personal income and health insurance coverage. This set of empirical comparisons used bootstrap replicates to compute mean personal income and percent health insurance coverage by each method. Replicate mean personal income and percent health insurance coverage for each method were compared using repeated measurement ANOVA. Similar to the other empirical and simulation studies, two imputation methods were implemented—ignorable and nonignorable mode choice models. The results confirmed that further research is required in modeling the choice and the response mechanisms. If modeling assumptions are validated, statistical inference methods that incorporate adjustment for differential measurement errors will often be required in mixed-mode survey inference.

Agency
National Science Foundation (NSF)
Institute
Division of Social and Economic Sciences (SES)
Type
Standard Grant (Standard)
Application #
1238612
Program Officer
Cheryl Eavey
Project Start
Project End
Budget Start
2012-09-01
Budget End
2014-08-31
Support Year
Fiscal Year
2012
Total Cost
$15,153
Indirect Cost
Name
Regents of the University of Michigan - Ann Arbor
Department
Type
DUNS #
City
Ann Arbor
State
MI
Country
United States
Zip Code
48109