Ecological studies may be defined examining associations at the group level. They are appealing in that they make use of routinely available data, and also offer the potential of high power due to large populations and broad exposure contrasts. However, they are also susceptible to a range of biases with respect to individual-level associations, collectively termed ecological bias, and may lead to the ecological fallacy. In epidemiology, the fundamental difficulty is the inability of ecological data to characterize within-group variability in exposures and confounders. This results in an inability to control for confounding, and general non-identifiability of the individual-level model. The only solution to the ecological inference problem is to supplement ecological data with individual-level samples; in this proposal we describe and develop a variety of hybrid studies that pursue this solution. Specifically, we develop a hybrid design in which a case-control study is embedded within an ecological study. The intuitive appeal is that the individual-level data provide the basis for the control of bias, while the ecological data provide efficiency gains. In addition, we extend current methods, including the aggregate data design and two-phase method, to the ecological setting. This will be based on the development of Bayesian methods for these designs, which have not been explored. Further, we will compare performance of the various methods in a variety of data/sampling scenarios. A key research question is whether the group-level data provide useful information for the collection of individuals. We will explore optimal study design in terms of how many individuals to sample and from which groups. The methods are illustrated with two cancer data sets and one influenza data set. ? ? ?

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
5R01CA125081-02
Application #
7434489
Study Section
Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer
Dunn, Michelle C
Project Start
2007-06-01
Project End
2010-05-31
Budget Start
2008-06-01
Budget End
2009-05-31
Support Year
2
Fiscal Year
2008
Total Cost
$183,647
Indirect Cost
Name
Group Health Cooperative
Department
Type
DUNS #
078198520
City
Seattle
State
WA
Country
United States
Zip Code
98101
Smoot, E; Haneuse, S (2015) On the analysis of hybrid designs that combine group- and individual-level data. Biometrics 71:227-236
Ross, Michelle; Wakefield, Jon (2013) Bayesian inference for two-phase studies with categorical covariates. Biometrics 69:469-77
Haneuse, Sebastien; Schildcrout, Jonathan; Gillen, Daniel (2012) A two-stage strategy to accommodate general patterns of confounding in the design of observational studies. Biostatistics 13:274-88
Haneuse, S; Chen, J (2011) A multiphase design strategy for dealing with participation bias. Biometrics 67:309-18
Wakefield, Jon; Haneuse, Sebastien; Dobra, Adrian et al. (2011) Bayes computation for ecological inference. Stat Med 30:1381-96
Haneuse, Sebastien; Bartell, Scott (2011) Designs for the combination of group- and individual-level data. Epidemiology 22:382-9
Koehler, Elizabeth; Brown, Elizabeth; Haneuse, Sebastien J-P A (2009) On the Assessment of Monte Carlo Error in Simulation-Based Statistical Analyses. Am Stat 63:155-162
Wakefield, Jon; Haneuse, Sebastien J-P A (2008) Overcoming ecologic bias using the two-phase study design. Am J Epidemiol 167:908-16