Skip Navigation
Grant Details

Grant Number: 5R01CA125081-03 Interpret this number
Primary Investigator: Haneuse, Sebastien
Organization: Group Health Cooperative
Project Title: Design and Inference for Hybrid Ecological Studies
Fiscal Year: 2009
Back to top


DESCRIPTION (provided by applicant): Ecological studies may be defined examining associations at the group level. They are appealing in that they make use of routinely available data, and also offer the potential of high power due to large populations and broad exposure contrasts. However, they are also susceptible to a range of biases with respect to individual-level associations, collectively termed ecological bias, and may lead to the ecological fallacy. In epidemiology, the fundamental difficulty is the inability of ecological data to characterize within-group variability in exposures and confounders. This results in an inability to control for confounding, and general non-identifiability of the individual-level model. The only solution to the ecological inference problem is to supplement ecological data with individual-level samples; in this proposal we describe and develop a variety of hybrid studies that pursue this solution. Specifically, we develop a hybrid design in which a case-control study is embedded within an ecological study. The intuitive appeal is that the individual-level data provide the basis for the control of bias, while the ecological data provide efficiency gains. In addition, we extend current methods, including the aggregate data design and two-phase method, to the ecological setting. This will be based on the development of Bayesian methods for these designs, which have not been explored. Further, we will compare performance of the various methods in a variety of data/sampling scenarios. A key research question is whether the group-level data provide useful information for the collection of individuals. We will explore optimal study design in terms of how many individuals to sample and from which groups. The methods are illustrated with two cancer data sets and one influenza data set.

Back to top