Bayes computation for ecological inference

Stat Med. 2011 May 30;30(12):1381-96. doi: 10.1002/sim.4214. Epub 2011 Feb 22.

Abstract

Ecological data are available at the level of the group, rather than at the level of the individual. The use of ecological data in spatial epidemiological investigations is particularly common. Although the computational methods described are more generally applicable, this paper concentrates on the situation in which the margins of 2 × 2 tables are observed in each of n geographical areas, with a Bayesian approach to inference. We consider auxiliary schemes that impute the missing data, and compare with a previously suggested normal approximation. The analysis of ecological data is subject to ecological bias, with the only reliable means of removing such bias being the addition of auxiliary individual-level information. Various schemes have been suggested for this supplementation, and we illustrate how the computational methods may be applied to the analysis of such enhanced data. The methods are illustrated using simulated data and two examples. In the first example, the ecological data are supplemented with a simple random sample of individual-level data, and in this example the normal approximation fails. In the second example case-control sampling provides the additional information.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural

MeSH terms

  • Bayes Theorem*
  • Computer Simulation
  • Data Interpretation, Statistical*
  • Diabetes Mellitus / epidemiology
  • Epidemiologic Methods*
  • Female
  • Humans
  • Markov Chains
  • Models, Statistical*
  • Monte Carlo Method
  • Numerical Analysis, Computer-Assisted*