Quantifying the effect of media limitations on outbreak data in a global online web-crawling epidemic intelligence system, 2008-2011

Emerg Health Threats J. 2013 Nov 8:6:21621. doi: 10.3402/ehtj.v6i0.21621.

Abstract

Background: This is the first study quantitatively evaluating the effect that media-related limitations have on data from an automated epidemic intelligence system.

Methods: We modeled time series of HealthMap's two main data feeds, Google News and Moreover, to test for evidence of two potential limitations: first, human resources constraints, and second, high-profile outbreaks "crowding out" coverage of other infectious diseases.

Results: Google News events declined by 58.3%, 65.9%, and 14.7% on Saturday, Sunday and Monday, respectively, relative to other weekdays. Events were reduced by 27.4% during Christmas/New Years weeks and 33.6% lower during American Thanksgiving week than during an average week for Google News. Moreover data yielded similar results with the addition of Memorial Day (US) being associated with a 36.2% reduction in events. Other holiday effects were not statistically significant. We found evidence for a crowd out phenomenon for influenza/H1N1, where a 50% increase in influenza events corresponded with a 4% decline in other disease events for Google News only. Other prominent diseases in this database - avian influenza (H5N1), cholera, or foodborne illness - were not associated with a crowd out phenomenon.

Conclusions: These results provide quantitative evidence for the limited impact of editorial biases on HealthMap's web-crawling epidemic intelligence.

Keywords: HealthMap; crowd out effect; epidemic intelligence; infectious diseases; system evaluation.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Cholera / epidemiology
  • Communicable Diseases
  • Databases, Factual / standards*
  • Disease Outbreaks / statistics & numerical data*
  • Evaluation Studies as Topic
  • Foodborne Diseases / epidemiology
  • Humans
  • Influenza A Virus, H5N1 Subtype
  • Influenza, Human / epidemiology
  • Internet
  • Mass Media / trends*
  • Online Systems / organization & administration*
  • Personnel Staffing and Scheduling
  • Population Surveillance / methods