Identifying the geographic leading edge of Lyme disease in the United States with internet searches: A spatiotemporal analysis of Google Health Trends data

PLoS One. 2024 Nov 13;19(11):e0312277. doi: 10.1371/journal.pone.0312277. eCollection 2024.

Abstract

Background: The geographic footprint of Lyme disease is expanding in the United States, which calls for novel methods to identify emerging endemic areas. The ubiquity of internet use coupled with the dominance of Google's search engine makes Google user search data a compelling data source for epidemiological research.

Objective: We evaluated the potential of Google Health Trends to track spatiotemporal patterns in Lyme disease and identify the leading edge of disease risk in the United States.

Materials and methods: We analyzed internet search rates for Lyme disease-related queries at the designated market area (DMA) level (n = 206) for the 2011-2019 and 2020-2021 (COVID-19 pandemic) periods. We used maps and other exploratory methods to characterize changes in search behavior. To assess statistical correlation between searches and Lyme disease cases reported to Centers for Disease Control and Prevention (CDC) between 2011 and 2019, we performed a longitudinal ecological analysis with modified Poisson generalized estimating equation regression models.

Results: Mapping DMA-level changes in "Lyme disease" search rates revealed an expanding area of higher rates occurring along the edges of the northeastern focus of Lyme disease. Bivariate maps comparing search rates and CDC-reported incidence rates also showed a stronger than expected signal from Google Health Trends in some high-risk adjacent states such as Michigan, North Carolina, and Ohio, which may be further indication of a geographic leading edge of Lyme disease that is not fully apparent from routine surveillance. Searches for "Lyme disease" were a significant predictor of CDC-reported disease incidence. Each 100-unit increase in the search rate was significantly associated with a 10% increase in incidence rates (RR = 1.10, 95% CI: 1.07, 1.12) after adjusting for environmental covariates of Lyme disease identified in the literature.

Conclusion: Google Health Trends data may help track the expansion of Lyme disease and inform the public and health care providers about emerging risks in their areas.

MeSH terms

  • Humans
  • Internet*
  • Lyme Disease* / epidemiology
  • Search Engine* / statistics & numerical data
  • Search Engine* / trends
  • Spatio-Temporal Analysis*
  • United States / epidemiology

Grants and funding

This work was funded by a gift from the David P. Nolan Charitable Fund (to FCC, no grant number). The funder was not involved in data collection and analysis, decision to publish, or preparation of the manuscript.