An open-access database of infectious disease transmission trees to explore superspreader epidemiology

PLoS Biol. 2022 Jun 22;20(6):e3001685. doi: 10.1371/journal.pbio.3001685. eCollection 2022 Jun.

Abstract

Historically, emerging and reemerging infectious diseases have caused large, deadly, and expensive multinational outbreaks. Often outbreak investigations aim to identify who infected whom by reconstructing the outbreak transmission tree, which visualizes transmission between individuals as a network with nodes representing individuals and branches representing transmission from person to person. We compiled a database, called OutbreakTrees, of 382 published, standardized transmission trees consisting of 16 directly transmitted diseases ranging in size from 2 to 286 cases. For each tree and disease, we calculated several key statistics, such as tree size, average number of secondary infections, the dispersion parameter, and the proportion of cases considered superspreaders, and examined how these statistics varied over the course of each outbreak and under different assumptions about the completeness of outbreak investigations. We demonstrated the potential utility of the database through 2 short analyses addressing questions about superspreader epidemiology for a variety of diseases, including Coronavirus Disease 2019 (COVID-19). First, we found that our transmission trees were consistent with theory predicting that intermediate dispersion parameters give rise to the highest proportion of cases causing superspreading events. Additionally, we investigated patterns in how superspreaders are infected. Across trees with more than 1 superspreader, we found preliminary support for the theory that superspreaders generate other superspreaders. In sum, our findings put the role of superspreading in COVID-19 transmission in perspective with that of other diseases and suggest an approach to further research regarding the generation of superspreaders. These data have been made openly available to encourage reuse and further scientific inquiry.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • COVID-19* / epidemiology
  • Decision Trees*
  • Disease Outbreaks
  • Disease Transmission, Infectious
  • Humans

Associated data

  • Dryad/10.5061/dryad.nk98sf7w7

Grants and funding

JCT and JMD were supported by the Population Biology of Infectious Diseases REU Site, National Science Foundation grant DBI-1659683 (https://www.nsf.gov/awardsearch/showAward?AWD_ID=1659683). PBM was supported by National Science Foundation grant DGE-1545433 (https://www.nsf.gov/awardsearch/showAward?AWD_ID=1545433&HistoricalAwards=false). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.