Accommodating individual travel history and unsampled diversity in Bayesian phylogeographic inference of SARS-CoV-2

Nat Commun. 2020 Oct 9;11(1):5110. doi: 10.1038/s41467-020-18877-9.

Abstract

Spatiotemporal bias in genome sampling can severely confound discrete-trait phylogeographic inference. This has impeded our ability to accurately track the spread of SARS-CoV-2, the virus responsible for the COVID-19 pandemic, despite the availability of unprecedented numbers of SARS-CoV-2 genomes. Here, we present an approach to integrate individual travel history data in Bayesian phylogeographic inference and apply it to the early spread of SARS-CoV-2. We demonstrate that including travel history data yields (i) more realistic hypotheses of virus spread and (ii) higher posterior predictive accuracy compared to including only sampling location. We further explore methods to ameliorate the impact of sampling bias by augmenting the phylogeographic analysis with lineages from undersampled locations. Our reconstructions reinforce specific transmission hypotheses suggested by the inclusion of travel history data, but also suggest alternative routes of virus migration that are plausible within the epidemiological context but are not apparent with current sampling efforts.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem
  • Betacoronavirus / classification
  • Betacoronavirus / genetics*
  • Betacoronavirus / isolation & purification
  • COVID-19
  • Coronavirus Infections / epidemiology*
  • Coronavirus Infections / transmission*
  • Coronavirus Infections / virology
  • Genome, Viral / genetics
  • Humans
  • Pandemics
  • Phylogeny
  • Phylogeography
  • Pneumonia, Viral / epidemiology*
  • Pneumonia, Viral / transmission*
  • Pneumonia, Viral / virology
  • SARS-CoV-2
  • Travel* / statistics & numerical data