Bayesian reconstruction of Mycobacterium tuberculosis transmission networks in a high incidence area over two decades in Malawi reveals associated risk factors and genomic variants

Microb Genom. 2020 Apr;6(4):e000361. doi: 10.1099/mgen.0.000361. Epub 2020 Apr 1.

Abstract

Understanding host and pathogen factors that influence tuberculosis (TB) transmission can inform strategies to eliminate the spread of Mycobacterium tuberculosis (Mtb). Determining transmission links between cases of TB is complicated by a long and variable latency period and undiagnosed cases, although methods are improving through the application of probabilistic modelling and whole-genome sequence analysis. Using a large dataset of 1857 whole-genome sequences and comprehensive metadata from Karonga District, Malawi, over 19 years, we reconstructed Mtb transmission networks using a two-step Bayesian approach that identified likely infector and recipient cases, whilst robustly allowing for incomplete case sampling. We investigated demographic and pathogen genomic variation associated with transmission and clustering in our networks. We found that whilst there was a significant decrease in the proportion of infectors over time, we found higher transmissibility and large transmission clusters for lineage 2 (Beijing) strains. By performing evolutionary convergence testing (phyC) and genome-wide association analysis (GWAS) on transmitting versus non-transmitting cases, we identified six loci, PPE54, accD2, PE_PGRS62, rplI, Rv3751 and Rv2077c, that were associated with transmission. This study provides a framework for reconstructing large-scale Mtb transmission networks. We have highlighted potential host and pathogen characteristics that were linked to increased transmission in a high-burden setting and identified genomic variants that, with validation, could inform further studies into transmissibility and TB eradication.

Keywords: Bayesian analysis; Mycobacterium tuberculosis; bioinformatics; molecular epidemiology; pathogen transmission; tuberculosis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Adult
  • Age Distribution
  • Bayes Theorem
  • Databases, Genetic
  • Female
  • Genome, Bacterial
  • Humans
  • Incidence
  • Malawi / epidemiology
  • Male
  • Middle Aged
  • Mycobacterium tuberculosis / classification*
  • Mycobacterium tuberculosis / genetics
  • Phylogeny
  • Polymorphism, Single Nucleotide*
  • Risk Factors
  • Tuberculosis / epidemiology
  • Tuberculosis / transmission*
  • Whole Genome Sequencing
  • Young Adult