Metagenomic compendium of 189,680 DNA viruses from the human gut microbiome

Nat Microbiol. 2021 Jul;6(7):960-970. doi: 10.1038/s41564-021-00928-6. Epub 2021 Jun 24.

Abstract

Bacteriophages have important roles in the ecology of the human gut microbiome but are under-represented in reference databases. To address this problem, we assembled the Metagenomic Gut Virus catalogue that comprises 189,680 viral genomes from 11,810 publicly available human stool metagenomes. Over 75% of genomes represent double-stranded DNA phages that infect members of the Bacteroidia and Clostridia classes. Based on sequence clustering we identified 54,118 candidate viral species, 92% of which were not found in existing databases. The Metagenomic Gut Virus catalogue improves detection of viruses in stool metagenomes and accounts for nearly 40% of CRISPR spacers found in human gut Bacteria and Archaea. We also produced a catalogue of 459,375 viral protein clusters to explore the functional potential of the gut virome. This revealed tens of thousands of diversity-generating retroelements, which use error-prone reverse transcription to mutate target genes and may be involved in the molecular arms race between phages and their bacterial hosts.
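
The abstract does not say how the sequence clustering behind the 54,118 candidate viral species was done, so the following is only a minimal sketch of one common approach, greedy centroid-based clustering on pairwise average nucleotide identity (ANI). The function name `greedy_cluster`, the genome names, the ANI values and the 95% cutoff are all illustrative assumptions and are not taken from the paper.

```python
from typing import Dict, List, Tuple


def greedy_cluster(
    lengths: Dict[str, int],
    ani: Dict[Tuple[str, str], float],
    min_ani: float = 95.0,  # assumed species-level cutoff, not from the abstract
) -> Dict[str, List[str]]:
    """Greedy centroid clustering: visit genomes from longest to shortest and
    assign each to the first existing centroid it matches, else start a new cluster."""

    def similarity(a: str, b: str) -> float:
        # Pairwise ANI may be stored in either orientation; missing pairs count as 0.
        return ani.get((a, b), ani.get((b, a), 0.0))

    clusters: Dict[str, List[str]] = {}
    # Longest genome first; ties broken by name so the result is deterministic.
    for genome in sorted(lengths, key=lambda g: (-lengths[g], g)):
        for centroid in clusters:
            if similarity(genome, centroid) >= min_ani:
                clusters[centroid].append(genome)
                break
        else:
            # No centroid was similar enough: this genome seeds a new cluster.
            clusters[genome] = [genome]
    return clusters


# Hypothetical toy input: genome lengths (bp) and pairwise ANI (%).
lengths = {"phageA": 50000, "phageB": 48000, "phageC": 40000, "phageD": 35000}
ani = {
    ("phageA", "phageB"): 97.2,
    ("phageA", "phageC"): 80.1,
    ("phageC", "phageD"): 96.0,
}

clusters = greedy_cluster(lengths, ani)
for centroid, members in clusters.items():
    print(centroid, members)
```

In this toy input the four genomes collapse into two candidate species, each represented by its longest member. A real workflow would typically also require a minimum aligned fraction between genome pairs, which this sketch omits for brevity.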

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Archaea / virology
  • Bacteria / virology
  • Bacteriophages / genetics
  • Catalogs as Topic
  • DNA Viruses / classification
  • DNA Viruses / genetics*
  • DNA, Viral / genetics
  • Feces / microbiology
  • Gastrointestinal Microbiome / genetics*
  • Genetic Variation
  • Genome, Viral / genetics*
  • Humans
  • Metagenomics
  • Phylogeny
  • Viral Proteins / genetics

Substances

  • DNA, Viral
  • Viral Proteins