PhyloPlus: a Universal Tool for Phylogenetic Interrogation of Metagenomic Communities

mBio. 2023 Feb 28;14(1):e0345522. doi: 10.1128/mbio.03455-22. Epub 2023 Jan 16.

Abstract

Phylogeny is a powerful tool that can be incorporated into quantitative descriptions of community diversity, yet its use has been limited largely due to the difficulty in constructing phylogenies which incorporate the wide genomic diversity of microbial communities. Here, we describe the development of a web portal, PhyloPlus, which enables users to generate customized phylogenies that may be applied to any bacterial or archaeal communities. We demonstrate the power of phylogeny by comparing metrics that employ phylogeny with those that do not when applied to data sets from two metagenomic studies (fermented food, n = 58; human microbiome, n = 60). This example shows how inclusion of all bacterial species identified by taxonomic classifiers (Kraken2 and Kaiju) made the phylogeny perfectly congruent to the corresponding classification outputs. Our phylogeny-based approach also enabled the construction of more constrained null models which (i) shed light into community structure and (ii) minimize potential inflation of type I errors. Construction of such null models allowed for the observation of under-dispersion in 44 (75.86%) food samples, with the metacommunity defined as bacteria that were found in different food matrices. We also observed that closely related species with high abundance and uneven distribution across different sites could potentially exaggerate the dissimilarity between phylogenetically similar communities if they were measured using traditional species-based metrics (Padj. = 0.003), whereas this effect was mitigated by incorporating phylogeny (Padj. = 1). In summary, our tool can provide additional insights into microbial communities of interest and facilitate the use of phylogeny-based approaches in metagenomic analyses. IMPORTANCE There has been an explosion of interest in how microbial diversity affects human health, food safety, and environmental functions among many other processes. Accurately measuring the diversity and structure of those communities is central to understanding their effects. Here, we describe the development of a freely available online tool, PhyloPlus, which allows users to generate custom phylogenies that may be applied to any data set, thereby removing a major obstacle to the application of phylogeny to metagenomic data analysis. We demonstrate that the genetic relatedness of the organisms within those communities is a critical feature of their overall diversity, and that using a phylogeny which captures and quantifies this diversity allows for much more accurate descriptions while preventing misleading conclusions based on estimates that ignore evolutionary relationships.

Keywords: diversity; metagenomics; microbial genomics; microbiome; phylogeny.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Bacteria / genetics
  • Humans
  • Metagenome*
  • Metagenomics
  • Microbiota* / genetics
  • Phylogeny
  • RNA, Ribosomal, 16S / genetics

Substances

  • RNA, Ribosomal, 16S