Insights into the phylogeny and coding potential of microbial dark matter

Nature. 2013 Jul 25;499(7459):431-7. doi: 10.1038/nature12352. Epub 2013 Jul 14.

Abstract

Genome sequencing enhances our understanding of the biological world by providing blueprints for the evolutionary and functional diversity that shapes the biosphere. However, microbial genomes that are currently available are of limited phylogenetic breadth, owing to our historical inability to cultivate most microorganisms in the laboratory. We apply single-cell genomics to target and sequence 201 uncultivated archaeal and bacterial cells from nine diverse habitats belonging to 29 major mostly uncharted branches of the tree of life, so-called 'microbial dark matter'. With this additional genomic information, we are able to resolve many intra- and inter-phylum-level relationships and to propose two new superphyla. We uncover unexpected metabolic features that extend our understanding of biology and challenge established boundaries between the three domains of life. These include a novel amino acid use for the opal stop codon, an archaeal-type purine synthesis in Bacteria and complete sigma factors in Archaea similar to those in Bacteria. The single-cell genomes also served to phylogenetically anchor up to 20% of metagenomic reads in some habitats, facilitating organism-level interpretation of ecosystem function. This study greatly expands the genomic representation of the tree of life and provides a systematic step towards a better understanding of biological evolution on our planet.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Archaea / classification*
  • Archaea / genetics*
  • Archaea / isolation & purification
  • Archaea / metabolism
  • Bacteria / classification*
  • Bacteria / genetics*
  • Bacteria / isolation & purification
  • Bacteria / metabolism
  • Ecosystem
  • Genome, Archaeal / genetics
  • Genome, Bacterial / genetics
  • Metagenome / genetics
  • Metagenomics*
  • Molecular Sequence Data
  • Phylogeny*
  • Sequence Analysis, DNA
  • Single-Cell Analysis