Rooting the tree of life using nonubiquitous genes

Mol Biol Evol. 2007 Jan;24(1):130-6. doi: 10.1093/molbev/msl140. Epub 2006 Oct 5.

Abstract

Insertion and deletion (indel)-based analyses have great potential for rooting the tree of life, but their use has been limited because they require ubiquitous sequences that have not been horizontally/laterally transferred. Very few such sequences exist. Here we describe and demonstrate a new algorithm that can use nonubiquitous sequences for rooting. This algorithm, top-down indel rooting, uses the traditional logical framework of indel rooting, but by considering gene gains and losses in addition to indel gains and losses, it is able to analyze incomplete data sets. The method is demonstrated using theoretical examples and incomplete gene sets. In particular, it is applied to the well-studied Hsp70/MreB indel, a sequence set thought to have been compromised by gene transfers from Firmicutes to archaebacteria. By sequentially assigning all observable character states, including gene absences, to the questionable archaebacterial Hsp70 and MreB sequences, we demonstrate that this gene set robustly excludes the root of the tree of life from the Gram-negative, double-membrane prokaryotes independently of the archaeal character states. There are very few ubiquitous paralog gene sets, and most of them contain compromised data. The ability of top-down rooting to use incomplete and/or compromised gene sets promises to make rooting analyses more robust and to greatly increase the number of useful indel sets.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Archaea / classification
  • Archaea / genetics*
  • Bacteria / classification
  • Bacteria / genetics*
  • Computational Biology / methods*
  • Gene Deletion
  • HSP70 Heat-Shock Proteins / genetics
  • Mutagenesis, Insertional
  • Phylogeny*

Substances

  • HSP70 Heat-Shock Proteins