Toward more accurate ancestral protein genotype-phenotype reconstructions with the use of species tree-aware gene trees

Mol Biol Evol. 2015 Jan;32(1):13-22. doi: 10.1093/molbev/msu305. Epub 2014 Nov 4.

Abstract

The resurrection of ancestral proteins provides direct insight into how natural selection has shaped proteins found in nature. By tracing substitutions along a gene phylogeny, ancestral proteins can be reconstructed in silico and subsequently synthesized in vitro. This elegant strategy reveals the complex mechanisms responsible for the evolution of protein functions and structures. However, to date, all protein resurrection studies have used simplistic approaches for ancestral sequence reconstruction (ASR), including the assumption that a single sequence alignment alone is sufficient to accurately reconstruct the history of the gene family. The impact of such shortcuts on conclusions about ancestral functions has not been investigated. Here, we show with simulations that utilizing information on species history using a model that accounts for the duplication, horizontal transfer, and loss (DTL) of genes statistically increases ASR accuracy. This underscores the importance of the tree topology in the inference of putative ancestors. We validate our in silico predictions using in vitro resurrection of the LeuB enzyme for the ancestor of the Firmicutes, a major and ancient bacterial phylum. With this particular protein, our experimental results demonstrate that information on the species phylogeny results in a biochemically more realistic and kinetically more stable ancestral protein. Additional resurrection experiments with different proteins are necessary to statistically quantify the impact of using species tree-aware gene trees on ancestral protein phenotypes. Nonetheless, our results suggest the need for incorporating both sequence and DTL information in future studies of protein resurrections to accurately define the genotype-phenotype space in which proteins diversify.

Keywords: ancestral sequence reconstruction; gene tree reconciliation; lateral gene transfer; phylogeny; protein evolution; protein resurrection.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Bacterial Proteins / genetics
  • Computational Biology / methods*
  • Computer Simulation
  • Evolution, Molecular
  • Genotype
  • Gram-Positive Bacteria / enzymology
  • Gram-Positive Bacteria / genetics
  • Phenotype
  • Phylogeny
  • Proteins / genetics*

Substances

  • Bacterial Proteins
  • Proteins