An Integrated Perspective on Phylogenetic Workflows

Trends Ecol Evol. 2016 Feb;31(2):116-126. doi: 10.1016/j.tree.2015.12.007. Epub 2016 Jan 5.

Abstract

Molecular phylogenetics is the study of evolutionary relationships between biological sequences, often to infer the evolutionary relationships of organisms. These studies require many analysis components, including sequence assembly, identification of homologous sequences, gene tree inference, and species tree inference. At present, each component is usually treated as a single step in a linear analysis, where the output of each component is passed as input to the next as a point estimate. Here we outline a generative model that helps clarify assumptions that are implicit to phylogenetic workflows, focusing on the assumption of low relative entropy. This perspective unifies currently disparate advances, and will help investigators evaluate which steps would benefit the most from additional computation and future methods development.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.
  • Review

MeSH terms

  • Animals
  • Computational Biology
  • Evolution, Molecular*
  • Gene Flow*
  • Phylogeny*
  • Software
  • Species Specificity