The worst case complexity of maximum parsimony

J Comput Biol. 2014 Nov;21(11):799-808. doi: 10.1089/cmb.2014.0128. Epub 2014 Oct 10.

Abstract

One of the core classical problems in computational biology is that of constructing the most parsimonious phylogenetic tree interpreting an input set of sequences from the genomes of evolutionarily related organisms. We reexamine the classical maximum parsimony (MP) optimization problem for the general (asymmetric) scoring matrix case, where rooted phylogenies are implied, and analyze the worst case bounds of three approaches to MP: The approach of Cavalli-Sforza and Edwards, the approach of Hendy and Penny, and a new agglomerative, "bottom-up" approach we present in this article. We show that the second and third approaches are faster than the first one by a factor of Θ(√n) and Θ(n), respectively, where n is the number of species.

Keywords: asymmetric scoring matrix; dendograms; large parsimony; maximum parsimony; phylogenetic reconstruction; phylogeny.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Animals
  • Computational Biology / methods*
  • Evolution, Molecular*
  • Humans
  • Models, Statistical*
  • Phylogeny*