Parametric analysis of alignment and phylogenetic uncertainty

Bull Math Biol. 2011 Apr;73(4):795-810. doi: 10.1007/s11538-010-9610-8. Epub 2011 Mar 16.

Abstract

To infer a phylogenetic tree from a set of DNA sequences, a multiple alignment is typically computed first to identify homologous bases. The inferred phylogeny can be very sensitive to how the alignment was created. We develop tools for analyzing the robustness of phylogeny to perturbations in the alignment parameters of the Needleman-Wunsch (NW) algorithm. Our main tool is parametric alignment, with novel improvements that are of general interest in parametric inference. Using parametric alignment and a Gaussian distribution on alignment parameters, we derive probabilities of optimal alignment summaries and inferred phylogenies. We apply our method to analyze intronic sequences from Drosophila flies. We show that phylogeny estimates can be sensitive to the choice of alignment parameters, and that parametric alignment elucidates the relationship between alignment parameters and reconstructed trees.
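
The dependence of an optimal alignment on its parameters can be illustrated with a small sketch. This is not the method of the paper, which derives the probabilities from parametric alignment. The code below instead approximates them by brute-force sampling: it draws penalties from a Gaussian, runs a Needleman-Wunsch-style dynamic program for each draw, and tallies the resulting alignment summary (number of mismatches, number of gap positions). The function names, the scoring scheme (matches cost 0, with a mismatch penalty and a linear gap penalty), and the rejection of non-positive penalties are all assumptions made for illustration.

```python
import random
from collections import Counter


def nw_summary(a, b, mismatch, gap):
    """Return (mismatches, gaps) for one minimum-cost global alignment.

    Assumed scoring: match costs 0, each mismatch costs `mismatch`,
    each gap position costs `gap` (linear gap penalty).
    """
    n, m = len(a), len(b)
    # table[i][j] = (cost, mismatches, gaps) for aligning a[:i] with b[:j]
    table = [[None] * (m + 1) for _ in range(n + 1)]
    table[0][0] = (0.0, 0, 0)
    for i in range(1, n + 1):
        table[i][0] = (i * gap, 0, i)
    for j in range(1, m + 1):
        table[0][j] = (j * gap, 0, j)
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            differs = a[i - 1] != b[j - 1]
            c, x, s = table[i - 1][j - 1]
            diag = (c + (mismatch if differs else 0.0), x + int(differs), s)
            c, x, s = table[i - 1][j]
            up = (c + gap, x, s + 1)
            c, x, s = table[i][j - 1]
            left = (c + gap, x, s + 1)
            # Tuples compare by cost first, so ties are broken deterministically.
            table[i][j] = min(diag, up, left)
    _, x, s = table[n][m]
    return x, s


def summary_probabilities(a, b, mean, sd, draws=1000, seed=0):
    """Monte Carlo estimate of how often each summary is optimal.

    `mean` and `sd` are (mismatch, gap) pairs for independent Gaussians.
    Non-positive draws are rejected (an assumption of this sketch).
    """
    rng = random.Random(seed)
    counts = Counter()
    kept = 0
    while kept < draws:
        mismatch = rng.gauss(mean[0], sd[0])
        gap = rng.gauss(mean[1], sd[1])
        if mismatch <= 0 or gap <= 0:
            continue
        counts[nw_summary(a, b, mismatch, gap)] += 1
        kept += 1
    return {summary: count / draws for summary, count in counts.items()}


if __name__ == "__main__":
    # Toy sequences: the optimum switches between two mismatches with
    # no gaps and no mismatches with two gaps as the penalties change.
    print(summary_probabilities("ACGT", "AGCT", mean=(1.0, 1.0), sd=(0.5, 0.5)))
```

For the toy pair above, the optimal summary is two mismatches and no gaps when the gap penalty exceeds the mismatch penalty, and no mismatches with two gap positions otherwise, so the sampled frequencies show how much probability mass a given parameter distribution places on each alignment.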

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Alcohol Dehydrogenase / genetics
  • Algorithms
  • Animals
  • Base Sequence / genetics
  • Drosophila / genetics
  • Drosophila Proteins / genetics
  • Introns / genetics
  • Normal Distribution
  • Phylogeny*
  • Probability
  • Sequence Alignment / methods
  • Sequence Alignment / statistics & numerical data*
  • Sequence Homology, Nucleic Acid
  • Software
  • Synaptotagmins / genetics

Substances

  • Drosophila Proteins
  • Synaptotagmins
  • ADH protein, Drosophila
  • Alcohol Dehydrogenase