Detecting distant homologs using phylogenetic tree-based HMMs

Proteins. 2003 Aug 15;52(3):446-53. doi: 10.1002/prot.10373.

Abstract

It is often desired to identify further homologs of a family of biological sequences from the ever-growing sequence databases. Profile hidden Markov models excel at capturing the common statistical features of a group of biological sequences. With these common features, we can search the biological database and find new homologous sequences. Most general profile hidden Markov model methods, however, treat the evolutionary relationships between the sequences in a homologous group in an ad-hoc manner. We hereby introduce a method to incorporate phylogenetic information directly into hidden Markov models, and demonstrate that the resulting model performs better than most of the current multiple sequence-based methods for finding distant homologs.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Databases, Protein
  • Markov Chains*
  • Phylogeny*
  • Protein Conformation
  • Proteins / chemistry
  • Proteins / genetics*
  • Sequence Alignment / methods
  • Sequence Homology, Amino Acid

Substances

  • Proteins