Finding short DNA motifs using permuted Markov models

J Comput Biol. 2005 Jul-Aug;12(6):894-906. doi: 10.1089/cmb.2005.12.894.

Abstract

Many short DNA motifs, such as transcription factor binding sites (TFBS) and splice sites, exhibit strong local as well as nonlocal dependence. We introduce permuted variable length Markov models (PVLMM) which could capture the potentially important dependencies among positions and apply them to the problem of detecting splice and TFB sites. They have been satisfactory from the viewpoint of prediction performance and also give ready biological interpretations of the sequence dependence observed. The issue of model selection is also studied.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Binding Sites
  • Computational Biology
  • DNA / genetics*
  • DNA / metabolism
  • Markov Chains*
  • Models, Statistical*
  • Promoter Regions, Genetic / genetics*
  • Protein Binding
  • Transcription Factors / genetics*
  • Transcription Factors / metabolism

Substances

  • Transcription Factors
  • DNA