A pairwise alignment algorithm which favors clusters of blocks

J Comput Biol. 2005;12(1):33-47. doi: 10.1089/cmb.2005.12.33.

Abstract

Pairwise sequence alignments aim to decide whether two sequences are related and, if so, to exhibit their related domains. Recent works have pointed out that a significant number of true homologous sequences are missed when using classical comparison algorithms. This is the case when two homologous sequences share several little blocks of homology, too small to lead to a significant score. On the other hand, classical alignment algorithms, when detecting homologies, may fail to recognize all the significant biological signals. The aim of the paper is to give a solution to these two problems. We propose a new scoring method which tends to increase the score of an alignment when "blocks" are detected. This so-called Block-Scoring algorithm, which makes use of dynamic programming, is worth being used as a complementary tool to classical exact alignments methods. We validate our approach by applying it on a large set of biological data. Finally, we give a limit theorem for the score statistics of the algorithm.

MeSH terms

  • Algorithms*
  • Base Sequence
  • Computational Biology / methods*
  • Computer Simulation
  • Molecular Sequence Data
  • RNA / chemistry*
  • Sequence Alignment*
  • Sequence Homology, Nucleic Acid*

Substances

  • RNA