A MATLAB-based tool for accurate detection of perfect overlapping and nested inverted repeats in DNA sequences

Bioinformatics. 2014 Mar 15;30(6):887-8. doi: 10.1093/bioinformatics/btt651. Epub 2013 Nov 8.

Abstract

Summary: Palindromic sequences, or inverted repeats (IRs), in DNA sequences involve important biological processes such as DNA-protein binding, DNA replication and DNA transposition. Development of bioinformatics tools that are capable of accurately detecting perfect IRs can enable genome-wide studies of IR patterns in both prokaryotes and eukaryotes. Different from conventional string-comparison approaches, we propose a novel algorithm that uses a cumulative score system based on a prime number representation of nucleotide bases. We then implemented this algorithm as a MATLAB-based program for perfect IR detection. In comparison with other existing tools, our program demonstrates a high accuracy in detecting nested and overlapping IRs.

Availability and implementation: The source code is freely available on (http://bioinfolab.miamioh.edu/bioinfolab/palindrome.php)

Contact: [email protected] or [email protected]

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Genome
  • Repetitive Sequences, Nucleic Acid*
  • Sequence Analysis, DNA / methods*
  • Software