PairWise and SearchWise: finding the optimal alignment in a simultaneous comparison of a protein profile against all DNA translation frames

Nucleic Acids Res. 1996 Jul 15;24(14):2730-9. doi: 10.1093/nar/24.14.2730.

Abstract

DNA translation frames can be disrupted for several reasons, including: (i) errors in sequence determination; (ii) RNA processing, such as intron removal and guide RNA editing; (iii) less commonly, polymerase frameshifting during transcription or ribosomal frameshifting during translation. Frameshifts frequently confound computational activities involving homologous sequences, such as database searches and inferences on structure, function or phylogeny made from multiple alignments. A dynamic alignment algorithm is reported here which compares a protein profile (a residue scoring matrix for one or more aligned sequences) against the three translation frames of a DNA strand, allowing frameshifting. The algorithm has been incorporated into a new package, WiseTools, for comparison of biological sequences. A protein profile can be compared against either a DNA sequence or a protein sequence. The program PairWise may be used interactively for alignment of any two sequence inputs. SearchWise can perform combinations of searches through DNA or protein databases by a protein profile or DNA sequence. Routine application of the programs has revealed a set of database entries with frameshifts caused by errors in sequence determination.

Publication types

  • Comparative Study

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • DNA / chemistry
  • DNA / genetics*
  • Databases, Factual
  • Frameshift Mutation
  • Humans
  • Information Storage and Retrieval
  • Molecular Sequence Data
  • Protein Biosynthesis*
  • Proteins / chemistry
  • Proteins / genetics*
  • Repetitive Sequences, Nucleic Acid
  • Sequence Alignment*
  • Software

Substances

  • Proteins
  • DNA

Associated data

  • GENBANK/L04284
  • GENBANK/L08961
  • GENBANK/L24895
  • GENBANK/L33768
  • GENBANK/M18953
  • GENBANK/M33166
  • GENBANK/M33880
  • GENBANK/M58587
  • GENBANK/M61877
  • GENBANK/M96564
  • GENBANK/M96565
  • GENBANK/U00061
  • GENBANK/U00111
  • GENBANK/U17431
  • GENBANK/U22181
  • GENBANK/X12671
  • GENBANK/X16316
  • GENBANK/X51315
  • GENBANK/X51476
  • GENBANK/X53090
  • GENBANK/X54530
  • GENBANK/X73879
  • GENBANK/X75329
  • GENBANK/X78116
  • GENBANK/X78428
  • SWISSPROT/P13217
  • SWISSPROT/P13226
  • SWISSPROT/P13277
  • SWISSPROT/P17279
  • SWISSPROT/P18250