Automatic detection of conserved base pairing patterns in RNA virus genomes

Comput Chem. 1999 Jun 15;23(3-4):401-14. doi: 10.1016/s0097-8485(99)00013-3.

Abstract

Almost all RNA molecules--and consequently also almost all subsequences of a large RNA molecule-form secondary structures. The presence of secondary structure in itself therefore does not indicate any functional significance. In fact, we cannot expect a conserved secondary structure for all parts of a viral genome or a mRNA, even if there is a significant level of sequence conservation. We present a novel method for detecting conserved RNA secondary structures in a family of related RNA sequences. The method is based on combining the prediction of base pair probability matrices and comparative sequence analysis. It can be applied to small sets of long sequences and does not require a prior knowledge of conserved sequence or structure motifs. As such it can be used to scan large amounts of sequence data for regions that warrant further experimental investigation. Applications to complete genomic RNAs of some viruses show that in all cases the known secondary structure features are identified. In addition, we predict a substantial number of conserved structural elements which have not been described so far.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Base Pairing*
  • Base Sequence
  • Genome, Viral*
  • Molecular Sequence Data
  • Nucleic Acid Conformation
  • RNA Viruses / genetics*
  • RNA, Viral / chemistry
  • RNA, Viral / genetics

Substances

  • RNA, Viral

Associated data

  • GENBANK/D00917
  • GENBANK/D10112
  • GENBANK/K02013
  • GENBANK/K03454
  • GENBANK/K03456
  • GENBANK/L02317
  • GENBANK/L06436
  • GENBANK/L20571
  • GENBANK/L20587
  • GENBANK/M12508
  • GENBANK/M17451
  • GENBANK/M22639
  • GENBANK/M26727
  • GENBANK/M27323
  • GENBANK/M31171
  • GENBANK/M62320
  • GENBANK/U23487
  • GENBANK/U27490
  • GENBANK/U27491
  • GENBANK/U27492
  • GENBANK/U27493
  • GENBANK/U27494
  • GENBANK/U27495
  • GENBANK/U27496
  • GENBANK/U39292
  • GENBANK/X04414
  • GENBANK/X16109
  • GENBANK/X61034
  • GENBANK/X61240
  • GENBANK/Z84205