RNA sequence analysis using covariance models

Nucleic Acids Res. 1994 Jun 11;22(11):2079-88. doi: 10.1093/nar/22.11.2079.

Abstract

We describe a general approach to several RNA sequence analysis problems using probabilistic models that flexibly describe the secondary structure and primary sequence consensus of an RNA sequence family. We call these models 'covariance models'. A covariance model of tRNA sequences is an extremely sensitive and discriminative tool for searching for additional tRNAs and tRNA-related sequences in sequence databases. A model can be built automatically from an existing sequence alignment. We also describe an algorithm for learning a model and hence a consensus secondary structure from initially unaligned example sequences and no prior structural information. Models trained on unaligned tRNA examples correctly predict tRNA secondary structure and produce high-quality multiple alignments. The approach may be applied to any family of small RNA sequences.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Base Sequence
  • Caenorhabditis elegans / genetics
  • Consensus Sequence
  • Information Systems
  • Models, Genetic*
  • Models, Statistical*
  • Molecular Sequence Data
  • Nucleic Acid Conformation
  • RNA, Transfer / chemistry
  • RNA, Transfer / genetics*
  • Sequence Analysis, RNA / methods*
  • Sequence Homology, Nucleic Acid

Substances

  • RNA, Transfer