RNA canonical and non-canonical base pairing types: a recognition method and complete repertoire

Nucleic Acids Res. 2002 Oct 1;30(19):4250-63. doi: 10.1093/nar/gkf540.

Abstract

The problem of systematic and objective identification of canonical and non-canonical base pairs in RNA three-dimensional (3D) structures was studied. A probabilistic approach was applied, and an algorithm and its implementation in a computer program that detects and analyzes all the base pairs contained in RNA 3D structures were developed. The algorithm objectively distinguishes among canonical and non-canonical base pairing types formed by three, two and one hydrogen bonds (H-bonds), as well as those containing bifurcated and C-H.X...H-bonds. The nodes of a bipartite graph are used to encode the donor and acceptor atoms of a 3D structure. The capacities of the edges correspond to probabilities computed from the geometry of the donor and acceptor groups to form H-bonds. The maximum flow from donors to acceptors directly identifies base pairs and their types. A complete repertoire of base pairing types was built from the detected H-bonds of all X-ray crystal structures of a resolution of 3.0 A or better, including the large and small ribosomal subunits. The base pairing types are labeled using an extension of the nomenclature recently introduced by Leontis and Westhof. The probabilistic method was implemented in MC-Annotate, an RNA structure analysis computer program used to determine the base pairing parameters of the 3D modeling system MC-Sym.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Pairing*
  • Escherichia coli / genetics
  • Hydrogen Bonding
  • Models, Chemical
  • Nucleic Acid Conformation
  • RNA / chemistry*
  • RNA, Bacterial / chemistry
  • RNA, Ribosomal, 5S / chemistry

Substances

  • RNA, Bacterial
  • RNA, Ribosomal, 5S
  • RNA