Evaluating the accuracy of SHAPE-directed RNA secondary structure predictions

Nucleic Acids Res. 2013 Mar 1;41(5):2807-16. doi: 10.1093/nar/gks1283. Epub 2013 Jan 15.

Abstract

Recent advances in RNA structure determination include using data from high-throughput probing experiments to improve thermodynamic prediction accuracy. We evaluate the extent and nature of improvements in data-directed predictions for a diverse set of 16S/18S ribosomal sequences using a stochastic model of experimental SHAPE data. The average accuracy for 1000 data-directed predictions always improves over the original minimum free energy (MFE) structure. However, the amount of improvement varies with the sequence, exhibiting a correlation with MFE accuracy. Further analysis of this correlation shows that accurate MFE base pairs are typically preserved in a data-directed prediction, whereas inaccurate ones are not. Thus, the positive predictive value of common base pairs is consistently higher than the directed prediction accuracy. Finally, we confirm sequence dependencies in the directability of thermodynamic predictions and investigate the potential for greater accuracy improvements in the worst performing test sequence.

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Computer Simulation*
  • Likelihood Functions
  • Models, Molecular*
  • Nucleic Acid Conformation
  • RNA, Archaeal / chemistry
  • RNA, Bacterial / chemistry
  • RNA, Ribosomal, 16S / chemistry*
  • RNA, Ribosomal, 18S / chemistry*
  • Software*
  • Stochastic Processes
  • Thermodynamics

Substances

  • RNA, Archaeal
  • RNA, Bacterial
  • RNA, Ribosomal, 16S
  • RNA, Ribosomal, 18S