Prediction of lncRNAs and their interactions with nucleic acids: benchmarking bioinformatics tools

Brief Bioinform. 2019 Mar 22;20(2):551-564. doi: 10.1093/bib/bby032.

Abstract

The genomes of mammalian species are pervasively transcribed producing as many noncoding as protein-coding RNAs. There is a growing body of evidence supporting their functional role. Long noncoding RNA (lncRNA) can bind both nucleic acids and proteins through several mechanisms. A reliable computational prediction of the most probable mechanism of lncRNA interaction can facilitate experimental validation of its function. In this study, we benchmarked computational tools capable to discriminate lncRNA from mRNA and predict lncRNA interactions with other nucleic acids. We assessed the performance of 9 tools for distinguishing protein-coding from noncoding RNAs, as well as 19 tools for prediction of RNA-RNA and RNA-DNA interactions. Our conclusions about the considered tools were based on their performances on the entire genome/transcriptome level, as it is the most common task nowadays. We found that FEELnc and CPAT distinguish between coding and noncoding mammalian transcripts in the most accurate manner. ASSA, RIBlast and LASTAL, as well as Triplexator, turned out to be the best predictors of RNA-RNA and RNA-DNA interactions, respectively. We showed that the normalization of the predicted interaction strength to the transcript length and GC content may improve the accuracy of inferring RNA interactions. Yet, all the current tools have difficulties to make accurate predictions of short-trans RNA-RNA interactions-stretches of sparse contacts. All over, there is still room for improvement in each category, especially for predictions of RNA interactions.

Keywords: RNA-DNA interaction; RNA-RNA interaction; gene prediction; lncRNA.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Benchmarking*
  • Computational Biology / methods*
  • Humans
  • RNA, Long Noncoding / genetics
  • RNA, Long Noncoding / metabolism*
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism*
  • Transcriptome

Substances

  • RNA, Long Noncoding
  • RNA, Messenger