Detecting miRNAs in deep-sequencing data: a software performance comparison and evaluation

Brief Bioinform. 2013 Jan;14(1):36-45. doi: 10.1093/bib/bbs010. Epub 2012 Mar 24.

Abstract

Deep sequencing has become a popular tool for novel miRNA detection but its data must be viewed carefully as the state of the field is still undeveloped. Using three different programs, miRDeep (v1, 2), miRanalyzer and DSAP, we have analyzed seven data sets (six biological and one simulated) to provide a critical evaluation of the programs performance. We selected these software based on their popularity and overall approach toward the detection of novel and known miRNAs using deep-sequencing data. The program comparisons suggest that, despite differing stringency levels they all identify a similar set of known and novel predictions. Comparisons between the first and second version of miRDeep suggest that the stringency level of each of these programs may, in fact, be a result of the algorithm used to map the reads to the target. Different stringency levels are likely to affect the number of possible novel candidates for functional verification, causing undue strain on resources and time. With that in mind, we propose that an intersection across multiple programs be taken, especially if considering novel candidates that will be targeted for additional analysis. Using this approach, we identify and performed initial validation of 12 novel predictions in our in-house data with real-time PCR, six of which have been previously unreported.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms
  • Animals
  • Cell Line, Tumor
  • Computational Biology / statistics & numerical data
  • Databases, Nucleic Acid* / statistics & numerical data
  • Humans
  • MicroRNAs / genetics*
  • Sequence Analysis, RNA* / statistics & numerical data
  • Software*

Substances

  • MicroRNAs