A Post-Processing Algorithm for miRNA Microarray Data

Int J Mol Sci. 2020 Feb 12;21(4):1228. doi: 10.3390/ijms21041228.

Abstract

One of the main disadvantages of using DNA microarrays for miRNA expression profiling is the inability of adequate comparison of expression values across different miRNAs. This leads to a large amount of miRNAs with high scores which are actually not expressed in examined samples, i.e., false positives. We propose a post-processing algorithm which performs scoring of miRNAs in the results of microarray analysis based on expression values, time of discovery of miRNA, and correlation level between the expressions of miRNA and corresponding pre-miRNA in considered samples. The algorithm was successfully validated by the comparison of the results of its application to miRNA microarray breast tumor samples with publicly available miRNA-seq breast tumor data. Additionally, we obtained possible reasons why miRNA can appear as a false positive in microarray study using paired miRNA sequencing and array data. The use of DNA microarrays for estimating miRNA expression profile is limited by several factors. One of them consists of problems with comparing expression values of different miRNAs. In this work, we show that situation can be significantly improved if some additional information is taken into consideration in a comparison.

Keywords: TCGA; miRNA microarrays; miRNome of breast cancer.

MeSH terms

  • Algorithms*
  • Breast Neoplasms / metabolism
  • Female
  • Gene Expression Profiling
  • Gene Expression Regulation, Neoplastic
  • Humans
  • Male
  • MicroRNAs / genetics*
  • Oligonucleotide Array Sequence Analysis / methods*
  • Sensitivity and Specificity
  • Sequence Analysis, RNA

Substances

  • MicroRNAs