OPTIMIR, a novel algorithm for integrating available genome-wide genotype data into miRNA sequence alignment analysis

RNA. 2019 Jun;25(6):657-668. doi: 10.1261/rna.069708.118. Epub 2019 Feb 28.

Abstract

Next-generation sequencing is an increasingly popular and efficient approach to characterize the full set of microRNAs (miRNAs) present in human biosamples. Detecting and quantifying miRNAs remains a challenge, as they can undergo different posttranscriptional modifications and may harbor genetic variations (polymiRs) that affect the alignment step. We present a novel algorithm, OPTIMIR, that incorporates biological knowledge on miRNA editing and genome-wide genotype data available for the processed samples to improve alignment accuracy. OPTIMIR was applied to 391 human plasma samples that had been typed with genome-wide genotyping arrays. OPTIMIR detected genotyping errors, suggested the existence of novel miRNAs, and highlighted allelic imbalance in the expression of polymiRs in heterozygous carriers. OPTIMIR is written in Python and is freely available on the GENMED website (http://www.genmed.fr/index.php/fr/) and on GitHub (github.com/FlorianThibord/OptimiR).
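
The general idea of a genotype-aware alignment can be illustrated with a minimal Python sketch. It is not OPTIMIR's implementation: the function names, the toy sequence, the single-nucleotide variant model, and the exact-match lookup are all assumptions made for this example. The sketch adds an alternative-allele sequence to a reference library for each polymiR, assigns a read to the allele it matches, and checks whether that allele is compatible with the sample's genotype.

```python
from typing import Dict, List, Tuple

# Hypothetical types for this sketch: a variant is (0-based position, ref base, alt base).
Variant = Tuple[int, str, str]
LibraryKey = Tuple[str, str]  # (miRNA name, "ref" or "alt")


def apply_snv(seq: str, pos: int, ref: str, alt: str) -> str:
    """Return seq with the base at pos replaced by alt, after checking the ref base."""
    if seq[pos] != ref:
        raise ValueError(f"expected {ref!r} at position {pos}, found {seq[pos]!r}")
    return seq[:pos] + alt + seq[pos + 1:]


def build_library(mirnas: Dict[str, str],
                  variants: Dict[str, Variant]) -> Dict[LibraryKey, str]:
    """Build a library holding the reference sequence of every miRNA and,
    for miRNAs with a known variant, the alternative-allele sequence too."""
    library: Dict[LibraryKey, str] = {}
    for name, seq in mirnas.items():
        library[(name, "ref")] = seq
        if name in variants:
            pos, ref, alt = variants[name]
            library[(name, "alt")] = apply_snv(seq, pos, ref, alt)
    return library


def assign_read(read: str, library: Dict[LibraryKey, str]) -> List[LibraryKey]:
    """Exact-match lookup (an assumption; a real aligner tolerates isomiR variation)."""
    return [key for key, seq in library.items() if seq == read]


def consistent_with_genotype(allele: str, alt_allele_count: int) -> bool:
    """Check an assigned allele against the number of alt alleles (0, 1 or 2)
    reported for the sample by the genotyping array."""
    if allele == "ref":
        return alt_allele_count in (0, 1)
    return alt_allele_count in (1, 2)


# Toy data, invented for illustration only.
mirnas = {"toy-miR": "ACGUACGUACGUACGUACGUAC"}
variants = {"toy-miR": (3, "U", "C")}
library = build_library(mirnas, variants)
hits = assign_read("ACGCACGUACGUACGUACGUAC", library)
```

In this toy setting, a read assigned to the alternative allele in a sample genotyped as homozygous reference would fail the consistency check, which is the kind of discrepancy that could point to a misalignment or a genotyping error.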

Keywords: alignment; genetic variations; isomiRs; microRNA; next-generation sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Base Sequence
  • Computational Biology / methods
  • Databases, Genetic
  • Genome, Human*
  • Genome-Wide Association Study
  • Genotype
  • Heterozygote
  • Humans
  • MicroRNAs / blood
  • MicroRNAs / classification
  • MicroRNAs / genetics*
  • Oligonucleotide Array Sequence Analysis
  • Sequence Alignment / methods*
  • Software
  • Venous Thrombosis / blood
  • Venous Thrombosis / genetics*
  • Venous Thrombosis / pathology

Substances

  • MicroRNAs