ReorientExpress: reference-free orientation of nanopore cDNA reads with deep learning

Genome Biol. 2019 Nov 29;20(1):260. doi: 10.1186/s13059-019-1884-z.

Abstract

We describe ReorientExpress, a method to perform reference-free orientation of transcriptomic long sequencing reads. ReorientExpress uses deep learning to correctly predict the orientation of the majority of reads, and in particular when trained on a closely related species or in combination with read clustering. ReorientExpress enables long-read transcriptomics in non-model organisms and samples without a genome reference without using additional technologies and is available at https://github.com/comprna/reorientexpress.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA, Complementary / analysis*
  • Deep Learning*
  • Nanopore Sequencing*
  • Sequence Analysis, DNA*
  • Software*

Substances

  • DNA, Complementary