SCAPTURE: a deep learning-embedded pipeline that captures polyadenylation information from 3' tag-based RNA-seq of single cells

Genome Biol. 2021 Aug 10;22(1):221. doi: 10.1186/s13059-021-02437-5.

Abstract

Single-cell RNA-seq (scRNA-seq) profiles gene expression with high resolution. Here, we develop a stepwise computational method-called SCAPTURE to identify, evaluate, and quantify cleavage and polyadenylation sites (PASs) from 3' tag-based scRNA-seq. SCAPTURE detects PASs de novo in single cells with high sensitivity and accuracy, enabling detection of previously unannotated PASs. Quantified alternative PAS transcripts refine cell identity analysis beyond gene expression, enriching information extracted from scRNA-seq data. Using SCAPTURE, we show changes of PAS usage in PBMCs from infected versus healthy individuals at single-cell resolution.

Keywords: APA; Deep learning; PAS; Peak calling; Transcript quantification; scRNA-seq.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • COVID-19 / diagnosis
  • Deep Learning*
  • Humans
  • Polyadenylation*
  • RNA-Seq*
  • SARS-CoV-2
  • Sensitivity and Specificity
  • Sequence Analysis, RNA
  • Single-Cell Analysis*
  • Transcriptome