Millefy: visualizing cell-to-cell heterogeneity in read coverage of single-cell RNA sequencing datasets

BMC Genomics. 2020 Mar 3;21(1):177. doi: 10.1186/s12864-020-6542-z.

Abstract

Background: Read coverage of RNA sequencing data reflects gene expression and RNA processing events. Single-cell RNA sequencing (scRNA-seq) methods, particularly "full-length" ones, provide read coverage of many individual cells and have the potential to reveal cellular heterogeneity in RNA transcription and processing. However, visualization tools suited to highlighting cell-to-cell heterogeneity in read coverage are still lacking.

Results: Here, we have developed Millefy, a tool for visualizing read coverage of scRNA-seq data in genomic contexts. Millefy is designed to show read coverage of all individual cells at once in genomic contexts and to highlight cell-to-cell heterogeneity in read coverage. By visualizing read coverage of all cells as a heat map and dynamically reordering cells based on diffusion maps, Millefy facilitates discovery of "local" region-specific, cell-to-cell heterogeneity in read coverage. We applied Millefy to scRNA-seq data sets of mouse embryonic stem cells and triple-negative breast cancers and showed variability of transcribed regions including antisense RNAs, 3 ' UTR lengths, and enhancer RNA transcription.

Conclusions: Millefy simplifies the examination of cellular heterogeneity in RNA transcription and processing events using scRNA-seq data. Millefy is available as an R package (https://github.com/yuifu/millefy) and as a Docker image for use with Jupyter Notebook (https://hub.docker.com/r/yuifu/datascience-notebook-millefy).

Keywords: Read coverage; Single-cell RNA sequencing; Visualization.

MeSH terms

  • 3' Untranslated Regions
  • Animals
  • Cells, Cultured
  • Computational Biology / methods*
  • Female
  • Gene Expression Profiling / methods*
  • Genetic Heterogeneity
  • Humans
  • Mice
  • Mouse Embryonic Stem Cells / chemistry
  • Mouse Embryonic Stem Cells / cytology*
  • RNA, Antisense / genetics
  • Sequence Analysis, RNA / methods
  • Single-Cell Analysis / methods*
  • Software
  • Triple Negative Breast Neoplasms / genetics*

Substances

  • 3' Untranslated Regions
  • RNA, Antisense