Classification of low quality cells from single-cell RNA-seq data

Genome Biol. 2016 Feb 17:17:29. doi: 10.1186/s13059-016-0888-1.

Abstract

Single-cell RNA sequencing (scRNA-seq) has broad applications across biomedical research. One of the key challenges is to ensure that only single, live cells are included in downstream analysis, as the inclusion of compromised cells inevitably affects data interpretation. Here, we present a generic approach for processing scRNA-seq data and detecting low quality cells, using a curated set of over 20 biological and technical features. Our approach improves classification accuracy by over 30 % compared to traditional methods when tested on over 5,000 cells, including CD4+ T cells, bone marrow dendritic cells, and mouse embryonic stem cells.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Sequence / genetics*
  • Bone Marrow Cells / classification
  • CD4-Positive T-Lymphocytes / classification
  • Dendritic Cells / classification
  • Embryonic Stem Cells / classification
  • Gene Expression Profiling
  • High-Throughput Nucleotide Sequencing
  • Mice
  • Oligonucleotide Array Sequence Analysis
  • RNA / genetics*
  • Single-Cell Analysis*

Substances

  • RNA