Quantitative assessment of cell population diversity in single-cell landscapes

PLoS Biol. 2018 Oct 22;16(10):e2006687. doi: 10.1371/journal.pbio.2006687. eCollection 2018 Oct.

Abstract

Single-cell RNA sequencing (scRNA-seq) has become a powerful tool for the systematic investigation of cellular diversity. As a number of computational tools have been developed to identify and visualize cell populations within a single scRNA-seq dataset, there is a need for methods to quantitatively and statistically define proportional shifts in cell population structures across datasets, such as expansion or shrinkage or emergence or disappearance of cell populations. Here we present sc-UniFrac, a framework to statistically quantify compositional diversity in cell populations between single-cell transcriptome landscapes. sc-UniFrac enables sensitive and robust quantification in simulated and experimental datasets in terms of both population identity and quantity. We have demonstrated the utility of sc-UniFrac in multiple applications, including assessment of biological and technical replicates, classification of tissue phenotypes and regional specification, identification and definition of altered cell infiltrates in tumorigenesis, and benchmarking batch-correction tools. sc-UniFrac provides a framework for quantifying diversity or alterations in cell populations across conditions and has broad utility for gaining insight into tissue-level perturbations at the single-cell resolution.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Animals
  • Brain / cytology
  • Brain / metabolism
  • CD4-Positive T-Lymphocytes / cytology
  • CD4-Positive T-Lymphocytes / metabolism
  • CD8-Positive T-Lymphocytes / cytology
  • CD8-Positive T-Lymphocytes / metabolism
  • Cluster Analysis
  • Computer Simulation
  • Databases, Nucleic Acid
  • Gene Expression Profiling / methods*
  • Gene Expression Profiling / statistics & numerical data
  • Humans
  • Intestinal Mucosa / cytology
  • Intestinal Mucosa / metabolism
  • Mice
  • Mice, Inbred C57BL
  • Models, Biological
  • Neoplasms, Experimental / genetics
  • Neoplasms, Experimental / pathology
  • Oligodendroglia / cytology
  • Oligodendroglia / metabolism
  • Sequence Analysis, RNA / methods*
  • Sequence Analysis, RNA / statistics & numerical data
  • Single-Cell Analysis / methods*
  • Single-Cell Analysis / statistics & numerical data
  • Software
  • Workflow