Confronting false discoveries in single-cell differential expression

Nat Commun. 2021 Sep 28;12(1):5692. doi: 10.1038/s41467-021-25960-2.

Abstract

Differential expression analysis in single-cell transcriptomics enables the dissection of cell-type-specific responses to perturbations such as disease, trauma, or experimental manipulations. While many statistical methods are available to identify differentially expressed genes, the principles that distinguish these methods and their performance remain unclear. Here, we show that the relative performance of these methods is contingent on their ability to account for variation between biological replicates. Methods that ignore this inevitable variation are biased and prone to false discoveries. Indeed, the most widely used methods can discover hundreds of differentially expressed genes in the absence of biological differences. To exemplify these principles, we exposed true and false discoveries of differentially expressed genes in the injured mouse spinal cord.
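The central claim, that methods ignoring variation between biological replicates are prone to false discoveries, can be illustrated with a small simulation. The sketch below is not the authors' analysis. It is a toy model under stated assumptions: Gaussian expression values, three replicates per condition, 50 cells per replicate, and no true difference between conditions. All function names and parameter values are hypothetical. It compares a t-test that treats every cell as an independent observation with the same test applied to per-replicate means.

```python
import math
import random
import statistics


def t_statistic(a, b):
    """Two-sample t statistic; assumes both groups have the same size."""
    n = len(a)
    se = math.sqrt((statistics.variance(a) + statistics.variance(b)) / n)
    return (statistics.mean(a) - statistics.mean(b)) / se


def simulate_null_gene(rng, n_reps=3, cells_per_rep=50, rep_sd=1.0, cell_sd=1.0):
    """Simulate one gene with NO condition effect (illustrative assumption).

    Returns two conditions, each a list of replicates, each a list of
    per-cell expression values. Every replicate gets its own random offset,
    which stands in for variation between biological replicates.
    """
    conditions = []
    for _ in range(2):
        replicates = []
        for _ in range(n_reps):
            rep_mean = rng.gauss(0.0, rep_sd)
            cells = [rng.gauss(rep_mean, cell_sd) for _ in range(cells_per_rep)]
            replicates.append(cells)
        conditions.append(replicates)
    return conditions


def count_false_discoveries(n_genes=500, seed=0):
    """Count null genes called significant at a nominal two-sided 5% level."""
    rng = random.Random(seed)
    cell_level = 0
    replicate_level = 0
    for _ in range(n_genes):
        control, treated = simulate_null_gene(rng)

        # Cell-level test: pool all cells and ignore which replicate they came from.
        t_cells = t_statistic(
            [x for rep in control for x in rep],
            [x for rep in treated for x in rep],
        )
        # Replicate-level test: average cells within each replicate first.
        t_reps = t_statistic(
            [statistics.mean(rep) for rep in control],
            [statistics.mean(rep) for rep in treated],
        )

        # Critical values: about 1.97 for 298 degrees of freedom (150 + 150 cells),
        # and 2.776 for 4 degrees of freedom (3 + 3 replicates).
        cell_level += abs(t_cells) > 1.97
        replicate_level += abs(t_reps) > 2.776
    return cell_level, replicate_level


if __name__ == "__main__":
    cell_fp, rep_fp = count_false_discoveries()
    print(f"cell-level false discoveries:      {cell_fp} of 500")
    print(f"replicate-level false discoveries: {rep_fp} of 500")
```

Because every simulated gene is null, each call is a false discovery. Under these assumptions the cell-level test flags far more genes than the nominal 5%, since cells from the same replicate share an offset and are not independent observations. The replicate-level test stays close to the nominal rate.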

Publication types

  • Research Support, N.I.H., Intramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Biological Variation, Individual
  • Biological Variation, Population
  • Data Accuracy*
  • Datasets as Topic
  • Gene Expression Regulation
  • Humans
  • Mice
  • Models, Statistical*
  • RNA-Seq / methods*
  • RNA-Seq / statistics & numerical data
  • Rabbits
  • Rats
  • Single-Cell Analysis / methods*
  • Single-Cell Analysis / statistics & numerical data
  • Swine

Grants and funding