Doublet identification in single-cell sequencing data using scDblFinder

F1000Res. 2021 Sep 28:10:979. doi: 10.12688/f1000research.73600.2. eCollection 2021.

Abstract

Doublets are prevalent in single-cell sequencing data and can lead to artifactual findings. A number of strategies have therefore been proposed to detect them. Building on the strengths of existing approaches, we developed scDblFinder, a fast, flexible and accurate Bioconductor-based doublet detection method. Here we present the method, justify its design choices, demonstrate its performance on both single-cell RNA and accessibility (ATAC) sequencing data, and provide some observations on doublet formation, detection, and enrichment analysis. Even in complex datasets, scDblFinder can accurately identify most heterotypic doublets, and was already found by an independent benchmark to outcompete alternatives.

Keywords: doublets; filtering; multiplets; single-cell sequencing.

MeSH terms

  • RNA*
  • Software*

Substances

  • RNA

Associated data

  • figshare/10.6084/m9.figshare.16543518

Grants and funding

This work was supported by the Swiss National Science Foundation (grant number 310030_175841).