Inference and effects of barcode multiplets in droplet-based single-cell assays

Nat Commun. 2020 Feb 13;11(1):866. doi: 10.1038/s41467-020-14667-5.

Abstract

A widespread assumption for single-cell analyses specifies that one cell's nucleic acids are predominantly captured by one oligonucleotide barcode. Here, we show that ~13-21% of cell barcodes from the 10x Chromium scATAC-seq assay may have been derived from a droplet with more than one oligonucleotide sequence, which we call "barcode multiplets". We demonstrate that barcode multiplets can be derived from at least two different sources. First, we confirm that approximately 4% of droplets from the 10x platform may contain multiple beads. Additionally, we find that approximately 5% of beads may contain detectable levels of multiple oligonucleotide barcodes. We show that this artifact can confound single-cell analyses, including the interpretation of clonal diversity and proliferation of intra-tumor lymphocytes. Overall, our work provides a conceptual and computational framework to identify and assess the impacts of barcode multiplets in single-cell data.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artifacts
  • Base Sequence
  • DNA Barcoding, Taxonomic / methods*
  • DNA Barcoding, Taxonomic / statistics & numerical data
  • Data Interpretation, Statistical
  • Databases, Genetic / statistics & numerical data
  • Genomics / methods
  • Genomics / statistics & numerical data
  • Humans
  • Lymphocytes / chemistry
  • Lymphocytes / cytology
  • Single-Cell Analysis / methods*
  • Single-Cell Analysis / statistics & numerical data