High content of nuclei-free low-quality cells in reference single-cell atlases: a call for more stringent quality control using nuclear fraction

BMC Genomics. 2024 Nov 21;25(1):1124. doi: 10.1186/s12864-024-11015-5.

Abstract

The advent of droplet-based single-cell RNA-sequencing (scRNA-seq) has dramatically increased data throughput, enabling the release of a diverse array of tissue cell atlases to the public. However, we will show that prominent initiatives such as the Human Cell Atlas [1], the Tabula Sapiens [2] and the Tabula Muris [3] contain a significant amount of contamination products (frequently affecting the whole organ) in their data portals due to suboptimal quality filtering. Our work addresses a critical gap by advocating for more stringent quality filtering, highlighting the imperative for a shift from existing standards, which currently lean towards greater permissiveness. We will show the importance of incorporating cell intronic fraction in quality control -or MALAT1 expression otherwise- showcasing its informative nature and potential to elevate cell atlas data reliability. In summary, here, we unveil the hidden intronic landscape of every tissue and highlight the importance of more rigorous single-cell RNA-sequencing quality assessment in cell atlases to enhance their applicability in diverse downstream analyses.

Keywords: Cell atlases; Intronic content; MALAT1; Nuclear fraction; Quality control; Single-cell RNA-seq.

MeSH terms

  • Animals
  • Cell Nucleus / genetics
  • Humans
  • Introns / genetics
  • Quality Control*
  • RNA, Long Noncoding / genetics
  • Sequence Analysis, RNA
  • Single-Cell Analysis* / methods

Substances

  • RNA, Long Noncoding
  • MALAT1 long non-coding RNA, human