SEACells infers transcriptional and epigenomic cellular states from single-cell genomics data

Nat Biotechnol. 2023 Dec;41(12):1746-1757. doi: 10.1038/s41587-023-01716-9. Epub 2023 Mar 27.

Abstract

Metacells are cell groupings derived from single-cell sequencing data that represent highly granular, distinct cell states. Here we present single-cell aggregation of cell states (SEACells), an algorithm for identifying metacells that overcome the sparsity of single-cell data while retaining heterogeneity obscured by traditional cell clustering. SEACells outperforms existing algorithms in identifying comprehensive, compact and well-separated metacells in both RNA and assay for transposase-accessible chromatin (ATAC) modalities across datasets with discrete cell types and continuous trajectories. We demonstrate the use of SEACells to improve gene-peak associations, compute ATAC gene scores and infer the activities of critical regulators during differentiation. Metacell-level analysis scales to large datasets and is particularly well suited for patient cohorts, where per-patient aggregation provides more robust units for data integration. We use our metacells to reveal expression dynamics and gradual reconfiguration of the chromatin landscape during hematopoietic differentiation and to uniquely identify CD4 T cell differentiation and activation states associated with disease onset and severity in a Coronavirus Disease 2019 (COVID-19) patient cohort.

MeSH terms

  • Algorithms
  • CD4-Positive T-Lymphocytes / metabolism
  • Chromatin* / genetics
  • Chromatin* / metabolism
  • Epigenomics*
  • Genomics
  • Humans
  • Single-Cell Analysis

Substances

  • Chromatin