SDePER: a hybrid machine learning and regression method for cell-type deconvolution of spatial barcoding-based transcriptomic data

Genome Biol. 2024 Oct 14;25(1):271. doi: 10.1186/s13059-024-03416-2.

Abstract

Spatial barcoding-based transcriptomic (ST) data require deconvolution for cellular-level downstream analysis. Here we present SDePER, a hybrid machine learning and regression method to deconvolve ST data using reference single-cell RNA sequencing (scRNA-seq) data. SDePER tackles platform effects between ST and scRNA-seq data, ensuring a linear relationship between them while addressing sparsity and spatial correlations in cell types across capture spots. SDePER estimates cell-type proportions, enabling enhanced resolution tissue mapping by imputing cell-type compositions and gene expressions at unmeasured locations. Applications to simulated data and four real datasets showed SDePER's superior accuracy and robustness over existing methods.

MeSH terms

  • Animals
  • Gene Expression Profiling / methods
  • Humans
  • Machine Learning*
  • RNA-Seq / methods
  • Regression Analysis
  • Sequence Analysis, RNA / methods
  • Single-Cell Analysis* / methods
  • Software
  • Transcriptome*