Alleviating batch effects in cell type deconvolution with SCCAF-D

Nat Commun. 2024 Dec 30;15(1):10867. doi: 10.1038/s41467-024-55213-x.

Abstract

Cell type deconvolution methods can impute cell proportions from bulk transcriptomics data, revealing changes in disease progression or organ development. But benchmarking studies often use simulated bulk data from the same source as the reference, which limits its application scenarios. This study examines batch effects in deconvolution and introduces SCCAF-D, a computational workflow that ensures a Pearson Correlation Coefficient above 0.75 across simulated and real bulk data for various tissue types. Applied to non-alcoholic fatty liver disease, SCCAF-D unveils meaningful insights into changes in cell proportions during disease progression.

MeSH terms

  • Algorithms
  • Animals
  • Computational Biology / methods
  • Computer Simulation
  • Disease Progression
  • Gene Expression Profiling / methods
  • Humans
  • Mice
  • Non-alcoholic Fatty Liver Disease* / genetics
  • Non-alcoholic Fatty Liver Disease* / metabolism
  • Non-alcoholic Fatty Liver Disease* / pathology
  • Software
  • Transcriptome
  • Workflow