scapGNN: A graph neural network-based framework for active pathway and gene module inference from single-cell multi-omics data

PLoS Biol. 2023 Nov 13;21(11):e3002369. doi: 10.1371/journal.pbio.3002369. eCollection 2023 Nov.

Abstract

Although advances in single-cell technologies have enabled the characterization of multiple omics profiles in individual cells, extracting functional and mechanistic insights from such information remains a major challenge. Here, we present scapGNN, a graph neural network (GNN)-based framework that creatively transforms sparse single-cell profile data into the stable gene-cell association network for inferring single-cell pathway activity scores and identifying cell phenotype-associated gene modules from single-cell multi-omics data. Systematic benchmarking demonstrated that scapGNN was more accurate, robust, and scalable than state-of-the-art methods in various downstream single-cell analyses such as cell denoising, batch effect removal, cell clustering, cell trajectory inference, and pathway or gene module identification. scapGNN was developed as a systematic R package that can be flexibly extended and enhanced for existing analysis processes. It provides a new analytical platform for studying single cells at the pathway and network levels.

MeSH terms

  • Computational Biology / methods
  • Gene Regulatory Networks*
  • Multiomics*
  • Neural Networks, Computer

Grants and funding

This work was supported by the National Key R&D Program of China (2021YFC2700200 to XG), the Chinese National Natural Science Foundation (Grants No. 82221005 to XG, 81971439 to XG, 82001611 to YL, 31871164 to HZ, 82071702 to HZ) and the fund from Health Commission of Jiangsu Province (M2020071 to YL). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.