Identifying key genes in cancer networks using persistent homology

Sci Rep. 2025 Jan 22;15(1):2751. doi: 10.1038/s41598-025-87265-4.

Abstract

Identifying driver genes is crucial for understanding oncogenesis and developing targeted cancer therapies. Driver discovery methods using protein or pathway networks rely on traditional network science measures, focusing on nodes, edges, or community metrics. These methods can overlook the high-dimensional interactions that cancer genes have within cancer networks. This study presents a novel method using Persistent Homology to analyze the role of driver genes in higher-order structures within Cancer Consensus Networks derived from main cellular pathways. We integrate mutation data from six cancer types and three biological functions: DNA Repair, Chromatin Organization, and Programmed Cell Death. We systematically evaluated the impact of gene removal on topological voids ([Formula: see text] structures) within the Cancer Consensus Networks. Our results reveal that only known driver genes and cancer-associated genes influence these structures, while passenger genes do not. Although centrality measures alone proved insufficient to fully characterize impact genes, combining higher-order topological analysis with traditional network metrics can improve the precision of distinguishing between drivers and passengers. This work shows that cancer genes play an important role in higher-order structures, going beyond pairwise measures, and provides an approach to distinguish drivers and cancer-associated genes from passenger genes.

Keywords: Cancer genomics; Driver genes; Pathways networks; Persistent homology; Protein networks; Topological data analysis.

MeSH terms

  • Computational Biology / methods
  • DNA Repair / genetics
  • Gene Regulatory Networks*
  • Genes, Neoplasm
  • Humans
  • Mutation
  • Neoplasms* / genetics