Multi-layer Bundling as a New Approach for Determining Multi-scale Correlations Within a High-Dimensional Dataset

Bull Math Biol. 2024 Jul 12;86(9):105. doi: 10.1007/s11538-024-01335-8.

Abstract

The growing complexity of biological data has spurred the development of innovative computational techniques to extract meaningful information and uncover hidden patterns within vast datasets. Biological networks, such as gene regulatory networks and protein-protein interaction networks, hold critical insights into biological features' connections and functions. Integrating and analyzing high-dimensional data, particularly in gene expression studies, stands prominent among the challenges in deciphering these networks. Clustering methods play a crucial role in addressing these challenges, with spectral clustering emerging as a potent unsupervised technique considering intrinsic geometric structures. However, spectral clustering's user-defined cluster number can lead to inconsistent and sometimes orthogonal clustering regimes. We propose the Multi-layer Bundling (MLB) method to address this limitation, combining multiple prominent clustering regimes to offer a comprehensive data view. We call the outcome clusters "bundles". This approach refines clustering outcomes, unravels hierarchical organization, and identifies bridge elements mediating communication between network components. By layering clustering results, MLB provides a global-to-local view of biological feature clusters enabling insights into intricate biological systems. Furthermore, the method enhances bundle network predictions by integrating the bundle co-cluster matrix with the affinity matrix. The versatility of MLB extends beyond biological networks, making it applicable to various domains where understanding complex relationships and patterns is needed.

Keywords: Biological network; Clustering method; Correlation network analysis; Dimension reduction; Spectral clustering.

MeSH terms

  • Algorithms*
  • Cluster Analysis
  • Computational Biology*
  • Gene Expression Profiling / methods
  • Gene Expression Profiling / statistics & numerical data
  • Gene Regulatory Networks*
  • Humans
  • Mathematical Concepts*
  • Models, Biological
  • Protein Interaction Maps*