A Central Edge Selection Based Overlapping Community Detection Algorithm for the Detection of Overlapping Structures in Protein–Protein Interaction Networks

Molecules. 2018 Oct 13;23(10):2633. doi: 10.3390/molecules23102633.

Abstract

Overlapping structures are prevalent in protein–protein interaction networks across different biological processes, reflecting the mechanism of sharing common functional components. The overlapping community detection (OCD) algorithm based on central node selection (CNS) is a traditional and accepted algorithm for OCD in networks. CNS consists of two main steps: central node selection and a clustering procedure. However, the original CNS does not consider the influence among the nodes or the importance of the division of the edges in networks. In this paper, an OCD algorithm based on central edge selection (CES) is proposed for detecting overlapping communities in protein–protein interaction (PPI) networks. Unlike traditional CNS algorithms for OCD, the proposed algorithm uses community magnetic interference (CMI) to obtain more reasonable central edges during CES, and employs a new distance between a non-central edge and the set of central edges to assign each non-central edge to the correct cluster during the clustering procedure. In addition, the proposed CES improves the overlapping node pruning (ONP) strategy to make the division more precise. Experimental results on three benchmark networks and three biological PPI networks of Mus musculus, Escherichia coli, and Saccharomyces cerevisiae show that the CES algorithm performs well.
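The general shape of an edge-centred clustering pipeline (select central edges, assign each remaining edge to its nearest central edge, then read overlapping nodes off the edge clusters) can be illustrated with a minimal Python sketch. The abstract does not define CMI, the edge distance, or the ONP step, so everything specific below is an assumption: the degree-sum score is a stand-in for CMI, the mean endpoint hop count is a stand-in for the paper's distance, ONP is omitted, and all function names are hypothetical.

```python
from collections import defaultdict, deque


def build_adjacency(edges):
    """Undirected adjacency sets from a list of (u, v) edges."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return adj


def hop_distances(adj, source):
    """Breadth-first hop counts from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nbr in adj[node]:
            if nbr not in dist:
                dist[nbr] = dist[node] + 1
                queue.append(nbr)
    return dist


def select_central_edges(edges, adj, k):
    """Pick up to k high-scoring edges that share no endpoint.

    ASSUMPTION: the score is the sum of endpoint degrees, a placeholder
    for the paper's CMI-based score, which the abstract does not define.
    """
    ranked = sorted(edges, key=lambda e: -(len(adj[e[0]]) + len(adj[e[1]])))
    central = []
    for u, v in ranked:
        if all(u not in c and v not in c for c in central):
            central.append((u, v))
        if len(central) == k:
            break
    return central


def edge_distance(edge, center, dist):
    """ASSUMPTION: mean hop count over the four endpoint pairs."""
    inf = float("inf")
    return sum(dist[a].get(b, inf) for a in edge for b in center) / 4


def detect_overlapping_communities(edges, k):
    """Cluster edges around central edges; return node sets per cluster."""
    adj = build_adjacency(edges)
    dist = {node: hop_distances(adj, node) for node in adj}
    central = select_central_edges(edges, adj, k)
    clusters = [[c] for c in central]
    for e in edges:
        if e in central:
            continue
        nearest = min(range(len(central)),
                      key=lambda i: edge_distance(e, central[i], dist))
        clusters[nearest].append(e)
    # A node belongs to every community that contains one of its edges.
    return [{n for e in cluster for n in e} for cluster in clusters]


def overlapping_nodes(communities):
    """Nodes that appear in more than one community."""
    counts = defaultdict(int)
    for community in communities:
        for n in community:
            counts[n] += 1
    return {n for n, c in counts.items() if c > 1}


# Toy network: two triangles that share node "c".
edges = [("a", "b"), ("b", "c"), ("a", "c"),
         ("c", "d"), ("d", "e"), ("c", "e")]
communities = detect_overlapping_communities(edges, k=2)
overlap = overlapping_nodes(communities)
print([sorted(c) for c in communities], sorted(overlap))
# [['a', 'b', 'c'], ['c', 'd', 'e']] ['c']
```

Because each edge is placed in exactly one cluster while a node may be an endpoint of edges in several clusters, overlap between communities arises naturally from an edge partition. The sketch does not reproduce the paper's results or its pruning of spurious overlapping nodes.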

Keywords: central edge selection; overlapping community detection; overlapping node pruning; protein–protein interaction network.

MeSH terms

  • Algorithms
  • Animals
  • Computational Biology / methods*
  • Escherichia coli / metabolism*
  • Escherichia coli Proteins / metabolism
  • Mice
  • Protein Interaction Mapping / methods*
  • Protein Interaction Maps
  • Saccharomyces cerevisiae / metabolism*
  • Saccharomyces cerevisiae Proteins / metabolism

Substances

  • Escherichia coli Proteins
  • Saccharomyces cerevisiae Proteins