Incomplete and noisy network data as a percolation process

J R Soc Interface. 2010 Oct 6;7(51):1411-9. doi: 10.1098/rsif.2010.0044. Epub 2010 Apr 8.

Abstract

We discuss the ramifications of noisy and incomplete observations of network data on the existence of a giant connected component (GCC). The existence of a GCC in a random graph can be described in terms of a percolation process, and building on general results for classes of random graphs with specified degree distributions we derive percolation thresholds above which GCCs exist. We show that sampling and noise can have a profound effect on the perceived existence of a GCC and find that both processes can destroy it. We also show that the absence of a GCC puts a theoretical upper bound on the false-positive rate and relate our percolation analysis to experimental protein-protein interaction data.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods*
  • Data Interpretation, Statistical
  • Models, Biological
  • Protein Interaction Mapping*