A likelihood approach to analysis of network data

Proc Natl Acad Sci U S A. 2006 May 16;103(20):7566-70. doi: 10.1073/pnas.0600061103. Epub 2006 May 8.

Abstract

Biological, sociological, and technological network data are often analyzed by using simple summary statistics, such as the observed degree distribution, and nonparametric bootstrap procedures to provide an adequate null distribution for testing hypotheses about the network. In this article, we present a full-likelihood approach that allows us to estimate parameters for general models of network growth that can be expressed in terms of recursion relations. To handle larger networks, we have developed an importance sampling scheme that allows us to approximate the likelihood, draw inference about the network and how it was generated, estimate the parameters in the model, and perform parametric bootstrap analysis of network data. We illustrate the power of this approach by estimating growth parameters for the Caenorhabditis elegans protein interaction network.
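
The general idea of approximating a likelihood by importance sampling can be illustrated with a minimal sketch. This is not the scheme from the article, which samples growth histories of a network; it is a toy latent-variable model chosen because its exact likelihood is known. All names here (N, joint, is_likelihood, the uniform proposal, the grid of theta values) are assumptions made for illustration only. A hidden count z is drawn from Binomial(N, theta), the observation x is drawn from Binomial(z, 0.5), and the likelihood of theta is the sum of P(x, z | theta) over all hidden z.

```python
import math
import random

N = 20  # hypothetical number of latent trials in the toy model


def binom_pmf(k, n, p):
    """Binomial probability mass function, zero outside 0..n."""
    if k < 0 or k > n:
        return 0.0
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)


def joint(x, z, theta):
    """P(x, z | theta) = P(z | theta) * P(x | z) for the toy model."""
    return binom_pmf(z, N, theta) * binom_pmf(x, z, 0.5)


def exact_likelihood(x, theta):
    """Sum the joint probability over every latent value z (feasible only for small problems)."""
    return sum(joint(x, z, theta) for z in range(x, N + 1))


def is_likelihood(x, theta, n_samples=20000, seed=0):
    """Importance sampling estimate: average of joint(x, z) / q(z) with z drawn from q.

    The proposal q is uniform on {x, ..., N}, an arbitrary choice for this sketch.
    """
    rng = random.Random(seed)
    q = 1.0 / (N - x + 1)
    total = 0.0
    for _ in range(n_samples):
        z = rng.randint(x, N)  # inclusive on both ends
        total += joint(x, z, theta) / q
    return total / n_samples


x_obs = 4
exact = exact_likelihood(x_obs, 0.4)
approx = is_likelihood(x_obs, 0.4)

# Crude maximum likelihood estimate on a grid, reusing the same seed
# for every theta so the estimated curve is smooth across the grid.
theta_grid = [i / 10 for i in range(1, 10)]
theta_hat = max(theta_grid, key=lambda t: is_likelihood(x_obs, t))

print(f"exact={exact:.4f} approx={approx:.4f} theta_hat={theta_hat}")
```

In this toy setting the sum over z is small enough to compute directly, so the estimate can be checked against the exact value. In a network growth model the sum runs over possible histories and grows too quickly to enumerate, which is the situation where a sampling approximation becomes useful.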

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Caenorhabditis elegans
  • Caenorhabditis elegans Proteins*
  • Data Interpretation, Statistical
  • Information Services*
  • Likelihood Functions*
  • Mathematics
  • Models, Theoretical*
  • Nerve Net*
  • Social Support*

Substances

  • Caenorhabditis elegans Proteins