Network orientation via shortest paths

Bioinformatics. 2014 May 15;30(10):1449-55. doi: 10.1093/bioinformatics/btu043. Epub 2014 Jan 27.

Abstract

The graph orientation problem calls for orienting the edges of a graph so as to maximize the number of pre-specified source-target vertex pairs that admit a directed path from the source to the target. Most algorithmic approaches to this problem share a common preprocessing step, in which the input graph is reduced to a tree by repeatedly contracting its cycles. Although this reduction is valid from an algorithmic perspective, the assignment of directions to the edges of the contracted cycles becomes arbitrary, and the connecting source-target paths may be arbitrarily long. In the context of biological networks, the connection of vertex pairs via shortest paths is highly motivated, leading to the following problem variant: given a graph and a collection of source-target vertex pairs, assign directions to the edges so as to maximize the number of pairs that are connected by a shortest (in the original graph) directed path. This problem is NP-complete and hard to approximate to within sub-polynomial factors. Here we provide a first polynomial-size integer linear program formulation for this problem, which allows its exact solution in seconds on current networks. We apply our algorithm to orient protein-protein interaction networks in yeast and compare it with two state-of-the-art algorithms. We find that our algorithm outperforms previous approaches and can orient considerable parts of the network, thus revealing its structure and function.

Availability and implementation: The source code is available at www.cs.tau.ac.il/∼roded/shortest.zip.

Contact: [email protected].

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Linear Models
  • Protein Binding
  • Saccharomyces cerevisiae / metabolism
  • Saccharomyces cerevisiae Proteins / metabolism
  • Software

Substances

  • Saccharomyces cerevisiae Proteins