A Note on Using the Adjusted Rand Index for Link Prediction in Networks

Michaela Hoffman; Douglas Steinley; Michael J Brusco

doi:10.1016/j.socnet.2015.03.002

A Note on Using the Adjusted Rand Index for Link Prediction in Networks

Soc Networks. 2015 Jul:42:72-79. doi: 10.1016/j.socnet.2015.03.002. Epub 2015 Apr 6.

Authors

Michaela Hoffman¹, Douglas Steinley¹, Michael J Brusco²

Affiliations

¹ University of Missouri - Columbia.
² Florida State University.

Abstract

As network data gains popularity for research in various fields, the need for methods to predict future links or find missing links in the data increases. One subset of the methodology used to solve this problem involves creating a similarity measure between each pair of nodes in the network; unfortunately, these methods can be shown to have arbitrary cutoffs and poor performance. To address these shortcomings, we use the adjusted Rand index to create a similarity measure between nodes that has a natural threshold of zero. The effectiveness of this method is then compared to a number of other similarity measures and assessed on a variety of simulated data sets with block model structure and three real network data sets. Under this particular formulation of the adjusted Rand index, information is also provided on dissimilarity. As such, we then go on to test its use for detecting incorrect links in network data, highlighting the dual use of the approach.

Keywords: adjusted Rand index; link prediction; missing links; network analysis.

Grants and funding

R21 AA022074/AA/NIAAA NIH HHS/United States