An implicit enumeration method for an exact test of weighted kappa

Br J Math Stat Psychol. 2008 Nov;61(Pt 2):439-52. doi: 10.1348/000711007X227058. Epub 2007 Jul 23.

Abstract

The kappa coefficient is one of the most widely used measures for evaluating the agreement between two raters asked to assign N objects to one of K nominal categories. Weighted versions of kappa enable partial credit to be awarded for near agreement, most notably in the case of ordinal categories. An exact significance test for weighted kappa can be conducted by enumerating all rater agreement tables with the same fixed marginal frequencies as the observed table, and accumulating the probabilities for all tables that produce a weighted kappa index that is greater than or equal to the observed measure. Unfortunately, complete enumeration of all tables is computationally unwieldy for modest values of N and K. We present an implicit enumeration algorithm for conducting an exact test of weighted kappa, which can be applied to tables of non-trivial size. The algorithm is particularly efficient for 'good' to 'excellent' values of weighted kappa that typically have very small p-values. Therefore, our method is beneficial for situations where resampling tests are of limited value because the number of trials needed to estimate the p-value tends to be large.

MeSH terms

  • Algorithms*
  • Humans
  • Models, Psychological*
  • Psychology / methods
  • Psychology / statistics & numerical data