Measuring intrarater association between correlated ordinal ratings

Biom J. 2020 Nov;62(7):1687-1701. doi: 10.1002/bimj.201900177. Epub 2020 Jun 11.

Abstract

Variability between raters' ordinal scores is commonly observed in imaging tests, leading to uncertainty in the diagnostic process. In breast cancer screening, a radiologist visually interprets mammograms and MRIs, while skin diseases, Alzheimer's disease, and psychiatric conditions are graded based on clinical judgment. Consequently, studies are often conducted in clinical settings to investigate whether a new training tool can improve the interpretive performance of raters. In such studies, each member of a large group of experts classifies a set of patients' test results on two separate occasions, before and after some form of training, with the goal of assessing the impact of training on the experts' paired ratings. However, because the ordinal ratings are correlated, few statistical approaches are available to measure association between raters' paired scores. Existing measures are restricted to assessing association at just one time point for a single screening test. We propose a novel paired kappa that provides a summary measure of association between many raters' paired ordinal assessments of patients' test results before versus after rater training. Intrarater association also provides valuable insight into the consistency of ratings when raters view a patient's test results on two occasions with no intervention between viewings. In contrast to existing measures for correlated ratings, the proposed kappa provides an overall evaluation of the association among multiple raters' scores from two time points and is robust to the underlying disease prevalence. We apply the proposed approach to two recent breast-imaging studies and conduct extensive simulation studies to evaluate the properties and performance of our summary measure of association.
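
To make the data layout concrete, the following is a minimal sketch of a conventional quadratic-weighted Cohen's kappa computed for a single rater's paired ordinal scores (first viewing versus second viewing of the same patients). This is not the model-based paired kappa proposed in the article, whose formula the abstract does not give; it is a standard single-rater statistic, and unlike the proposed measure it does not pool over many raters and is known to depend on category prevalence. The function name `weighted_kappa`, its arguments, and the example scores are hypothetical and chosen only for illustration.

```python
def weighted_kappa(before, after, n_categories):
    """Quadratic-weighted Cohen's kappa for ONE rater's paired ordinal scores.

    Illustrative only: this is the classical statistic, not the article's
    model-based paired kappa. Scores are integers in 0..n_categories-1.
    """
    if len(before) != len(after) or len(before) == 0:
        raise ValueError("before and after must be non-empty and equal length")
    n = len(before)
    k = n_categories

    # Observed joint proportions of (first-viewing, second-viewing) scores.
    observed = [[0.0] * k for _ in range(k)]
    for b, a in zip(before, after):
        observed[b][a] += 1.0 / n

    # Marginal proportions for each viewing.
    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    # Quadratic disagreement weights: 0 on the diagonal, 1 at maximum distance.
    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(k):
        for j in range(k):
            w = ((i - j) ** 2) / ((k - 1) ** 2)
            weighted_observed += w * observed[i][j]
            weighted_expected += w * row[i] * col[j]

    if weighted_expected == 0.0:
        raise ValueError("kappa is undefined when all scores fall in one category")
    return 1.0 - weighted_observed / weighted_expected


# Hypothetical scores on a 4-category ordinal scale for the same six patients.
first_viewing = [0, 1, 2, 3, 1, 2]
second_viewing = [0, 1, 2, 3, 2, 2]
kappa = weighted_kappa(first_viewing, second_viewing, n_categories=4)
```

A value of 1 indicates identical paired scores, and a value near 0 indicates association no better than chance given the two sets of marginal proportions. Computing this separately for every rater yields many correlated statistics rather than one summary, which is the gap the proposed paired kappa is described as addressing.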

Keywords: association; generalized linear mixed model; model-based kappa; ordinal classifications; screening test.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Breast Neoplasms* / diagnostic imaging
  • Computer Simulation
  • Diagnostic Tests, Routine
  • Early Detection of Cancer
  • Female
  • Humans
  • Mammography*
  • Observer Variation*
  • Reproducibility of Results