Background: The Odom criteria, established in 1958, are a widely used 4-point rating scale for assessing clinical outcome after cervical spine surgery. Surprisingly, to our knowledge, the Odom criteria have never been validated. The aim of this study was to investigate the reliability and validity of the Odom criteria for the evaluation of surgical procedures of the cervical spine.
Methods: Patients with degenerative cervical spine disease were included in the study and divided into 2 subgroups on the basis of their predominant symptom: myelopathy or radiculopathy. Reliability was assessed with interrater and test-retest designs, using quadratic weighted kappa coefficients. Construct validity was assessed by means of hypothesis testing. To evaluate whether the Odom criteria could act as a global perceived effect (GPE) scale, we assessed concurrent validity by comparing area under the curve (AUC) values of receiver operating characteristic (ROC) curves for the set of questionnaires.
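For readers unfamiliar with the quadratic weighted kappa named in the Methods, the following is a minimal sketch of how such a coefficient can be computed for a 4-point ordinal scale. It is an illustration under stated assumptions, not the analysis code used in the study: the function name, the coding of the four Odom grades as integers 0 to 3, and the example ratings are all hypothetical.

```python
def quadratic_weighted_kappa(rater_a, rater_b, n_categories=4):
    """Quadratic weighted kappa for two lists of ordinal ratings.

    Ratings are assumed to be integers in 0..n_categories-1
    (a hypothetical coding of the four Odom grades).
    """
    n = len(rater_a)
    if n == 0 or n != len(rater_b):
        raise ValueError("need two non-empty rating lists of equal length")
    k = n_categories

    # Observed agreement table: observed[i][j] counts patients rated
    # i by the first rater and j by the second.
    observed = [[0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[a][b] += 1

    row_totals = [sum(row) for row in observed]
    col_totals = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(k):
        for j in range(k):
            # Quadratic disagreement weight: 0 on the diagonal,
            # 1 for the largest possible disagreement.
            weight = (i - j) ** 2 / (k - 1) ** 2
            weighted_observed += weight * observed[i][j]
            # Count expected by chance from the marginal totals.
            weighted_expected += weight * row_totals[i] * col_totals[j] / n

    if weighted_expected == 0:
        raise ValueError("kappa is undefined when all ratings fall in one category")
    return 1.0 - weighted_observed / weighted_expected


# Hypothetical ratings of eight patients by two raters.
ratings_first = [0, 0, 1, 1, 2, 2, 3, 3]
ratings_second = [0, 1, 1, 1, 2, 3, 3, 3]
kappa = quadratic_weighted_kappa(ratings_first, ratings_second)
```

Because the weights grow with the square of the distance between grades, a disagreement of one grade is penalized far less than a disagreement between the best and worst grades, which is why this variant is commonly chosen for ordinal scales.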
Results: A total of 110 patients were included in the study; 19 were excluded, leaving 91 in our analysis. Reliability assessments showed κ = 0.77 for overall interrater reliability and κ = 0.93 for overall test-retest reliability. Interrater reliability was κ = 0.81 for the radiculopathy subgroup and κ = 0.68 for the myelopathy subgroup. At least 75% of the hypotheses were met. The AUCs showed similar characteristics between the Odom criteria and GPE scale.
Conclusions: The Odom criteria met the predefined criteria for reliability and validity. Therefore, the Odom criteria may be used to assess surgical outcome after a cervical spine procedure, specifically for patients presenting with radicular symptoms. The results of previous studies that have been deemed less trustworthy because of the use of the Odom criteria should be reconsidered.