The reliability of clinical tonsil size grading in children

Divjot S Kumar; Dianne Valenzuela; Frederick K Kozak; Jeffrey P Ludemann; J Paul Moxham; Jane Lea; Neil K Chadha

doi:10.1001/jamaoto.2014.2338

The reliability of clinical tonsil size grading in children

JAMA Otolaryngol Head Neck Surg. 2014 Nov;140(11):1034-7. doi: 10.1001/jamaoto.2014.2338.

Authors

Divjot S Kumar¹, Dianne Valenzuela¹, Frederick K Kozak², Jeffrey P Ludemann², J Paul Moxham², Jane Lea², Neil K Chadha²

Affiliations

¹ Medical student at University of British Columbia, British Columbia, Canada.
² University of British Columbia, British Columbia, Canada3Division of Pediatric Otolaryngology, British Columbia Children's Hospital, British Columbia, Canada.

PMID: 25317509
DOI: 10.1001/jamaoto.2014.2338

Abstract

Importance: Because tonsillar enlargement can have substantial ill health effects in children, reliable monitoring and documentation of tonsil size is necessary in clinical settings. Tonsil grading scales potentially allow clinicians to precisely record and communicate changes in tonsil size, but their reliability in a clinical setting has not been studied.

Objective: To assess the interobserver and intraobserver reliability of the Brodsky and Friedman tonsil size grading scales and a novel 3-grade scale.

Design, setting, and participants: Cross-sectional study between June 2012 and August 2013 at a tertiary pediatric otolaryngology outpatient clinic at British Columbia Children's Hospital. We recruited 116 children, aged 3 to 14 years, with no major craniofacial abnormalities. For each child, 2 separate tonsil assessments (with at least a 5-minute interval in between) were conducted by 4 independent observers: 2 staff pediatric otolaryngologists, 1 otolaryngology trainee (fellow or resident), and 1 medical student. Each observer assessed and graded tonsil sizes using 3 different scales.

Main outcomes and measures: Interobserver and intraobserver reliabilities were assessed by deriving the intraclass correlation coefficients (ICCs) and Pearson correlation coefficients, respectively. To discount for any asymmetric scores, all data analysis was conducted on the left tonsil measurement only.

Results: Mean interobserver reliability was highest for the Brodsky grading scale (ICC, 0.721; Cronbach α, 0.911), followed by the Friedman grading scale (ICC, 0.647; Cronbach α, 0.879) and the 3-grade scale (ICC, 0.599; Cronbach α, 0.857). The mean intraobserver reliabilities for the Brodsky, Friedman, and modified 3-grade scales were 0.954, 0.932, and 0.927, respectively.

Conclusions and relevance: The Brodsky grading scale offered the highest interobserver and intraobserver reliability when compared with the Friedman and novel 3-grade scales. The results of this study would support the uniform use of the Brodsky scale for future clinical and research work.

Publication types

Comparative Study
Research Support, Non-U.S. Gov't

MeSH terms

Adolescent
Child
Child, Preschool
Cross-Sectional Studies
Female
Humans
Male
Observer Variation
Organ Size
Palatine Tonsil / pathology*