Purpose: To evaluate the impact of video sequences (cine-loops) on interobserver agreement (IOA) when using risk stratification systems (RSSs) for thyroid nodules (TNs).
Methods: Twenty TNs were randomly selected from a large database and evaluated by twelve experienced observers using five different RSSs (Kwak-, ACR-, EU-, Korean-TIRADS, and ATA Guidelines). First, the nodules were evaluated on static ultrasound (US) images in two planes ("static"). Six months later, the same observers reevaluated these cases using video sequences in two planes ("cine-loops"). Fleiss' kappa (κ) was calculated for the IOA analyses.
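For readers unfamiliar with the statistic, the following is a minimal sketch of how Fleiss' kappa can be computed from a table of rating counts. It is not the software used in the study, which the abstract does not name; the function name `fleiss_kappa` and the input layout (one row per nodule, one column per category, each cell holding the number of observers who chose that category) are assumptions made for illustration.

```python
from typing import Sequence


def fleiss_kappa(counts: Sequence[Sequence[int]]) -> float:
    """Fleiss' kappa for a subjects x categories table of rating counts.

    Hypothetical helper for illustration: counts[i][j] is the number of
    raters who assigned subject i to category j. Every row must sum to
    the same number of raters.
    """
    n_subjects = len(counts)
    if n_subjects == 0:
        raise ValueError("at least one subject is required")

    n_raters = sum(counts[0])
    if n_raters < 2:
        raise ValueError("at least two raters are required")
    if any(sum(row) != n_raters for row in counts):
        raise ValueError("every subject must be rated by the same number of raters")

    n_categories = len(counts[0])

    # Observed agreement per subject: share of rater pairs that agree.
    per_subject = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_observed = sum(per_subject) / n_subjects

    # Agreement expected by chance, from the overall category proportions.
    proportions = [
        sum(row[j] for row in counts) / (n_subjects * n_raters)
        for j in range(n_categories)
    ]
    p_expected = sum(p * p for p in proportions)

    if p_expected == 1.0:
        raise ValueError("kappa is undefined when all ratings fall in one category")

    return (p_observed - p_expected) / (1.0 - p_expected)
```

Under the design described above, such a table would have 20 rows (one per nodule), each summing to 12 (one rating per observer), with one column per category of the RSS being analyzed.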
Results: IOA on static images was moderate, with κ values of 0.46, 0.42, 0.40, 0.45, and 0.38 for the Kwak-, ACR-, EU-, Korean-TIRADS, and ATA Guidelines, respectively, whereas IOA on cine-loops was fair, with κ values of 0.41, 0.38, 0.37, 0.36, and 0.34 for the Kwak-, ACR-, EU-, Korean-TIRADS, and ATA Guidelines, respectively. The overall IOA was higher on static images than on cine-loops (p = 0.024). Among other findings, the subgroup analyses (by age, gender, US certificates, number of thyroid US examinations per week, and RSS experience) showed that the observers' experience in using RSSs had a significant influence on the IOA.
Conclusions: The overall IOA (all twelve observers and all five RSSs) was higher on static US images than on cine-loops. Likewise, the overall IOA for the five US features showed higher κ values on static images than on cine-loops. However, this effect was significantly smaller when the observers were highly experienced in the use of US RSSs for TNs.
Keywords: Thyroid Imaging Reporting and Data Systems (TIRADS); cine-loop; interobserver agreement; thyroid nodule; ultrasound.