Variability of Breast Density Classification Between US and UK Radiologists

Wijdan Alomaim; Desiree O'Leary; John Ryan; Louise Rainford; Michael Evanoff; Shane Foley

doi:10.1016/j.jmir.2018.11.002

Variability of Breast Density Classification Between US and UK Radiologists

J Med Imaging Radiat Sci. 2019 Mar;50(1):53-61. doi: 10.1016/j.jmir.2018.11.002. Epub 2019 Jan 5.

Authors

Wijdan Alomaim¹, Desiree O'Leary², John Ryan³, Louise Rainford³, Michael Evanoff⁴, Shane Foley³

Affiliations

¹ Radiography & Diagnostic Imaging, School of Medicine, University College Dublin, Ireland. Electronic address: [email protected].
² Radiography, Keele University, Staffordshire, UK.
³ Radiography & Diagnostic Imaging, School of Medicine, University College Dublin, Ireland.
⁴ American Board of Radiology, Tucson, Arizona, USA.

PMID: 30777249
DOI: 10.1016/j.jmir.2018.11.002

Abstract

Purpose: To assess whether subjective breast density categorization remains the most useful way to categorize mammographic breast density and whether variations exist across geographic regions with differing national legislation.

Methods: Breast radiologists from two countries (UK, USA) were voluntarily recruited to review sets of anonymized mammographic images (n = 180) and additional repeated images (n = 70), totaling 250 images, to subjectively rate breast density according to the Breast Imaging Reporting and Data system (BI-RADS) categorization. Images were reviewed using standardized viewing conditions and Ziltron software. Inter-rater reliability was analyzed using the Kappa test.

Results: The US radiologists (n = 25) judged fewer images as being "mostly fatty" than UK radiologists (n = 24), leading a greater number of images classified in the higher BI-RADS categories, particularly in BI-RADS 3. Overall agreement for all data sets was k = 0.654 indicating substantial agreement between the two cohorts. When the data were split into BI-RADS categories, the level of agreement varied from fair to substantial.

Conclusion: Variations in how radiologists from the USA and UK classify breast density was established, especially when the data were divided into breast density categories. This variation supports the need for a reliable breast density assessment method to enhance the individualized supplemental screening pathways for dense breasts. The use of two-scale categorization method demonstrated improved agreement.

Advances in knowledge: Larger sample of radiologists from different breast density jurisdictions confirms international subjective variability in density categorization and improved agreement with the two-scale (low, high) categorization. With this variability, a standardized and automated breast density assessment shows to be timely.

Keywords: BI-RADS; Breast density; intrarater variability; mammography.

MeSH terms

Breast Density / physiology*
Female
Humans
Mammography / classification*
Mammography / standards
Mammography / statistics & numerical data*
Observer Variation
Radiologists / standards
Radiologists / statistics & numerical data*
United Kingdom