Improving the accuracy of mammography: volume and outcome relationships

J Natl Cancer Inst. 2002 Mar 6;94(5):369-75. doi: 10.1093/jnci/94.5.369.

Abstract

Background: Countries with centralized, high-volume mammography screening programs, such as the U.K. and Sweden, emphasize high specificity (low percentage of false positives) and high sensitivity (high percentage of true positives). By contrast, the United States does not have centralized, high-volume screening programs, emphasizes high sensitivity, and has lower average specificity. We investigated whether high sensitivity can be achieved in the context of high specificity and whether the number of mammograms read per radiologist (reader volume) drives both sensitivity and specificity.

Methods: The U.K.'s National Health Service Breast Screening Programme uses the PERFORMS 2 test as a teaching and assessment tool for radiologists. The same 60-film PERFORMS 2 test was given to 194 high-volume U.K. radiologists and to 60 U.S. radiologists, who were assigned to low-, medium-, or high-volume groups on the basis of the number of mammograms read per month. The standard binormal receiver-operating characteristic (ROC) model was fitted to the data of individual readers. Detection accuracy was measured by the sensitivity at specificity = 0.90, and differences among sensitivities were determined by analysis of variance.

Results: The average sensitivity at specificity = 0.90 was 0.785 for U.K. radiologists, 0.756 for high-volume U.S. radiologists, 0.702 for medium-volume U.S. radiologists, and 0.648 for low-volume U.S. radiologists. At this specificity, low-volume U.S. radiologists had statistically significantly lower sensitivity than either high-volume U.S. radiologists or U.K. radiologists, and medium-volume U.S. radiologists had statistically significantly lower sensitivity than U.K. radiologists (P<.001, for all comparisons).

Conclusions: Reader volume is an important determinant of mammogram sensitivity and specificity. High sensitivity (high cancer detection rate) can be achieved with high specificity (low false-positive rate) in high-volume centers. This study suggests that there is great potential for optimizing mammography screening.

MeSH terms

  • Breast Neoplasms / diagnostic imaging*
  • Breast Neoplasms / prevention & control
  • Clinical Competence / statistics & numerical data*
  • Diagnosis, Differential
  • Education, Medical, Continuing
  • Female
  • Humans
  • Mammography / standards*
  • Mammography / statistics & numerical data*
  • Mass Screening / standards
  • Observer Variation
  • Practice Patterns, Physicians' / statistics & numerical data*
  • Predictive Value of Tests
  • Process Assessment, Health Care*
  • ROC Curve
  • Sensitivity and Specificity
  • United Kingdom
  • United States