Improving the accuracy of mammography: volume and outcome relationships

Laura Esserman; Helen Cowley; Carey Eberle; Alastair Kirkpatrick; Sophia Chang; Kevin Berbaum; Alastair Gale

doi:10.1093/jnci/94.5.369

Improving the accuracy of mammography: volume and outcome relationships

J Natl Cancer Inst. 2002 Mar 6;94(5):369-75. doi: 10.1093/jnci/94.5.369.

Authors

Laura Esserman¹, Helen Cowley, Carey Eberle, Alastair Kirkpatrick, Sophia Chang, Kevin Berbaum, Alastair Gale

Affiliation

¹ Department of Surgery, University of California at San Francisco, 94143-1710, USA. [email protected]

PMID: 11880475
DOI: 10.1093/jnci/94.5.369

Abstract

Background: Countries with centralized, high-volume mammography screening programs, such as the U.K. and Sweden, emphasize high specificity (low percentage of false positives) and high sensitivity (high percentage of true positives). By contrast, the United States does not have centralized, high-volume screening programs, emphasizes high sensitivity, and has lower average specificity. We investigated whether high sensitivity can be achieved in the context of high specificity and whether the number of mammograms read per radiologist (reader volume) drives both sensitivity and specificity.

Methods: The U.K.'s National Health Service Breast Screening Programme uses the PERFORMS 2 test as a teaching and assessment tool for radiologists. The same 60-film PERFORMS 2 test was given to 194 high-volume U.K. radiologists and to 60 U.S. radiologists, who were assigned to low-, medium-, or high-volume groups on the basis of the number of mammograms read per month. The standard binormal receiver-operating characteristic (ROC) model was fitted to the data of individual readers. Detection accuracy was measured by the sensitivity at specificity = 0.90, and differences among sensitivities were determined by analysis of variance.

Results: The average sensitivity at specificity = 0.90 was 0.785 for U.K. radiologists, 0.756 for high-volume U.S. radiologists, 0.702 for medium-volume U.S. radiologists, and 0.648 for low-volume U.S. radiologists. At this specificity, low-volume U.S. radiologists had statistically significantly lower sensitivity than either high-volume U.S. radiologists or U.K. radiologists, and medium-volume U.S. radiologists had statistically significantly lower sensitivity than U.K. radiologists (P<.001, for all comparisons).

Conclusions: Reader volume is an important determinant of mammogram sensitivity and specificity. High sensitivity (high cancer detection rate) can be achieved with high specificity (low false-positive rate) in high-volume centers. This study suggests that there is great potential for optimizing mammography screening.

MeSH terms

Breast Neoplasms / diagnostic imaging*
Breast Neoplasms / prevention & control
Clinical Competence / statistics & numerical data*
Diagnosis, Differential
Education, Medical, Continuing
Female
Humans
Mammography / standards*
Mammography / statistics & numerical data*
Mass Screening / standards
Observer Variation
Practice Patterns, Physicians' / statistics & numerical data*
Predictive Value of Tests
Process Assessment, Health Care*
ROC Curve
Sensitivity and Specificity
United Kingdom
United States