Context: Field validation of slides used in gynecologic cytology proficiency testing has surfaced as an important issue. Although the precision of diagnoses in peer-reviewed educational programs has been examined, the robustness of the validation criteria for specific types of interpretations used in proficiency testing has not been previously studied.
Objective: To evaluate the robustness of validation criteria for slides entering an educational slide program.
Design: We reviewed the results of the College of American Pathologists Interlaboratory Comparison Program in Cervicovaginal Cytology and compared the robustness of validation criteria for different reference diagnoses, using a total of 16,948 circulating slides.
Results: Validation criteria could be divided into 2 significantly different groups. The criteria for herpes, Trichomonas, squamous cell carcinoma, and adenocarcinoma were significantly more robust than the diagnoses of unsatisfactory; negative for intraepithelial lesion and malignancy, not otherwise specified; low-grade squamous intraepithelial lesion; and high-grade squamous intraepithelial lesion (P < .001).
Conclusions: The validation criteria used in the College of American Pathologists Interlaboratory Comparison Program in Cervicovaginal Cytology show 2 different levels of robustness or redundancy. These results have implications for the design of fair proficiency tests. Proficiency testing can be designed with the necessary number of reviews needed for slide validation.