Deep learning assisted mitotic counting for breast cancer

Maschenka C A Balkenhol; David Tellez; Willem Vreuls; Pieter C Clahsen; Hans Pinckaers; Francesco Ciompi; Peter Bult; Jeroen A W M van der Laak

doi:10.1038/s41374-019-0275-0

Lab Invest. 2019 Nov;99(11):1596-1606. doi: 10.1038/s41374-019-0275-0. Epub 2019 Jun 20.

Authors

Maschenka C A Balkenhol¹, David Tellez², Willem Vreuls³, Pieter C Clahsen⁴, Hans Pinckaers², Francesco Ciompi², Peter Bult², Jeroen A W M van der Laak²,⁵

Affiliations

¹ Department of Pathology, Radboud University Medical Center, Nijmegen, Netherlands.
² Department of Pathology, Radboud University Medical Center, Nijmegen, Netherlands.
³ Department of Pathology, Canisius Wilhelmina Hospital, Nijmegen, Netherlands.
⁴ Department of Pathology, Haaglanden Medical Center, 's-Gravenhage, Netherlands.
⁵ Center for Medical Image Science and Visualization, Linköping University, Linköping, Sweden.

PMID: 31222166
DOI: 10.1038/s41374-019-0275-0

Abstract

As part of routine histological grading, the mitotic count of every invasive breast cancer is assessed by counting mitoses in the visually selected region with the highest proliferative activity. Because this procedure is prone to subjectivity, the present study compares visual mitotic counting with deep learning-based automated mitotic counting and fully automated hotspot selection. Two cohorts were used. Cohort A comprised 90 prospectively included tumors, selected based on the mitotic frequency scores given by a pathologist during routine glass slide diagnostics. This pathologist additionally assessed the mitotic count of these tumors in whole slide images (WSI) within a preselected hotspot. A second observer performed the same procedures on this cohort. The preselected hotspot was generated by a convolutional neural network (CNN) trained to detect all mitotic figures in digitized hematoxylin and eosin (H&E) sections. The second cohort was a multicenter, retrospective triple-negative breast cancer (TNBC) cohort (n = 298), in which the mitotic count was assessed by three independent observers on glass slides. The same CNN was applied to this cohort, and the absolute number of mitotic figures in the hotspot was compared with the averaged mitotic count of the observers. Baseline interobserver agreement for glass slide assessment in cohort A was good (kappa 0.689; 95% CI 0.580-0.799). Using the CNN-generated hotspot in WSI, the kappa increased to 0.814 (95% CI 0.719-0.909). Automated counting by the CNN, compared with observers counting in the predefined hotspot region, yielded an average kappa of 0.724. We conclude that manual mitotic counting is not affected by assessment modality (glass slides, WSI) and that counting mitotic figures in WSI is feasible. Using a predefined hotspot area considerably improves reproducibility. In addition, fully automated assessment of the mitotic score appears to be feasible without introducing additional bias or variability.
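The agreement figures above are kappa statistics. As an illustration of what such a number measures, the following minimal sketch computes an unweighted Cohen's kappa for two observers who each assign a mitotic score of 1, 2, or 3 to the same tumors. This is an assumption for illustration only: the abstract does not say whether the study used a weighted or unweighted kappa, and the function name `cohen_kappa` and the example scores are hypothetical, not study data.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa for two raters scoring the same cases.

    Assumption: the source abstract does not specify the kappa variant;
    this is the plain unweighted form, shown for illustration only.
    """
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("both raters must score the same non-empty set of cases")

    n = len(ratings_a)

    # Observed agreement: fraction of cases where both raters gave the same score.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # Chance agreement: probability that both raters pick the same category
    # if each rated independently according to their own score frequencies.
    freq_a = Counter(ratings_a)
    freq_b = Counter(ratings_b)
    categories = set(freq_a) | set(freq_b)
    expected = sum(freq_a[c] * freq_b[c] for c in categories) / (n * n)

    if expected == 1.0:
        # Both raters used a single identical category for every case.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical mitotic scores (1-3) for ten tumors; NOT data from the study.
observer_1 = [1, 1, 2, 2, 3, 3, 1, 2, 3, 1]
observer_2 = [1, 1, 2, 3, 3, 3, 1, 2, 2, 1]
kappa = cohen_kappa(observer_1, observer_2)
print(f"kappa = {kappa:.3f}")
```

In this made-up example the observers agree on 8 of 10 tumors, but kappa is lower than 0.8 because part of that agreement would be expected by chance alone.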

Publication types

Multicenter Study
Research Support, Non-U.S. Gov't

MeSH terms

Adult
Aged
Breast Neoplasms / pathology*
Cohort Studies
Deep Learning* / statistics & numerical data
Diagnosis, Computer-Assisted
Female
Humans
Middle Aged
Mitotic Index / methods*
Mitotic Index / statistics & numerical data
Netherlands
Neural Networks, Computer
Observer Variation
Prospective Studies
Reproducibility of Results
Retrospective Studies