Towards interactive AI-authoring with prototypical few-shot classifiers in histopathology

Petr Kuritcyn; Rosalie Kletzander; Sophia Eisenberg; Thomas Wittenberg; Volker Bruns; Katja Evert; Felix Keil; Paul K Ziegler; Katrin Bankov; Peter Wild; Markus Eckstein; Arndt Hartmann; Carol I Geppert; Michaela Benz

doi:10.1016/j.jpi.2024.100388

Towards interactive AI-authoring with prototypical few-shot classifiers in histopathology

J Pathol Inform. 2024 Jun 6:15:100388. doi: 10.1016/j.jpi.2024.100388. eCollection 2024 Dec.

Authors

Petr Kuritcyn¹, Rosalie Kletzander¹, Sophia Eisenberg¹, Thomas Wittenberg¹, Volker Bruns¹, Katja Evert², Felix Keil², Paul K Ziegler^{3

4}, Katrin Bankov^{4

5}, Peter Wild^{3

4

6}, Markus Eckstein^{7

8}, Arndt Hartmann^{7

8}, Carol I Geppert^{7

8}, Michaela Benz¹

Affiliations

¹ Fraunhofer IIS, Fraunhofer Institute for Integrated Circuits IIS, Medical Image Analysis Group, Erlangen, Germany.
² Institute of Pathology, University of Regensburg, Regensburg, Germany.
³ Frankfurt Cancer Institute (FCI), Goethe University Frankfurt, Frankfurt am Main, Germany.
⁴ Dr. Senckenberg Institute of Pathology, University Hospital Frankfurt, Goethe University Frankfurt, Frankfurt am Main, Germany.
⁵ Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Department of Pediatric Oncology and Hematology, Germany.
⁶ Frankfurt Institute for Advanced Studies (FIAS), Frankfurt am Main, Germany.
⁷ Institute of Pathology, University Hospital Erlangen, FAU Erlangen-Nuremberg, Erlangen, Germany.
⁸ Comprehensive Cancer Center Erlangen-EMN (CCC), University Hospital Erlangen, FAU Erlangen-Nuremberg, Erlangen, Germany.

Abstract

A vast multitude of tasks in histopathology could potentially benefit from the support of artificial intelligence (AI). Many examples have been shown in the literature and first commercial products with FDA or CE-IVDR clearance are available. However, two key challenges remain: (1) a scarcity of thoroughly annotated images, respectively the laboriousness of this task, and (2) the creation of robust models that can cope with the data heterogeneity in the field (domain generalization). In this work, we investigate how the combination of prototypical few-shot classification models and data augmentation can address both of these challenges. Based on annotated data sets that include multiple centers, multiple scanners, and two tumor entities, we examine the robustness and the adaptability of few-shot classifiers in multiple scenarios. We demonstrate that data from one scanner and one site are sufficient to train robust few-shot classification models by applying domain-specific data augmentation. The models achieved classification performance of around 90% on a multiscanner and multicenter database, which is on par with the accuracy achieved on the primary single-center single-scanner data. Various convolutional neural network (CNN) architectures can be used for feature extraction in the few-shot model. A comparison of nine state-of-the-art architectures yielded that EfficientNet B0 provides the best trade-off between accuracy and inference time. The classification of prototypical few-shot models directly relies on class prototypes derived from example images of each class. Therefore, we investigated the influence of prototypes originating from images from different scanners and evaluated their performance also on the multiscanner database. Again, our few-shot model showed a stable performance with an average absolute deviation in accuracy compared to the primary prototypes of 1.8% points. Finally, we examined the adaptability to a new tumor entity: classification of tissue sections containing urothelial carcinoma into normal, tumor, and necrotic regions. Only three annotations per subclass (e.g., muscle and adipose tissue are subclasses of normal tissue) were provided to adapt the few-shot model, which obtained an overall accuracy of 93.6%. These results demonstrate that prototypical few-shot classification is an ideal technology for realizing an interactive AI authoring system as it only requires few annotations and can be adapted to new tasks without involving retraining of the underlying feature extraction CNN, which would in turn require a selection of hyper-parameters based on data science expert knowledge. Similarly, it can be regarded as a guided annotation system. To this end, we realized a workflow and user interface that targets non-technical users.

Keywords: Colon adenocarcinoma; Data augmentation; Digital pathology; Few-shot learning; Prototypical networks; Tissue classification; Urothelial carcinoma.