A systematic comparison of deep learning methods for Gleason grading and scoring

Med Image Anal. 2024 Jul:95:103191. doi: 10.1016/j.media.2024.103191. Epub 2024 May 4.

Abstract

Prostate cancer is the second most frequent cancer in men worldwide after lung cancer. Its diagnosis is based on the identification of the Gleason score that evaluates the abnormality of cells in glands through the analysis of the different Gleason patterns within tissue samples. The recent advancements in computational pathology, a domain aiming at developing algorithms to automatically analyze digitized histopathology images, lead to a large variety and availability of datasets and algorithms for Gleason grading and scoring. However, there is no clear consensus on which methods are best suited for each problem in relation to the characteristics of data and labels. This paper provides a systematic comparison on nine datasets with state-of-the-art training approaches for deep neural networks (including fully-supervised learning, weakly-supervised learning, semi-supervised learning, Additive-MIL, Attention-Based MIL, Dual-Stream MIL, TransMIL and CLAM) applied to Gleason grading and scoring tasks. The nine datasets are collected from pathology institutes and openly accessible repositories. The results show that the best methods for Gleason grading and Gleason scoring tasks are fully supervised learning and CLAM, respectively, guiding researchers to the best practice to adopt depending on the task to solve and the labels that are available.

Keywords: Computational pathology; Deep learning; Full supervision; Multiple-instance learning; Prostate cancer; Semi-supervision; Weak supervision.

Publication types

  • Comparative Study

MeSH terms

  • Algorithms
  • Deep Learning*
  • Humans
  • Image Interpretation, Computer-Assisted / methods
  • Male
  • Neoplasm Grading*
  • Prostatic Neoplasms* / diagnostic imaging
  • Prostatic Neoplasms* / pathology