Advancements in diagnosing oral potentially malignant disorders: leveraging Vision transformers for multi-class detection

Shankeeth Vinayahalingam; Niels van Nistelrooij; René Rothweiler; Alessandro Tel; Tim Verhoeven; Daniel Tröltzsch; Marco Kesting; Stefaan Bergé; Tong Xi; Max Heiland; Tabea Flügge

doi:10.1007/s00784-024-05762-8

Advancements in diagnosing oral potentially malignant disorders: leveraging Vision transformers for multi-class detection

Clin Oral Investig. 2024 Jun 8;28(7):364. doi: 10.1007/s00784-024-05762-8.

Authors

Shankeeth Vinayahalingam^{1

2

3}, Niels van Nistelrooij^{1

4}, René Rothweiler⁵, Alessandro Tel⁶, Tim Verhoeven¹, Daniel Tröltzsch⁴, Marco Kesting⁷, Stefaan Bergé¹, Tong Xi¹, Max Heiland⁴, Tabea Flügge⁸

Affiliations

¹ Department of Oral and Maxillofacial Surgery, Radboud University Medical Centre, Nijmegen, the Netherlands.
² Department of Artificial Intelligence, Radboud University, Nijmegen, the Netherlands.
³ Department of Oral and Maxillofacial Surgery, Universitätsklinikum Münster, Münster, Germany.
⁴ Department of Oral and Maxillofacial Surgery, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt- Universität zu Berlin, Hindenburgdamm 30, 12203, Berlin, Germany.
⁵ Department of Oral and Maxillofacial Surgery, Translational Implantology, Medical Center, Faculty of Medicine, University of Freiburg, University of Freiburg, Freiburg, Germany.
⁶ Clinic of Maxillofacial Surgery, Head&Neck and Neuroscience Department, University Hospital of Udine, Udine, Italy.
⁷ Department of Oral and Cranio-Maxillofacial Surgery, Friedrich-Alexander-University Erlangen- Nuremberg (FAU), Erlangen, Germany.
⁸ Department of Oral and Maxillofacial Surgery, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt- Universität zu Berlin, Hindenburgdamm 30, 12203, Berlin, Germany. [email protected].

Abstract

Objectives: Diagnosing oral potentially malignant disorders (OPMD) is critical to prevent oral cancer. This study aims to automatically detect and classify the most common pre-malignant oral lesions, such as leukoplakia and oral lichen planus (OLP), and distinguish them from oral squamous cell carcinomas (OSCC) and healthy oral mucosa on clinical photographs using vision transformers.

Methods: 4,161 photographs of healthy mucosa, leukoplakia, OLP, and OSCC were included. Findings were annotated pixel-wise and reviewed by three clinicians. The photographs were divided into 3,337 for training and validation and 824 for testing. The training and validation images were further divided into five folds with stratification. A Mask R-CNN with a Swin Transformer was trained five times with cross-validation, and the held-out test split was used to evaluate the model performance. The precision, F1-score, sensitivity, specificity, and accuracy were calculated. The area under the receiver operating characteristics curve (AUC) and the confusion matrix of the most effective model were presented.

Results: The detection of OSCC with the employed model yielded an F1 of 0.852 and AUC of 0.974. The detection of OLP had an F1 of 0.825 and AUC of 0.948. For leukoplakia the F1 was 0.796 and the AUC was 0.938.

Conclusions: OSCC were effectively detected with the employed model, whereas the detection of OLP and leukoplakia was moderately effective.

Clinical relevance: Oral cancer is often detected in advanced stages. The demonstrated technology may support the detection and observation of OPMD to lower the disease burden and identify malignant oral cavity lesions earlier.

Keywords: Artificial Intelligence; Deep learning; Leukoplakia; Malignant transformation; Oral lichen planus; Oral squamous cell carcinoma.

MeSH terms

Carcinoma, Squamous Cell / diagnosis
Diagnosis, Differential
Female
Humans
Image Interpretation, Computer-Assisted / methods
Leukoplakia, Oral* / diagnosis
Lichen Planus, Oral* / diagnosis
Male
Mouth Neoplasms* / diagnosis
Photography
Photography, Dental
Precancerous Conditions* / diagnosis
Sensitivity and Specificity