Towards safe and reliable deep learning for lung nodule malignancy estimation using out-of-distribution detection

Dré Peeters; Kiran V Venkadesh; Renate Dinnessen; Zaigham Saghir; Ernst T Scholten; Rozemarijn Vliegenthart; Mathias Prokop; Colin Jacobs

doi:10.1016/j.compbiomed.2024.109633

Towards safe and reliable deep learning for lung nodule malignancy estimation using out-of-distribution detection

Comput Biol Med. 2024 Dec 29:186:109633. doi: 10.1016/j.compbiomed.2024.109633. Online ahead of print.

Authors

Dré Peeters¹, Kiran V Venkadesh², Renate Dinnessen², Zaigham Saghir³, Ernst T Scholten², Rozemarijn Vliegenthart⁴, Mathias Prokop⁵, Colin Jacobs²

Affiliations

¹ Diagnostic Imaging Analysis Group, Medical Imaging Department, Radboud University Medical Center, Geert Grooteplein Zuid 10, 6525 GA, Nijmegen, the Netherlands. Electronic address: [email protected].
² Diagnostic Imaging Analysis Group, Medical Imaging Department, Radboud University Medical Center, Geert Grooteplein Zuid 10, 6525 GA, Nijmegen, the Netherlands.
³ Department of Medicine, Section of Pulmonary Medicine, Herlev-Gentofte Hospital, Hellerup, Denmark; Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark.
⁴ Department of Radiology, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9700RB, Groningen, the Netherlands.
⁵ Diagnostic Imaging Analysis Group, Medical Imaging Department, Radboud University Medical Center, Geert Grooteplein Zuid 10, 6525 GA, Nijmegen, the Netherlands; Department of Radiology, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9700RB, Groningen, the Netherlands.

PMID: 39736253
DOI: 10.1016/j.compbiomed.2024.109633

Abstract

Artificial Intelligence (AI) models may fail or suffer from reduced performance when applied to unseen data that differs from the training data distribution, referred to as dataset shift. Automatic detection of out-of-distribution (OOD) data contributes to safe and reliable clinical implementation of AI models. In this study, we propose a recognized OOD detection method that utilizes the Mahalanobis distance (MD) and compare its performance to widely known classical methods. The MD measures the similarity between features of an unseen sample and the distribution of development samples features of intermediate model layers. We integrate our proposed method in an existing deep learning (DL) model for lung nodule malignancy risk estimation on chest CT and validate it across four dataset shifts known to reduce AI model performance. The results show that our proposed method outperforms the classical methods and can effectively detect near- and far-OOD samples across all datasets with different data distribution shifts. Additionally, we demonstrate that our proposed method can seamlessly incorporate additional In-distribution (ID) data while maintaining the ability to accurately differentiate between the remaining OOD cases. Lastly, we searched for the optimal OOD threshold in the OOD dataset where the performance of the DL model stays reliable, however no decline in DL performance was revealed as the OOD score increased.

Keywords: Chest CT; Deep learning; Lung nodule risk estimation; Out-of-distribution detection.