Purpose: The quantitative detection of failure modes is important for making deep neural networks reliable and usable at scale. We consider three examples of common failure modes in image reconstruction and demonstrate the potential of uncertainty quantification as a fine-grained alarm system.
Methods: We propose a deterministic, modular, and lightweight approach called the Interval Neural Network (INN) that produces fast and easy-to-interpret uncertainty scores for deep neural networks. Importantly, INNs can be constructed post hoc for already trained prediction networks. We compare the approach against state-of-the-art baseline methods (MCDROP, PROBOUT).
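To illustrate the general idea, the sketch below propagates an input interval through a small pretrained ReLU network with interval arithmetic, yielding per-output lower and upper bounds whose width can serve as an uncertainty score. This is a minimal illustration under assumptions, not the authors' exact INN formulation: the network, the perturbation level eps, and the use of fixed (rather than interval-valued) weights are all illustrative choices.

```python
import torch
import torch.nn as nn

def interval_linear(layer: nn.Linear, lo: torch.Tensor, hi: torch.Tensor):
    """Propagate an interval through a linear layer in midpoint/radius form."""
    mid, rad = (lo + hi) / 2, (hi - lo) / 2
    out_mid = layer(mid)                      # W @ mid + b
    out_rad = rad @ layer.weight.abs().T      # |W| @ rad (no bias term)
    return out_mid - out_rad, out_mid + out_rad

def interval_forward(net: nn.Sequential, lo: torch.Tensor, hi: torch.Tensor):
    """Propagate bounds through Linear/ReLU layers (ReLU is monotone, so it
    can be applied to both bounds directly)."""
    for layer in net:
        if isinstance(layer, nn.Linear):
            lo, hi = interval_linear(layer, lo, hi)
        elif isinstance(layer, nn.ReLU):
            lo, hi = layer(lo), layer(hi)
    return lo, hi

if __name__ == "__main__":
    torch.manual_seed(0)
    # Stand-in for an already trained reconstruction network (illustrative sizes).
    net = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 16))
    x = torch.randn(1, 16)
    eps = 0.05                                # assumed input perturbation level
    lo, hi = interval_forward(net, x - eps, x + eps)
    uncertainty = hi - lo                     # per-output interval width as uncertainty score
    print(uncertainty.mean().item())
```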
Results: On controlled, synthetic inverse problems, we demonstrate the capacity of INNs to capture uncertainty due to noise as well as directional error information. On a real-world inverse problem with human CT scans, we show that INNs produce uncertainty scores that improve the detection of all considered failure modes compared to the baseline methods.
Conclusion: Interval Neural Networks offer a promising tool for exposing weaknesses of deep image reconstruction models and ultimately making them more reliable. Because they can be applied post hoc to equip already trained deep neural network models with uncertainty scores, they are particularly attractive for deployment.
Keywords: Deep learning; Failure modes; Image reconstruction; Uncertainty quantification.
© 2021. The Author(s).