Mitigating Bias in Radiology Machine Learning: 3. Performance Metrics

Radiol Artif Intell. 2022 Aug 24;4(5):e220061. doi: 10.1148/ryai.220061. eCollection 2022 Sep.

Abstract

The increasing use of machine learning (ML) algorithms in clinical settings raises concerns about bias in ML models. Bias can arise at any step of ML creation, including data handling, model development, and performance evaluation. Potential biases in the ML model can be minimized by implementing these steps correctly. This report focuses on performance evaluation and discusses model fitness, as well as a set of performance evaluation toolboxes: namely, performance metrics, performance interpretation maps, and uncertainty quantification. By discussing the strengths and limitations of each toolbox, our report highlights strategies and considerations to mitigate and detect biases during performance evaluations of radiology artificial intelligence models. © RSNA, 2022.

Keywords: Convolutional Neural Network (CNN); Diagnosis; Segmentation.