The role of explainability in creating trustworthy artificial intelligence for health care: A comprehensive survey of the terminology, design choices, and evaluation strategies

Aniek F Markus; Jan A Kors; Peter R Rijnbeek

doi:10.1016/j.jbi.2020.103655

The role of explainability in creating trustworthy artificial intelligence for health care: A comprehensive survey of the terminology, design choices, and evaluation strategies

J Biomed Inform. 2021 Jan:113:103655. doi: 10.1016/j.jbi.2020.103655. Epub 2020 Dec 10.

Authors

Aniek F Markus¹, Jan A Kors², Peter R Rijnbeek²

Affiliations

¹ Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands. Electronic address: [email protected].
² Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands.

PMID: 33309898
DOI: 10.1016/j.jbi.2020.103655

Abstract

Artificial intelligence (AI) has huge potential to improve the health and well-being of people, but adoption in clinical practice is still limited. Lack of transparency is identified as one of the main barriers to implementation, as clinicians should be confident the AI system can be trusted. Explainable AI has the potential to overcome this issue and can be a step towards trustworthy AI. In this paper we review the recent literature to provide guidance to researchers and practitioners on the design of explainable AI systems for the health-care domain and contribute to formalization of the field of explainable AI. We argue the reason to demand explainability determines what should be explained as this determines the relative importance of the properties of explainability (i.e. interpretability and fidelity). Based on this, we propose a framework to guide the choice between classes of explainable AI methods (explainable modelling versus post-hoc explanation; model-based, attribution-based, or example-based explanations; global and local explanations). Furthermore, we find that quantitative evaluation metrics, which are important for objective standardized evaluation, are still lacking for some properties (e.g. clarity) and types of explanations (e.g. example-based methods). We conclude that explainable modelling can contribute to trustworthy AI, but the benefits of explainability still need to be proven in practice and complementary measures might be needed to create trustworthy AI in health care (e.g. reporting data quality, performing extensive (external) validation, and regulation).

Keywords: Explainable artificial intelligence; Explainable modelling; Interpretability; Post-hoc explanation; Trustworthy artificial intelligence.

Publication types

Research Support, Non-U.S. Gov't
Review

MeSH terms

Artificial Intelligence*
Delivery of Health Care*
Humans