Practical guide to SHAP analysis: Explaining supervised machine learning model predictions in drug development

Ana Victoria Ponce-Bobadilla; Vanessa Schmitt; Corinna S Maier; Sven Mensing; Sven Stodtmann

doi:10.1111/cts.70056

Practical guide to SHAP analysis: Explaining supervised machine learning model predictions in drug development

Clin Transl Sci. 2024 Nov;17(11):e70056. doi: 10.1111/cts.70056.

Authors

Ana Victoria Ponce-Bobadilla¹, Vanessa Schmitt¹, Corinna S Maier¹, Sven Mensing¹, Sven Stodtmann¹

Affiliation

¹ AbbVie Deutschland GmbH & Co. KG, Ludwigshafen, Germany.

PMID: 39463176
DOI: 10.1111/cts.70056

Abstract

Despite increasing interest in using Artificial Intelligence (AI) and Machine Learning (ML) models for drug development, effectively interpreting their predictions remains a challenge, which limits their impact on clinical decisions. We address this issue by providing a practical guide to SHapley Additive exPlanations (SHAP), a popular feature-based interpretability method, which can be seamlessly integrated into supervised ML models to gain a deeper understanding of their predictions, thereby enhancing their transparency and trustworthiness. This tutorial focuses on the application of SHAP analysis to standard ML black-box models for regression and classification problems. We provide an overview of various visualization plots and their interpretation, available software for implementing SHAP, and highlight best practices, as well as special considerations, when dealing with binary endpoints and time-series models. To enhance the reader's understanding for the method, we also apply it to inherently explainable regression models. Finally, we discuss the limitations and ongoing advancements aimed at tackling the current drawbacks of the method.

Publication types

Review

MeSH terms

Artificial Intelligence
Drug Development* / methods
Humans
Maschinelles Lernen
Software
Supervised Machine Learning*