AI as an intervention: improving clinical outcomes relies on a causal approach to AI development and validation

Shalmali Joshi; Iñigo Urteaga; Wouter A C van Amsterdam; George Hripcsak; Pierre Elias; Benjamin Recht; Noémie Elhadad; James Fackler; Mark P Sendak; Jenna Wiens; Kaivalya Deshpande; Yoav Wald; Madalina Fiterau; Zachary Lipton; Daniel Malinsky; Madhur Nayan; Hongseok Namkoong; Soojin Park; Julia E Vogt; Rajesh Ranganath

doi:10.1093/jamia/ocae301

J Am Med Inform Assoc. 2025 Jan 7:ocae301. doi: 10.1093/jamia/ocae301. Online ahead of print.

Authors

Shalmali Joshi¹, Iñigo Urteaga²,³, Wouter A C van Amsterdam⁴, George Hripcsak¹, Pierre Elias¹,⁵, Benjamin Recht⁶, Noémie Elhadad¹, James Fackler⁷, Mark P Sendak⁸, Jenna Wiens⁹, Kaivalya Deshpande¹⁰, Yoav Wald¹¹, Madalina Fiterau¹², Zachary Lipton¹³, Daniel Malinsky¹⁴, Madhur Nayan¹⁵, Hongseok Namkoong¹⁶, Soojin Park¹, Julia E Vogt¹⁷, Rajesh Ranganath¹¹,¹⁸

Affiliations

¹ Department of Biomedical Informatics, Columbia University, New York, NY 10032, United States.
² BCAM-Basque Center for Applied Mathematics, Bilbao 48009, Spain.
³ IKERBASQUE-Basque Foundation for Science, Bilbao 48009, Spain.
⁴ Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht 3584 CX, The Netherlands.
⁵ Division of Cardiology, Columbia University, New York, NY 10032, United States.
⁶ Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA 94720, United States.
⁷ Department of Anesthesiology and Critical Care Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21287, United States.
⁸ Population Health and Data Science, Duke Institute of Health Innovation, Durham, NC 27701, United States.
⁹ Department of Computer Science and Engineering, University of Michigan, Ann Arbor, Ann Arbor, MI 48109, United States.
¹⁰ Department of Medicine, NYU Grossman School of Medicine, New York, NY 10016, United States.
¹¹ Center for Data Science, New York University, New York, NY 10011, United States.
¹² College of Information and Computer Sciences, University of Massachusetts, Amherst, Amherst, MA 01003, United States.
¹³ Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA 15213, United States.
¹⁴ Department of Biostatistics, Columbia University, New York, NY 10032, United States.
¹⁵ Department of Population Health and Urology, NYU Grossman School of Medicine, New York, NY 10016, United States.
¹⁶ Division of Decisions, Risk, and Operations, Columbia Business School, New York, NY 10027, United States.
¹⁷ Department of Computer Science, ETH Zurich, Zurich 8092, Switzerland.
¹⁸ Department of Computer Science, New York University, New York, NY 10012, United States.

PMID: 39775871
DOI: 10.1093/jamia/ocae301

Abstract

The primary practice of healthcare artificial intelligence (AI) starts with model development, often using state-of-the-art AI, which is then retrospectively evaluated using metrics lifted from the AI literature, such as AUROC and Dice score. However, good performance on these metrics may not translate to improved clinical outcomes. Instead, we argue for a better development pipeline, constructed by working backward from the end goal of positively impacting clinically relevant outcomes using AI, which leads to considerations of causality in model development and validation. Healthcare AI should be "actionable," and the change in actions induced by AI should improve outcomes. Quantifying the effect of changes in actions on outcomes is causal inference. The development, evaluation, and validation of healthcare AI should therefore account for the causal effect of intervening with the AI on clinically relevant outcomes. Using a causal lens, we make recommendations for key stakeholders at various stages of the healthcare AI pipeline. Our recommendations aim to increase the positive impact of AI on clinical outcomes.
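As a minimal illustration of what "the causal effect of intervening with the AI" could mean in practice, the sketch below estimates an average treatment effect as the difference in mean outcome between an AI-assisted arm and a control arm. It is not taken from the article. The function name `average_treatment_effect`, the randomized-assignment assumption, and the toy data are all hypothetical, and the difference in means has a causal interpretation only if assignment to the AI-assisted arm was randomized.

```python
from statistics import mean


def average_treatment_effect(outcomes, ai_assisted):
    """Difference in mean outcome between AI-assisted and control arms.

    Assumption: patients were randomized to the AI-assisted arm, so the
    difference in means can be read as a causal effect. With observational
    data this estimate would generally be confounded.
    """
    treated = [y for y, a in zip(outcomes, ai_assisted) if a]
    control = [y for y, a in zip(outcomes, ai_assisted) if not a]
    if not treated or not control:
        raise ValueError("both arms need at least one patient")
    return mean(treated) - mean(control)


# Hypothetical toy data, for illustration only:
# outcome 1 = good clinical outcome, 0 = poor clinical outcome;
# ai_assisted 1 = clinician saw the AI output, 0 = usual care.
outcomes = [1, 0, 1, 1, 0, 0, 1, 0]
ai_assisted = [1, 1, 1, 1, 0, 0, 0, 0]

effect = average_treatment_effect(outcomes, ai_assisted)
print(f"Estimated effect of AI assistance on outcome rate: {effect:+.2f}")
```

The quantity being estimated here is a change in a clinical outcome, not a predictive metric such as AUROC: a model could score well on AUROC and still yield an effect near zero if its output does not change clinical actions.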

Keywords: artificial intelligence; causal inference; healthcare.

Grants and funding

Machine Learning for Healthcare