Guideline-informed reinforcement learning for mechanical ventilation in critical care

Floris den Hengst; Martijn Otten; Paul Elbers; Frank van Harmelen; Vincent François-Lavet; Mark Hoogendoorn

doi:10.1016/j.artmed.2023.102742

Guideline-informed reinforcement learning for mechanical ventilation in critical care

Artif Intell Med. 2024 Jan:147:102742. doi: 10.1016/j.artmed.2023.102742. Epub 2023 Dec 1.

Authors

Floris den Hengst¹, Martijn Otten², Paul Elbers², Frank van Harmelen³, Vincent François-Lavet³, Mark Hoogendoorn³

Affiliations

¹ Department of Computer Science, Vrije Universiteit Amsterdam, De Boelelaan 1111, Amsterdam, 1081 HV, The Netherlands. Electronic address: [email protected].
² Department of Intensive Care Medicine, Center for Critical Care Computational Intelligence (C4I), Amsterdam Medical Data Science (AMDS), Amsterdam Cardiovascular Sciences (ACS), Amsterdam Institute for Infection and Immunity (AII), Amsterdam Public Health (APH), Amsterdam UMC, Vrije Universiteit, De Boelelaan 1117, Amsterdam, 1081 HV, The Netherlands.
³ Department of Computer Science, Vrije Universiteit Amsterdam, De Boelelaan 1111, Amsterdam, 1081 HV, The Netherlands.

PMID: 38184349
DOI: 10.1016/j.artmed.2023.102742

Abstract

Reinforcement Learning (RL) has recently found many applications in the healthcare domain thanks to its natural fit to clinical decision-making and ability to learn optimal decisions from observational data. A key challenge in adopting RL-based solution in clinical practice, however, is the inclusion of existing knowledge in learning a suitable solution. Existing knowledge from e.g. medical guidelines may improve the safety of solutions, produce a better balance between short- and long-term outcomes for patients and increase trust and adoption by clinicians. We present a framework for including knowledge available from medical guidelines in RL. The framework includes components for enforcing safety constraints and an approach that alters the learning signal to better balance short- and long-term outcomes based on these guidelines. We evaluate the framework by extending an existing RL-based mechanical ventilation (MV) approach with clinically established ventilation guidelines. Results from off-policy policy evaluation indicate that our approach has the potential to decrease 90-day mortality while ensuring lung protective ventilation. This framework provides an important stepping stone towards implementations of RL in clinical practice and opens up several avenues for further research.

Keywords: Clinical guidelines; Critical care; Imitation learning; Mechanical ventilation; Q-learning; Reinforcement learning.

MeSH terms

Clinical Decision-Making
Critical Care
Humans
Learning*
Reinforcement, Psychology
Respiration, Artificial*