Development and Validation of a Routine Electronic Health Record-Based Delirium Prediction Model for Surgical Patients Without Dementia: Retrospective Case-Control Study

Emma Holler; Christina Ludema; Zina Ben Miled; Molly Rosenberg; Corey Kalbaugh; Malaz Boustani; Sanjay Mohanty

doi:10.2196/59422

Development and Validation of a Routine Electronic Health Record-Based Delirium Prediction Model for Surgical Patients Without Dementia: Retrospective Case-Control Study

JMIR Perioper Med. 2025 Jan 9:8:e59422. doi: 10.2196/59422.

Authors

Emma Holler¹, Christina Ludema², Zina Ben Miled³, Molly Rosenberg², Corey Kalbaugh², Malaz Boustani⁴, Sanjay Mohanty¹

Affiliations

¹ Department of Surgery, Indiana University School of Medicine, Indianapolis, IN, United States.
² Department of Epidemiology & Biostatistics, Indiana University Bloomington, Bloomington, United States.
³ Department of Electrical & Computer Engineering, Lamar University, Beaumont, TX, United States.
⁴ Department of Medicine, Indiana University School of Medicine, Indianapolis, IN, United States.

PMID: 39786865
DOI: 10.2196/59422

Abstract

Background: Postoperative delirium (POD) is a common complication after major surgery and is associated with poor outcomes in older adults. Early identification of patients at high risk of POD can enable targeted prevention efforts. However, existing POD prediction models require inpatient data collected during the hospital stay, which delays predictions and limits scalability.

Objective: This study aimed to develop and externally validate a machine learning-based prediction model for POD using routine electronic health record (EHR) data.

Methods: We identified all surgical encounters from 2014 to 2021 for patients aged 50 years and older who underwent an operation requiring general anesthesia, with a length of stay of at least 1 day at 3 Indiana hospitals. Patients with preexisting dementia or mild cognitive impairment were excluded. POD was identified using Confusion Assessment Method records and delirium International Classification of Diseases (ICD) codes. Controls without delirium or nurse-documented confusion were matched to cases by age, sex, race, and year of admission. We trained logistic regression, random forest, extreme gradient boosting (XGB), and neural network models to predict POD using 143 features derived from routine EHR data available at the time of hospital admission. Separate models were developed for each hospital using surveillance periods of 3 months, 6 months, and 1 year before admission. Model performance was evaluated using the area under the receiver operating characteristic curve (AUROC). Each model was internally validated using holdout data and externally validated using data from the other 2 hospitals. Calibration was assessed using calibration curves.

Results: The study cohort included 7167 delirium cases and 7167 matched controls. XGB outperformed all other classifiers. AUROCs were highest for XGB models trained on 12 months of preadmission data. The best-performing XGB model achieved a mean AUROC of 0.79 (SD 0.01) on the holdout set, which decreased to 0.69-0.74 (SD 0.02) when externally validated on data from other hospitals.

Conclusions: Our routine EHR-based POD prediction models demonstrated good predictive ability using a limited set of preadmission and surgical variables, though their generalizability was limited. The proposed models could be used as a scalable, automated screening tool to identify patients at high risk of POD at the time of hospital admission.

Keywords: algorithm; delirium; electronic health records; machine learning; postoperative; prediction; risk prediction; surgery.

©Emma Holler, Christina Ludema, Zina Ben Miled, Molly Rosenberg, Corey Kalbaugh, Malaz Boustani, Sanjay Mohanty. Originally published in JMIR Perioperative Medicine (http://periop.jmir.org), 09.01.2025.

Publication types

Validation Study

MeSH terms

Aged
Aged, 80 and over
Case-Control Studies
Delirium* / diagnosis
Delirium* / epidemiology
Delirium* / etiology
Dementia / diagnosis
Dementia / epidemiology
Electronic Health Records* / statistics & numerical data
Female
Humans
Indiana / epidemiology
Machine Learning
Male
Middle Aged
Postoperative Complications* / diagnosis
Postoperative Complications* / epidemiology
Postoperative Complications* / etiology
Predictive Value of Tests
Retrospective Studies
Risk Assessment / methods