Identification of delirium from real-world electronic health record clinical notes

Jennifer St Sauver; Sunyang Fu; Sunghwan Sohn; Susan Weston; Chun Fan; Janet Olson; Bjoerg Thorsteinsdottir; Nathan LeBrasseur; Sandeep Pagali; Walter Rocca; Hongfang Liu

doi:10.1017/cts.2023.610

J Clin Transl Sci. 2023 Aug 24;7(1):e187. doi: 10.1017/cts.2023.610. eCollection 2023.

Authors

Jennifer St Sauver¹,², Sunyang Fu³, Sunghwan Sohn³, Susan Weston³, Chun Fan⁴, Janet Olson¹, Bjoerg Thorsteinsdottir⁵, Nathan LeBrasseur⁶,⁷, Sandeep Pagali⁵, Walter Rocca¹,⁸,⁹, Hongfang Liu¹,³

Affiliations

¹ Division of Epidemiology, Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA.
² The Robert D. and Patricia E. Kern Center for the Science of Health Care Delivery, Mayo Clinic, Rochester, MN, USA.
³ Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, USA.
⁴ Division of Clinical Trials and Biostatistics, Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA.
⁵ Department of Medicine, Mayo Clinic, Rochester, MN, USA.
⁶ Robert and Arlene Kogod Center on Aging, Mayo Clinic, Rochester, MN, USA.
⁷ Department of Physical Medicine and Rehabilitation, Mayo Clinic, Rochester, MN, USA.
⁸ Department of Neurology, Mayo Clinic, Rochester, MN, USA.
⁹ Women's Health Research Center, Mayo Clinic, Rochester, MN, USA.

Abstract

Introduction: We tested the ability of our natural language processing (NLP) algorithm to identify delirium episodes in a large-scale study using real-world clinical notes.

Methods: We used the Rochester Epidemiology Project to identify persons ≥ 65 years who were hospitalized between 2011 and 2017. We identified all persons with an International Classification of Diseases code for delirium within ±14 days of a hospitalization. We independently applied our NLP algorithm to all clinical notes for this same population. We calculated rates using number of delirium episodes as the numerator and number of hospitalizations as the denominator. Rates were estimated overall, by demographic characteristics, and by year of episode, and differences were tested using Poisson regression.
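
The rate calculation described in the Methods can be sketched as follows. This is a minimal illustration, not the authors' code: the function name `rate_per_100` is hypothetical, the counts in the usage line are made-up round numbers rather than study data, and the confidence interval uses a log-scale normal approximation for a Poisson count, which is an assumption since the abstract does not state how its intervals were computed.

```python
import math


def rate_per_100(episodes, hospitalizations, z=1.96):
    """Return (rate, lower, upper) per 100 hospitalizations.

    Assumes the episode count is Poisson distributed and builds the
    interval on the log scale: SE(log rate) = 1 / sqrt(episodes).
    """
    if episodes <= 0 or hospitalizations <= 0:
        raise ValueError("episodes and hospitalizations must be positive")
    rate = 100.0 * episodes / hospitalizations
    se_log = 1.0 / math.sqrt(episodes)   # standard error of log(rate)
    factor = math.exp(z * se_log)        # multiplicative margin of error
    return rate, rate / factor, rate * factor


# Hypothetical counts for illustration only (not taken from the study).
rate, lower, upper = rate_per_100(episodes=50, hospitalizations=1000)
print(f"{rate:.2f} per 100 hospitalizations (95% CI: {lower:.2f}, {upper:.2f})")
```

Testing differences in rates across groups, as the Methods describe, would additionally require fitting a Poisson regression model with the number of hospitalizations as the exposure; that step is outside this sketch.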

Results: In total, 14,255 persons had 37,554 hospitalizations between 2011 and 2017. The code-based delirium rate was 3.02 per 100 hospitalizations (95% CI: 2.85, 3.20). The NLP-based rate was 7.36 per 100 (95% CI: 7.09, 7.64). Rates increased with age (both p < 0.0001). Code-based rates were higher in men compared to women (p = 0.03), but NLP-based rates were similar by sex (p = 0.89). Code-based rates were similar by race and ethnicity, but NLP-based rates were higher in the White population compared to the Black and Asian populations (p = 0.001). Both types of rates increased significantly over time (both p values < 0.001).

Conclusions: The NLP algorithm identified more delirium episodes than the ICD code method. However, NLP may still underestimate delirium cases because of limitations in real-world clinical notes, including incomplete documentation, practice changes over time, and missing clinical notes in some time periods.

Keywords: Delirium; International Classification of Diseases (ICD); bioinformatics; electronic health records; natural language processing algorithm.

Grants and funding

R33 AG058738/AG/NIA NIH HHS/United States