Machine Learning For Risk Prediction After Heart Failure Emergency Department Visit or Hospital Admission Using Administrative Health Data

Nowell M Fine; Sunil V Kalmady; Weijie Sun; Russ Greiner; Jonathan G Howlett; James A White; Finlay A McAlister; Justin A Ezekowitz; Padma Kaul

doi:10.1371/journal.pdig.0000636

Machine Learning For Risk Prediction After Heart Failure Emergency Department Visit or Hospital Admission Using Administrative Health Data

PLOS Digit Health. 2024 Oct 25;3(10):e0000636. doi: 10.1371/journal.pdig.0000636. eCollection 2024 Oct.

Authors

Nowell M Fine¹, Sunil V Kalmady^{2

3}, Weijie Sun^{2

4}, Russ Greiner^{3

4}, Jonathan G Howlett¹, James A White^{1

5}, Finlay A McAlister^{2

6}, Justin A Ezekowitz^{2

7}, Padma Kaul²

Affiliations

¹ Division of Cardiology, Department of Cardiac Sciences, Libin Cardiovascular Institute, Cumming School of Medicine, University of Calgary, Calgary Alberta.
² Canadian Vigour Center, Katz Group Center for Pharmacy and Health Research, University of Alberta, Edmonton Alberta.
³ Alberta Machine Intelligence Institute, Edmonton Alberta.
⁴ Department of Computing Sciences, University of Alberta.
⁵ Stephenson Cardiac Imaging Center, Calgary Alberta.
⁶ Division of Internal Medicine, University of Alberta, Edmonton Alberta.
⁷ Division of Cardiology, Department of Medicine, University of Alberta, Edmonton Alberta.

Abstract

Aims: Patients visiting the emergency department (ED) or hospitalized for heart failure (HF) are at increased risk for subsequent adverse outcomes, however effective risk stratification remains challenging. We utilized a machine-learning (ML)-based approach to identify HF patients at risk of adverse outcomes after an ED visit or hospitalization using a large regional administrative healthcare data system.

Methods and results: Patients visiting the ED or hospitalized with HF between 2002-2016 in Alberta, Canada were included. Outcomes of interest were 30-day and 1-year HF-related ED visits, HF hospital readmission or all-cause mortality. We applied a feature extraction method using deep feature synthesis from multiple sources of health data and compared performance of a gradient boosting algorithm (CatBoost) with logistic regression modelling. The area under receiver operating characteristic curve (AUC-ROC) was used to assess model performance. We included 50,630 patients with 93,552 HF ED visits/hospitalizations. At 30-day follow-up in the holdout validation cohort, the AUC-ROC for the combined endpoint of HF ED visit, HF hospital readmission or death for the Catboost and logistic regression models was 74.16 (73.18-75.11) versus 62.25 (61.25-63.18), respectively. At 1-year follow-up corresponding values were 76.80 (76.1-77.47) versus 69.52 (68.77-70.26), respectively. AUC-ROC values for the endpoint of all-cause death alone at 30-days and 1-year follow-up were 83.21 (81.83-84.41) versus 69.53 (67.98-71.18), and 85.73 (85.14-86.29) versus 69.40 (68.57-70.26), for the CatBoost and logistic regression models, respectively.

Conclusions: ML-based modelling with deep feature synthesis provided superior risk stratification for HF patients at 30-days and 1-year follow-up after an ED visit or hospitalization using data from a large administrative regional healthcare system.

Copyright: © 2024 Fine et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Grants and funding

This study was funded through the Servier Alberta Innovation Health Fund Award in partnership with the Alberta University Hospital Foundation. The award was granted to N.M.F., J.A.E as principal investigators and R.G., J.G.H., J.A.W., F.A.M., P.K. as co-investigators. S.V.K. and W.S. received partial salary support from this grant. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.