A Machine Learning Prediction Model of Respiratory Failure Within 48 Hours of Patient Admission for COVID-19: Model Development and Validation

J Med Internet Res. 2021 Feb 10;23(2):e24246. doi: 10.2196/24246.

Abstract

Background: Predicting early respiratory failure due to COVID-19 can help triage patients to higher levels of care, allocate scarce resources, and reduce morbidity and mortality by appropriately monitoring and treating the patients at greatest risk for deterioration. Given the complexity of COVID-19, machine learning approaches may support clinical decision making for patients with this disease.

Objective: Our objective is to derive a machine learning model that predicts respiratory failure within 48 hours of admission based on data from the emergency department.

Methods: Data were collected from patients with COVID-19 who were admitted to Northwell Health acute care hospitals and were discharged, died, or spent a minimum of 48 hours in the hospital between March 1 and May 11, 2020. Of 11,525 patients, 933 (8.1%) were placed on invasive mechanical ventilation within 48 hours of admission. Variables used by the models included clinical and laboratory data commonly collected in the emergency department. We trained and validated three predictive models (two based on XGBoost and one that used logistic regression) using cross-hospital validation. We compared model performance among all three models as well as an established early warning score (Modified Early Warning Score) using receiver operating characteristic curves, precision-recall curves, and other metrics.

Results: The XGBoost model had the highest mean accuracy (0.919; area under the curve=0.77), outperforming the other two models as well as the Modified Early Warning Score. Important predictor variables included the type of oxygen delivery used in the emergency department, patient age, Emergency Severity Index level, respiratory rate, serum lactate, and demographic characteristics.

Conclusions: The XGBoost model had high predictive accuracy, outperforming other early warning scores. The clinical plausibility and predictive ability of XGBoost suggest that the model could be used to predict 48-hour respiratory failure in admitted patients with COVID-19.

Keywords: COVID-19; artificial intelligence; development; machine learning; model; modeling; pandemic; prognostic; severe acute respiratory syndrome coronavirus 2; validation.

Publication types

  • Observational Study
  • Validation Study

MeSH terms

  • Aged
  • COVID-19 / complications
  • COVID-19 / physiopathology*
  • Clinical Decision Rules
  • Early Warning Score
  • Emergency Service, Hospital
  • Female
  • Hospitalization*
  • Hospitals
  • Humans
  • Intubation, Intratracheal / statistics & numerical data*
  • Logistic Models
  • Machine Learning*
  • Male
  • Middle Aged
  • Patient Admission
  • ROC Curve
  • Respiration, Artificial / statistics & numerical data*
  • Respiratory Insufficiency / epidemiology*
  • Respiratory Insufficiency / etiology
  • Retrospective Studies
  • SARS-CoV-2
  • Triage