A Comparison of XGBoost, Random Forest, and Nomograph for the Prediction of Disease Severity in Patients With COVID-19 Pneumonia: Implications of Cytokine and Immune Cell Profile

Wandong Hong; Xiaoying Zhou; Shengchun Jin; Yajing Lu; Jingyi Pan; Qingyi Lin; Shaopeng Yang; Tingting Xu; Zarrin Basharat; Maddalena Zippi; Sirio Fiorino; Vladislav Tsukanov; Simon Stock; Alfonso Grottesi; Qin Chen; Jingye Pan

doi:10.3389/fcimb.2022.819267

A Comparison of XGBoost, Random Forest, and Nomograph for the Prediction of Disease Severity in Patients With COVID-19 Pneumonia: Implications of Cytokine and Immune Cell Profile

Front Cell Infect Microbiol. 2022 Apr 12:12:819267. doi: 10.3389/fcimb.2022.819267. eCollection 2022.

Authors

Affiliations

¹ Department of Gastroenterology and Hepatology, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China.
² School of the First Clinical Medical Sciences, Wenzhou Medical University, Wenzhou, China.
³ Jamil-ur-Rahman Center for Genome Research, Dr. Panjwani Centre for Molecular Medicine and Drug Research, International Center for Chemical and Biological Sciences, University of Karachi, Karachi, Pakistan.
⁴ Unit of Gastroenterology and Digestive Endoscopy, Sandro Pertini Hospital, Rome, Italy.
⁵ Internal Medicine Unit, Budrio Hospital, Bologna, Italy.
⁶ Department of Gastroenterology, Scientific Research Institute of Medical Problems of the North, Krasnoyarsk, Russia.
⁷ Department of Surgery, World Mate Emergency Hospital, Battambang, Cambodia.
⁸ Unit of General Surgery, Sandro Pertini Hospital, Rome, Italy.
⁹ Department of Intensive Care Unit, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China.

Abstract

Background and aims: The aim of this study was to apply machine learning models and a nomogram to differentiate critically ill from non-critically ill COVID-19 pneumonia patients.

Methods: Clinical symptoms and signs, laboratory parameters, cytokine profile, and immune cellular data of 63 COVID-19 pneumonia patients were retrospectively reviewed. Outcomes were followed up until Mar 12, 2020. A logistic regression function (LR model), Random Forest, and XGBoost models were developed. The performance of these models was measured by area under receiver operating characteristic curve (AUC) analysis.

Results: Univariate analysis revealed that there was a difference between critically and non-critically ill patients with respect to levels of interleukin-6, interleukin-10, T cells, CD4⁺ T, and CD8⁺ T cells. Interleukin-10 with an AUC of 0.86 was most useful predictor of critically ill patients with COVID-19 pneumonia. Ten variables (respiratory rate, neutrophil counts, aspartate transaminase, albumin, serum procalcitonin, D-dimer and B-type natriuretic peptide, CD4⁺ T cells, interleukin-6 and interleukin-10) were used as candidate predictors for LR model, Random Forest (RF) and XGBoost model application. The coefficients from LR model were utilized to build a nomogram. RF and XGBoost methods suggested that Interleukin-10 and interleukin-6 were the most important variables for severity of illness prediction. The mean AUC for LR, RF, and XGBoost model were 0.91, 0.89, and 0.93 respectively (in two-fold cross-validation). Individualized prediction by XGBoost model was explained by local interpretable model-agnostic explanations (LIME) plot.

Conclusions: XGBoost exhibited the highest discriminatory performance for prediction of critically ill patients with COVID-19 pneumonia. It is inferred that the nomogram and visualized interpretation with LIME plot could be useful in the clinical setting. Additionally, interleukin-10 could serve as a useful predictor of critically ill patients with COVID-19 pneumonia.

Keywords: COVID-19; critically ill; infection; machine learning; pneumonia; predictor; severity.

Publication types

Comparative Study

MeSH terms

CD8-Positive T-Lymphocytes
COVID-19* / diagnosis
Critical Illness
Cytokines
Humans
Interleukin-10*
Interleukin-6
Nomograms
Patient Acuity
Retrospective Studies
Severity of Illness Index

Substances

Cytokines
Interleukin-6
Interleukin-10