[Research on algorithms for identifying the severity of acute respiratory distress syndrome patients based on noninvasive parameters]

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2019 Jun 25;36(3):435-443. doi: 10.7507/1001-5515.201801081.
[Article in Chinese]

Abstract

Acute respiratory distress syndrome (ARDS) is a serious threat to human life and health disease, with acute onset and high mortality. The current diagnosis of the disease depends on blood gas analysis results, while calculating the oxygenation index. However, blood gas analysis is an invasive operation, and can't continuously monitor the development of the disease. In response to the above problems, in this study, we proposed a new algorithm for identifying the severity of ARDS disease. Based on a variety of non-invasive physiological parameters of patients, combined with feature selection techniques, this paper sorts the importance of various physiological parameters. The cross-validation technique was used to evaluate the identification performance. The classification results of four supervised learning algorithms using neural network, logistic regression, AdaBoost and Bagging were compared under different feature subsets. The optimal feature subset and classification algorithm are comprehensively selected by the sensitivity, specificity, accuracy and area under curve (AUC) of different algorithms under different feature subsets. We use four supervised learning algorithms to distinguish the severity of ARDS (P/F ≤ 300). The performance of the algorithm is evaluated according to AUC. When AdaBoost uses 20 features, AUC = 0.832 1, the accuracy is 74.82%, and the optimal AUC is obtained. The performance of the algorithm is evaluated according to the number of features. When using 2 features, Bagging has AUC = 0.819 4 and the accuracy is 73.01%. Compared with traditional methods, this method has the advantage of continuously monitoring the development of patients with ARDS and providing medical staff with auxiliary diagnosis suggestions.

急性呼吸窘迫综合征(ARDS)是一种严重威胁人类生命健康的疾病,具有起病急、病死率高等特点。目前这种疾病的主要诊断和疾病严重程度分级标准依赖于血气分析结果,从而计算患者的氧合指数(PaO 2/FiO 2,P/F),但是血气分析是有创操作,且不能连续监测病情的发展。针对以上问题,我们提出了一种新的 ARDS 疾病严重程度的辨识算法。基于患者的多种无创生理参数,结合特征选择技术,对多种生理参数进行重要性排序。利用交叉验证技术评估辨识性能,比较不同特征子集下,使用神经网络、逻辑回归、AdaBoost、Bagging 四种监督学习算法的分类结果。通过不同特征子集下不同算法的敏感性、特异性、准确率、曲线下面积(AUC)来综合选择最优的特征子集和分类算法。我们利用四种监督学习算法,对 ARDS 严重程度进行区分(P/F ≤ 300)。根据 AUC 来评估算法性能,AdaBoost 在使用 20 个特征时,AUC = 0.832 1,准确率为 74.82%,取得了最优的 AUC。根据特征个数来评估算法性能,Bagging 在使用 2 个特征时,AUC = 0.819 4,准确率为 73.01%。该方法相较于传统方法有较大的优势,能够连续监测 ARDS 患者的病情发展,为医务人员提供辅助诊断建议。.

Keywords: acute respiratory distress syndrome; integrated learning; machine learning; non-invasive identification.

MeSH terms

  • Algorithms*
  • Area Under Curve
  • Blood Gas Analysis
  • Humans
  • Machine Learning*
  • Monitoring, Physiologic / methods*
  • ROC Curve
  • Respiratory Distress Syndrome / diagnosis*
  • Sensitivity and Specificity

Grants and funding

国家重点研发计划(2017YFC0806402);国家自然科学基金项目(81501551);军事医学创新工程专项(16CXZ034)