Machine Learning Models for Cardiovascular Disease Events Prediction

Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul:2022:1066-1069. doi: 10.1109/EMBC48229.2022.9871121.

Abstract

Cardiovascular diseases (CVDs) are among the most serious disorders leading to high mortality rates worldwide. CVDs can be diagnosed and prevented early by identifying risk biomarkers using statistical and machine learning (ML) models, In this work, we utilize clinical CVD risk factors and biochemical data using machine learning models such as Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), Naïve Bayes (NB), Extreme Grading Boosting (XGB) and Adaptive Boosting (AdaBoost) to predict death caused by CVD within ten years of follow-up. We used the cohort of the Ludwigshafen Risk and Cardiovascular Health (LURIC) study and 2943 patients were included in the analysis (484 annotated as dead due to CVD). We calculated the Accuracy (ACC), Precision, Recall, F1-Score, Specificity (SPE) and area under the receiver operating characteristic curve (AUC) of each model. The findings of the comparative analysis show that Logistic Regression has been proven to be the most reliable algorithm having accuracy 72.20 %. These results will be used in the TIMELY study to estimate the risk score and mortality of CVD in patients with 10-year risk.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem
  • Cardiovascular Diseases* / diagnosis
  • Humans
  • Machine Learning
  • Risk Factors
  • Support Vector Machine