Development and validation of a cardiovascular risk prediction model for Sri Lankans using machine learning

PLoS One. 2024 Oct 22;19(10):e0309843. doi: 10.1371/journal.pone.0309843. eCollection 2024.

Abstract

Introduction and objectives: Sri Lankans do not have a specific cardiovascular (CV) risk prediction model; therefore, the World Health Organization (WHO) risk charts developed for the Southeast Asia Region are used. We aimed to develop a CV risk prediction model specific to Sri Lankans by applying machine learning (ML) to data from a population-based, randomly selected cohort of Sri Lankans followed up for 10 years, and to validate it in an external cohort.

Material and methods: The cohort consisted of 2596 individuals aged 40 to 65 years in 2007, who were followed up for 10 years. Of them, 179 developed hard CV diseases (CVD) by 2017. We developed three CV risk prediction models, named model 1, model 2 and model 3, using ML. We compared the predictive performance of the models and the WHO risk charts using receiver operating characteristic (ROC) curves. Model 3, the most predictive and practical model for use in primary care, was named the "SLCVD score"; it uses age, sex, smoking status, systolic blood pressure, history of diabetes, and total cholesterol level in the calculation. We developed an online platform to calculate the SLCVD score. Predictions of the SLCVD score were validated in an external hospital-based cohort.
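As a rough illustration of the ROC-based comparison described in the methods, the sketch below computes an area under the ROC curve from risk scores using its pairwise-ranking definition (the probability that a randomly chosen person with an event is scored higher than a randomly chosen person without one). The function name `roc_auc` and the four-person toy data are assumptions made for illustration only; they are not the study's data, and the abstract does not state which software or method the authors used.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise comparison.

    labels: 1 if the person had an event, 0 otherwise.
    scores: predicted risk for each person (higher means higher risk).
    """
    event_scores = [s for y, s in zip(labels, scores) if y == 1]
    no_event_scores = [s for y, s in zip(labels, scores) if y == 0]
    if not event_scores or not no_event_scores:
        raise ValueError("need at least one event and one non-event")

    wins = 0.0
    for e in event_scores:
        for n in no_event_scores:
            if e > n:
                wins += 1.0   # event correctly ranked above non-event
            elif e == n:
                wins += 0.5   # ties count as half
    return wins / (len(event_scores) * len(no_event_scores))


# Made-up toy data (NOT from the study): two non-events, two events.
toy_labels = [0, 0, 1, 1]
toy_scores = [0.10, 0.40, 0.35, 0.80]
print(roc_auc(toy_labels, toy_scores))  # 0.75
```

An AUC of 0.5 corresponds to ranking no better than chance, and 1.0 to perfect separation of people with and without events.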

Results: Model 1, model 2, the SLCVD score and the WHO risk charts predicted 173, 162, 169 and 10 of the 179 observed events, and the areas under the ROC curve (AUC) were 0.98, 0.98, 0.98 and 0.52, respectively. During external validation, the SLCVD score and the WHO risk charts predicted 56 and 18, respectively, of the 119 total events, and the AUCs were 0.64 and 0.54, respectively.
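The event counts above can be turned into proportions to make the comparison easier to read. The sketch below does only that arithmetic. Reading "predicted X of Y events" as the share of observed events that a model flagged is an assumption, and the helper name `proportion_detected` is hypothetical; the abstract does not report the risk threshold used or the number of false positives, so specificity cannot be derived from these figures.

```python
def proportion_detected(predicted_events: int, observed_events: int) -> float:
    """Share of observed events that a model predicted."""
    if observed_events <= 0:
        raise ValueError("observed_events must be positive")
    return predicted_events / observed_events


# (predicted, observed) counts copied from the Results paragraph.
derivation_cohort = {
    "Model 1": (173, 179),
    "Model 2": (162, 179),
    "SLCVD score": (169, 179),
    "WHO risk charts": (10, 179),
}
external_cohort = {
    "SLCVD score": (56, 119),
    "WHO risk charts": (18, 119),
}

for cohort_name, cohort in [("Derivation", derivation_cohort),
                            ("External validation", external_cohort)]:
    print(cohort_name)
    for model, (predicted, observed) in cohort.items():
        share = proportion_detected(predicted, observed)
        print(f"  {model}: {predicted}/{observed} = {share:.1%}")
```

On these counts the SLCVD score flagged about 94% of events in the derivation cohort but about 47% in the external cohort, which mirrors the drop in AUC from 0.98 to 0.64.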

Conclusions: The SLCVD score is the first and only CV risk prediction model specific to Sri Lankans. It predicts the 10-year risk of developing a hard CVD in Sri Lankans. The SLCVD score was more effective than the WHO risk charts in identifying Sri Lankans at high CV risk.

Publication types

  • Validation Study

MeSH terms

  • Adult
  • Aged
  • Cardiovascular Diseases* / epidemiology
  • Cohort Studies
  • Female
  • Heart Disease Risk Factors
  • Humans
  • Machine Learning*
  • Male
  • Middle Aged
  • ROC Curve
  • Risk Assessment / methods
  • Risk Factors
  • Sri Lanka / epidemiology

Grants and funding

This study was supported by the Strengthening Research Outputs Grant of the University of Kelaniya, Sri Lanka (RC/SROG/2021/01). The funding bodies played no role in the design of the study, collection, analysis, and interpretation of data or in writing the manuscript.