Exploring Prediabetes Pathways Using Explainable AI on Data from Electronic Medical Records

Stud Health Technol Inform. 2024 Aug 22:316:736-740. doi: 10.3233/SHTI240519.

Abstract

This study leverages data from a Canadian database of primary care Electronic Medical Records to develop machine learning models predicting type 2 diabetes mellitus (T2D), prediabetes, or normoglycemia. These models are used as a basis for extracting counterfactual explanations and derive personalized changes in biomarkers to prevent T2D onset, particularly in the still reversible prediabetic state. The models achieve satisfactory performance. Furthermore, feature importance analysis underscores the significance of fasting blood sugar and glycated hemoglobin, while counterfactuals explanations emphasize the centrality of keeping body mass index and cholesterol indicators within or close to the clinically desirable ranges. This research highlights the potential of machine learning and counterfactual explanations in guiding preventive interventions that may help slow down the progression from prediabetes to T2D on an individual basis, eventually fostering a recovery from prediabetes to a normoglycemic state.

Keywords: Counterfactual Explanations; Explainable AI (XAI); Personalized Prevention; Prediabetes; Type 2 Diabetes.

MeSH terms

  • Biomarkers / blood
  • Canada
  • Diabetes Mellitus, Type 2*
  • Electronic Health Records*
  • Humans
  • Machine Learning*
  • Prediabetic State*

Substances

  • Biomarkers