Progression from Prediabetes to Diabetes in a Diverse U.S. Population: A Machine Learning Model

Diabetes Technol Ther. 2024 Oct;26(10):748-753. doi: 10.1089/dia.2024.0052. Epub 2024 Apr 24.

Abstract

Objective: To date, there are no widely implemented machine learning (ML) models that predict progression from prediabetes to diabetes. Addressing this knowledge gap would aid in identifying at-risk patients within this heterogeneous population who may benefit from targeted treatment and management in order to preserve glucose metabolism and prevent adverse outcomes. The objective of this study was to utilize readily available laboratory data to train and test the performance of ML-based predictive risk models for progression from prediabetes to diabetes. Methods: The study population was composed of laboratory information services data procured from a large U.S. outpatient laboratory network. The retrospective dataset was composed of 15,029 adults over a 5-year period with initial hemoglobin A1C (A1C) values between 5.0% and 6.4%. ML models were developed using random forest survival methods. The ground truth outcome was progression to A1C values indicative of diabetes (i.e., ≥6.5%) within 5 years. Results: The prediabetes risk classifier model accurately predicted A1C ≥6.5% within 5 years and achieved an area under the receiver-operator characteristic curve of 0.87. The most important predictors of progression from prediabetes to diabetes were initial A1C, initial serum glucose, A1C slope, serum glucose slope, initial HDL, HDL slope, age, and sex. Conclusions: Leveraging readily obtainable laboratory data, our ML risk classifier accurately predicts elevation in A1C associated with progression from prediabetes to diabetes. Although prospective studies are warranted, the results support the clinical utility of the model to improve timely recognition, risk stratification, and optimal management for patients with prediabetes.

Keywords: diabetes; hemoglobin A1C; machine learning; prediabetes; predictive modeling.

MeSH terms

  • Adult
  • Aged
  • Blood Glucose / analysis
  • Diabetes Mellitus / blood
  • Diabetes Mellitus / epidemiology
  • Disease Progression*
  • Female
  • Glycated Hemoglobin* / analysis
  • Humans
  • Machine Learning*
  • Male
  • Middle Aged
  • Prediabetic State* / blood
  • Retrospective Studies
  • Risk Factors
  • United States / epidemiology

Substances

  • Glycated Hemoglobin
  • hemoglobin A1c protein, human
  • Blood Glucose