Random survival forest model for early prediction of Alzheimer's disease conversion in early and late Mild cognitive impairment stages

PLoS One. 2024 Dec 13;19(12):e0314725. doi: 10.1371/journal.pone.0314725. eCollection 2024.

Abstract

With a clinical trial failure rate of 99.6% for Alzheimer's Disease (AD), early diagnosis is critical. Machine learning (ML) models have shown promising results in early AD prediction, with survival ML models outperforming typical classifiers by providing probabilities of disease progression over time. This study utilized various ML survival models to predict the time-to-conversion to AD for early (eMCI) and late (lMCI) Mild Cognitive Impairment stages, considering their different progression rates. ADNI data, consisting of 291 eMCI and 546 lMCI cases, was preprocessed to handle missing values and data imbalance. The models used included Random Survival Forest (RSF), Extra Survival Trees (XST), Gradient Boosting (GB), Survival Tree (ST), Cox-net, and Cox Proportional Hazard (CoxPH). We evaluated cognitive, cerebrospinal fluid (CSF) biomarkers, and neuroimaging modalities, both individually and combined, to identify the most influential features. Our results indicate that RSF outperformed traditional CoxPH and other ML models. For eMCI, RSF trained on multimodal data achieved a C-Index of 0.90 and an IBS of 0.10. For lMCI, the C-Index was 0.82 and the IBS was 0.16. Cognitive tests showed a statistically significant improvement over other modalities, underscoring their reliability in early prediction. Furthermore, RSF-generated individual survival curves from baseline data facilitate clinical decision-making, aiding clinicians in developing personalized treatment plans and implementing preventive measures to slow or prevent AD progression in prodromal stages.

MeSH terms

  • Aged
  • Aged, 80 and over
  • Alzheimer Disease* / diagnosis
  • Biomarkers
  • Cognitive Dysfunction* / diagnosis
  • Disease Progression*
  • Early Diagnosis
  • Female
  • Humans
  • Machine Learning*
  • Male
  • Neuroimaging / methods
  • Proportional Hazards Models

Substances

  • Biomarkers

Grants and funding

This work was funded by the Higher Education Commission (HEC) of Pakistan under grant number 16052/NRPU/R&D/HEC/2021-2020. The funders had no role in study design, data collection and analysis, the decision to publish, or the preparation of the manuscript.