Ensemble-imbalance-based classification for amyotrophic lateral sclerosis prognostic prediction: identifying short-survival patients at diagnosis

BMC Med Inform Decis Mak. 2024 Mar 19;24(1):80. doi: 10.1186/s12911-024-02484-5.

Abstract

Prognosticating Amyotrophic Lateral Sclerosis (ALS) presents a formidable challenge due to patients exhibiting different onset sites, progression rates, and survival times. In this study, we have developed and evaluated Machine Learning (ML) algorithms that integrate Ensemble and Imbalance Learning techniques to classify patients into Short and Non-Short survival groups based on data collected during diagnosis. We aimed to identify individuals at high risk of mortality within 24 months of symptom onset through analysis of patient data commonly encountered in daily clinical practice. Our Ensemble-Imbalance approach underwent evaluation employing six ML algorithms as base classifiers. Remarkably, our results outperformed those of individual algorithms, achieving a Balanced Accuracy of 88% and a Sensitivity of 96%. Additionally, we used the Shapley Additive Explanations framework to elucidate the decision-making process of the top-performing model, pinpointing the most important features and their correlations with the target prediction. Furthermore, we presented helpful tools to visualize and compare patient similarities, offering valuable insights. Confirming the obtained results, our approach could aid physicians in devising personalized treatment plans at the time of diagnosis or serve as an inclusion/exclusion criterion in clinical trials.

Keywords: Amyotrophic lateral sclerosis; Health informatics; Machine learning; Prognosis.

MeSH terms

  • Amyotrophic Lateral Sclerosis* / diagnosis
  • Amyotrophic Lateral Sclerosis* / drug therapy
  • Humans
  • Machine Learning
  • Prognosis