Global and Regional Deep Learning Models for Multiple Sclerosis Stratification From MRI

J Magn Reson Imaging. 2024 Jul;60(1):258-267. doi: 10.1002/jmri.29046. Epub 2023 Oct 6.

Abstract

Background: The combination of anatomical MRI and deep learning-based methods such as convolutional neural networks (CNNs) is a promising strategy to build predictive models of multiple sclerosis (MS) prognosis. However, studies assessing the effect of different input strategies on model's performance are lacking.

Purpose: To compare whole-brain input sampling strategies and regional/specific-tissue strategies, which focus on a priori known relevant areas for disability accrual, to stratify MS patients based on their disability level.

Study type: Retrospective.

Subjects: Three hundred nineteen MS patients (382 brain MRI scans) with clinical assessment of disability level performed within the following 6 months (~70% training/~15% validation/~15% inference in-house dataset) and 440 MS patients from multiple centers (independent external validation cohort).

Field strength/sequence: Single vendor 1.5 T or 3.0 T. Magnetization-Prepared Rapid Gradient-Echo and Fluid-Attenuated Inversion Recovery sequences.

Assessment: A 7-fold patient cross validation strategy was used to train a 3D-CNN to classify patients into two groups, Expanded Disability Status Scale score (EDSS) ≥ 3.0 or EDSS < 3.0. Two strategies were investigated: 1) a global approach, taking the whole brain volume as input and 2) regional approaches using five different regions-of-interest: white matter, gray matter, subcortical gray matter, ventricles, and brainstem structures. The performance of the models was assessed in the in-house and the independent external cohorts.

Statistical tests: Balanced accuracy, sensitivity, specificity, area under receiver operating characteristic (ROC) curve (AUC).

Results: With the in-house dataset, the gray matter regional model showed the highest stratification accuracy (81%), followed by the global approach (79%). In the external dataset, without any further retraining, an accuracy of 72% was achieved for the white matter model and 71% for the global approach.

Data conclusion: The global approach offered the best trade-off between internal performance and external validation to stratify MS patients based on accumulated disability.

Evidence level: 4 TECHNICAL EFFICACY: Stage 2.

Keywords: classification; deep learning; input sampling; multiple sclerosis; structural MRI.

MeSH terms

  • Adult
  • Brain* / diagnostic imaging
  • Deep Learning*
  • Female
  • Humans
  • Image Processing, Computer-Assisted / methods
  • Magnetic Resonance Imaging* / methods
  • Male
  • Middle Aged
  • Multiple Sclerosis* / diagnostic imaging
  • Neural Networks, Computer
  • Prognosis
  • ROC Curve
  • Reproducibility of Results
  • Retrospective Studies