Model-based and Model-free Machine Learning Techniques for Diagnostic Prediction and Classification of Clinical Outcomes in Parkinson's Disease

Sci Rep. 2018 May 8;8(1):7129. doi: 10.1038/s41598-018-24783-4.

Abstract

In this study, we apply a multidisciplinary approach to investigate falls in PD patients using clinical, demographic and neuroimaging data from two independent initiatives (University of Michigan and Tel Aviv Sourasky Medical Center). Using machine learning techniques, we construct predictive models to discriminate fallers and non-fallers. Through controlled feature selection, we identified the most salient predictors of patient falls including gait speed, Hoehn and Yahr stage, postural instability and gait difficulty-related measurements. The model-based and model-free analytical methods we employed included logistic regression, random forests, support vector machines, and XGboost. The reliability of the forecasts was assessed by internal statistical (5-fold) cross validation as well as by external out-of-bag validation. Four specific challenges were addressed in the study: Challenge 1, develop a protocol for harmonizing and aggregating complex, multisource, and multi-site Parkinson's disease data; Challenge 2, identify salient predictive features associated with specific clinical traits, e.g., patient falls; Challenge 3, forecast patient falls and evaluate the classification performance; and Challenge 4, predict tremor dominance (TD) vs. posture instability and gait difficulty (PIGD). Our findings suggest that, compared to other approaches, model-free machine learning based techniques provide a more reliable clinical outcome forecasting of falls in Parkinson's patients, for example, with a classification accuracy of about 70-80%.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Accidental Falls / prevention & control*
  • Aged
  • Female
  • Gait / physiology
  • Gait Disorders, Neurologic / diagnosis*
  • Gait Disorders, Neurologic / diagnostic imaging
  • Gait Disorders, Neurologic / physiopathology
  • Humans
  • Logistic Models
  • Machine Learning*
  • Male
  • Middle Aged
  • Models, Theoretical
  • Neuroimaging / methods
  • Parkinson Disease / diagnosis*
  • Parkinson Disease / diagnostic imaging
  • Parkinson Disease / physiopathology
  • Postural Balance / physiology
  • Support Vector Machine