Ensemble of random forests One vs. Rest classifiers for MCI and AD prediction using ANOVA cortical and subcortical feature selection and partial least squares

J Neurosci Methods. 2018 May 15:302:47-57. doi: 10.1016/j.jneumeth.2017.12.005. Epub 2017 Dec 11.

Abstract

Background: Alzheimer's disease (AD) is the most common cause of dementia in the elderly and affects approximately 30 million individuals worldwide. Mild cognitive impairment (MCI) is very frequently a prodromal phase of AD, and existing studies have suggested that people with MCI tend to progress to AD at a rate of about 10-15% per year. However, the ability of clinicians and machine learning systems to predict AD based on MRI biomarkers at an early stage is still a challenging problem that can have a great impact in improving treatments.

Method: The proposed system, developed by the SiPBA-UGR team for this challenge, is based on feature standardization, ANOVA feature selection, partial least squares feature dimension reduction and an ensemble of One vs. Rest random forest classifiers. With the aim of improving its performance when discriminating healthy controls (HC) from MCI, a second binary classification level was introduced that reconsiders the HC and MCI predictions of the first level.

Results: The system was trained and evaluated on an ADNI datasets that consist of T1-weighted MRI morphological measurements from HC, stable MCI, converter MCI and AD subjects. The proposed system yields a 56.25% classification score on the test subset which consists of 160 real subjects.

Comparison with existing method(s): The classifier yielded the best performance when compared to: (i) One vs. One (OvO), One vs. Rest (OvR) and error correcting output codes (ECOC) as strategies for reducing the multiclass classification task to multiple binary classification problems, (ii) support vector machines, gradient boosting classifier and random forest as base binary classifiers, and (iii) bagging ensemble learning.

Conclusions: A robust method has been proposed for the international challenge on MCI prediction based on MRI data. The system yielded the second best performance during the competition with an accuracy rate of 56.25% when evaluated on the real subjects of the test set.

Keywords: ANOVA feature selection; Alzheimer's disease; Bagging; Computer-aided diagnosis; Machine learning; Magnetic resonance imaging; Mild cognitive impairment; One vs. Rest classification; Partial least squares; Random forests.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aged
  • Alzheimer Disease / classification*
  • Alzheimer Disease / diagnostic imaging*
  • Alzheimer Disease / pathology
  • Analysis of Variance
  • Brain / diagnostic imaging*
  • Brain / pathology
  • Cognitive Dysfunction / classification*
  • Cognitive Dysfunction / diagnostic imaging*
  • Cognitive Dysfunction / pathology
  • Databases, Factual
  • Decision Trees
  • Disease Progression
  • Female
  • Humans
  • Image Interpretation, Computer-Assisted / methods
  • Least-Squares Analysis
  • Machine Learning*
  • Magnetic Resonance Imaging*
  • Male
  • Pattern Recognition, Automated