High dimensional classification of structural MRI Alzheimer's disease data based on large scale regularization

Front Neuroinform. 2011 Oct 14:5:22. doi: 10.3389/fninf.2011.00022. eCollection 2011.

Abstract

In this work we use a large scale regularization approach based on penalized logistic regression to automatically classify structural MRI images (sMRI) according to cognitive status. Its performance is illustrated using sMRI data from the Alzheimer Disease Neuroimaging Initiative (ADNI) clinical database. We downloaded sMRI data from 98 subjects (49 cognitive normal and 49 patients) matched by age and sex from the ADNI website. Images were segmented and normalized using SPM8 and ANTS software packages. Classification was performed using GLMNET library implementation of penalized logistic regression based on coordinate-wise descent optimization techniques. To avoid optimistic estimates classification accuracy, sensitivity, and specificity were determined based on a combination of three-way split of the data with nested 10-fold cross-validations. One of the main features of this approach is that classification is performed based on large scale regularization. The methodology presented here was highly accurate, sensitive, and specific when automatically classifying sMRI images of cognitive normal subjects and Alzheimer disease (AD) patients. Higher levels of accuracy, sensitivity, and specificity were achieved for gray matter (GM) volume maps (85.7, 82.9, and 90%, respectively) compared to white matter volume maps (81.1, 80.6, and 82.5%, respectively). We found that GM and white matter tissues carry useful information for discriminating patients from cognitive normal subjects using sMRI brain data. Although we have demonstrated the efficacy of this voxel-wise classification method in discriminating cognitive normal subjects from AD patients, in principle it could be applied to any clinical population.

Keywords: ADNI; GLMNET; curse of dimensionality; elastic net; high dimensional; large scale regularization; logistic regression.