Detecting outlying subjects in high-dimensional neuroimaging datasets with regularized minimum covariance determinant

Med Image Comput Comput Assist Interv. 2011;14(Pt 3):264-71. doi: 10.1007/978-3-642-23626-6_33.

Abstract

Medical imaging datasets used in clinical studies or basic research often comprise highly variable multi-subject data. Statistically-controlled inclusion of a subject in a group study, i.e. deciding whether its images should be considered as samples from a given population or whether they should be rejected as outlier data, is a challenging issue. While the informal approaches often used do not provide any statistical assessment that a given dataset is indeed an outlier, traditional statistical procedures are not well-suited to the noisy, high-dimensional, settings encountered in medical imaging, e.g. with functional brain images. In this work, we modify the classical Minimum Covariance Determinant approach by adding a regularization term, that ensures that the estimation is well-posed in high-dimensional settings and in the presence of many outliers. We show on simulated and real data that outliers can be detected satisfactorily, even in situations where the number of dimensions of the data exceeds the number of observations.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Automation
  • Brain Mapping / methods*
  • Diagnostic Imaging / methods*
  • Humans
  • Image Processing, Computer-Assisted / methods*
  • Magnetic Resonance Imaging / methods*
  • Models, Statistical
  • Normal Distribution
  • Pattern Recognition, Automated
  • ROC Curve
  • Reproducibility of Results
  • Software