Improved medical image modality classification using a combination of visual and textual features

Comput Med Imaging Graph. 2015 Jan:39:14-26. doi: 10.1016/j.compmedimag.2014.06.005. Epub 2014 Jun 19.

Abstract

In this paper, we present the approach that we applied to the medical modality classification tasks at the ImageCLEF evaluation forum. More specifically, we used the modality classification databases from the ImageCLEF competitions in 2011, 2012 and 2013, described by four visual and one textual types of features, and combinations thereof. We used local binary patterns, color and edge directivity descriptors, fuzzy color and texture histogram and scale-invariant feature transform (and its variant opponentSIFT) as visual features and the standard bag-of-words textual representation coupled with TF-IDF weighting. The results from the extensive experimental evaluation identify the SIFT and opponentSIFT features as the best performing features for modality classification. Next, the low-level fusion of the visual features improves the predictive performance of the classifiers. This is because the different features are able to capture different aspects of an image, their combination offering a more complete representation of the visual content in an image. Moreover, adding textual features further increases the predictive performance. Finally, the results obtained with our approach are the best results reported on these databases so far.

Keywords: Feature fusion; Image modality classification; Visual image descriptors.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Artificial Intelligence
  • Documentation / methods*
  • Image Enhancement / methods
  • Image Interpretation, Computer-Assisted / methods*
  • Natural Language Processing*
  • Pattern Recognition, Automated / methods*
  • Radiology Information Systems / organization & administration*
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Terminology as Topic
  • User-Computer Interface