Discriminative components of data

IEEE Trans Neural Netw. 2005 Jan;16(1):68-83. doi: 10.1109/TNN.2004.836194.

Abstract

A simple probabilistic model is introduced to generalize classical linear discriminant analysis (LDA) in finding components that are informative of or relevant for data classes. The components maximize the predictability of the class distribution which is asymptotically equivalent to 1) maximizing mutual information with the classes, and 2) finding principal components in the so-called learning or Fisher metrics. The Fisher metric measures only distances that are relevant to the classes, that is, distances that cause changes in the class distribution. The components have applications in data exploration, visualization, and dimensionality reduction. In empirical experiments, the method outperformed, in addition to more classical methods, a Renyi entropy-based alternative while having essentially equivalent computational cost.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Artificial Intelligence*
  • Cluster Analysis
  • Computing Methodologies*
  • Databases, Factual
  • Discriminant Analysis*
  • Information Storage and Retrieval / methods*
  • Models, Statistical*
  • Neural Networks, Computer
  • Pattern Recognition, Automated / methods
  • Principle-Based Ethics