A tutorial on support vector machine-based methods for classification problems in chemometrics

Anal Chim Acta. 2010 Apr 30;665(2):129-45. doi: 10.1016/j.aca.2010.03.030. Epub 2010 Mar 24.

Abstract

This tutorial provides a concise overview of support vector machines and different closely related techniques for pattern classification. The tutorial starts with the formulation of support vector machines for classification. The method of least squares support vector machines is explained. Approaches to retrieve a probabilistic interpretation are covered and it is explained how the binary classification techniques can be extended to multi-class methods. Kernel logistic regression, which is closely related to iteratively weighted least squares support vector machines, is discussed. Different practical aspects of these methods are addressed: the issue of feature selection, parameter tuning, unbalanced data sets, model evaluation and statistical comparison. The different concepts are illustrated on three real-life applications in the field of metabolomics, genetics and proteomics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Artificial Intelligence*
  • Brain Neoplasms / classification
  • Image Interpretation, Computer-Assisted
  • Least-Squares Analysis
  • Logistic Models
  • Pattern Recognition, Automated / methods*