Support vector machine approach to separate control and breast cancer serum samples

Stat Appl Genet Mol Biol. 2008;7(2):Article11. doi: 10.2202/1544-6115.1355. Epub 2008 Feb 21.

Abstract

The paper presents two analyzes of the MALDI-TOF mass spectrometry dataset. Both analyzes use the support vector machine as a tool to build a prediction model. The first analysis which is our contribution to the competition uses the given spectra data without further processing. In the second analysis, we employed an additional preprocessing step consisting of peak detection, peak alignment and feature selection based on statistical tests. The experimental results suggest that the preprocessing step with feature selection improves prediction accuracy.

MeSH terms

  • Artificial Intelligence*
  • Breast Neoplasms / blood*
  • Breast Neoplasms / classification
  • Breast Neoplasms / diagnosis
  • Case-Control Studies
  • Data Interpretation, Statistical
  • Databases, Protein
  • Diagnosis, Computer-Assisted
  • Female
  • Humans
  • Models, Statistical
  • Neoplasm Proteins / blood
  • Proteomics / statistics & numerical data*
  • Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization / statistics & numerical data*

Substances

  • Neoplasm Proteins