Molecular classification of cancer types from microarray data using the combination of genetic algorithms and support vector machines

FEBS Lett. 2003 Dec 4;555(2):358-62. doi: 10.1016/s0014-5793(03)01275-4.

Abstract

Simultaneous multiclass classification of tumor types is essential for future clinical implementations of microarray-based cancer diagnosis. In this study, we have combined genetic algorithms (GAs) and all paired support vector machines (SVMs) for multiclass cancer identification. The predictive features have been selected through iterative SVMs/GAs, and recursive feature elimination post-processing steps, leading to a very compact cancer-related predictive gene set. Leave-one-out cross-validations yielded accuracies of 87.93% for the eight-class and 85.19% for the fourteen-class cancer classifications, outperforming the results derived from previously published methods.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Base Sequence
  • Cluster Analysis
  • Computational Biology / methods*
  • Confidence Intervals
  • Databases, Genetic
  • Gene Expression Profiling
  • Humans
  • Molecular Sequence Data
  • Neoplasms / classification
  • Neoplasms / genetics*
  • Oligonucleotide Array Sequence Analysis / methods*

Associated data

  • GENBANK/D51292
  • GENBANK/M21388
  • GENBANK/R12974
  • GENBANK/X07979
  • GENBANK/X12876
  • GENBANK/Z19554
  • GENBANK/Z29678