The LeFE algorithm: embracing the complexity of gene expression in the interpretation of microarray data

Genome Biol. 2007;8(9):R187. doi: 10.1186/gb-2007-8-9-r187.

Abstract

Interpretation of microarray data remains a challenge, and most methods fail to consider the complex, nonlinear regulation of gene expression. To address that limitation, we introduce Learner of Functional Enrichment (LeFE), a statistical/machine learning algorithm based on Random Forest, and demonstrate it on several diverse datasets: smoker/never smoker, breast cancer classification, and cancer drug sensitivity. We also compare it with previously published algorithms, including Gene Set Enrichment Analysis. LeFE regularly identifies statistically significant functional themes consistent with known biology.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Breast Neoplasms / diagnosis*
  • Breast Neoplasms / genetics*
  • Cell Line, Tumor
  • Cluster Analysis
  • Computational Biology / methods
  • Female
  • Gefitinib
  • Gene Expression Profiling*
  • Gene Expression Regulation, Neoplastic*
  • Humans
  • Oligonucleotide Array Sequence Analysis
  • Pattern Recognition, Automated
  • Quinazolines / pharmacology
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Smoking

Substances

  • Quinazolines
  • Gefitinib