Improved breast cancer prognosis through the combination of clinical and genetic markers

Bioinformatics. 2007 Jan 1;23(1):30-7. doi: 10.1093/bioinformatics/btl543. Epub 2006 Nov 26.

Abstract

Motivation: Accurate prognosis of breast cancer can spare a significant number of breast cancer patients from receiving unnecessary adjuvant systemic treatment and its related expensive medical costs. Recent studies have demonstrated the potential value of gene expression signatures in assessing the risk of post-surgical disease recurrence. However, these studies all attempt to develop genetic marker-based prognostic systems to replace the existing clinical criteria, while ignoring the rich information contained in established clinical markers. Given the complexity of breast cancer prognosis, a more practical strategy would be to utilize both clinical and genetic marker information that may be complementary.

Methods: A computational study is performed on publicly available microarray data, from which a 70-gene prognostic signature was previously derived. The recently proposed I-RELIEF algorithm is used to identify a hybrid signature through the combination of both genetic and clinical markers. A rigorous experimental protocol is used to estimate the prognostic performance of the hybrid signature and other prognostic approaches. Survival data analysis is performed to compare different prognostic approaches.
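
As a rough illustration of the feature-weighting idea behind Relief-type methods, the sketch below implements the classic Relief weight update, not I-RELIEF itself, which the abstract does not specify in detail. The function name `relief_weights`, the L1 distance, and the single pass over all samples are assumptions made for this example. Clinical and genetic markers could be placed side by side as columns of `X` after scaling to comparable ranges.

```python
def relief_weights(X, y):
    """Classic Relief feature weights (illustrative, NOT the paper's I-RELIEF).

    X: list of feature vectors (clinical and genetic markers as columns,
       assumed to be scaled to comparable ranges).
    y: list of binary class labels (e.g. 1 = poor outcome, 0 = good outcome).
    Each class must contain at least two samples.
    """
    n, d = len(X), len(X[0])
    weights = [0.0] * d

    def dist(a, b):
        # L1 distance between two samples (an assumption of this sketch)
        return sum(abs(p - q) for p, q in zip(a, b))

    for i in range(n):
        same = [j for j in range(n) if j != i and y[j] == y[i]]
        other = [j for j in range(n) if y[j] != y[i]]
        hit = min(same, key=lambda j: dist(X[i], X[j]))    # nearest same-class sample
        miss = min(other, key=lambda j: dist(X[i], X[j]))  # nearest other-class sample
        for k in range(d):
            # A feature gains weight when it separates the sample from its
            # nearest miss more than from its nearest hit.
            weights[k] += (abs(X[i][k] - X[miss][k]) - abs(X[i][k] - X[hit][k])) / n
    return weights


# Toy data: column 0 tracks the class label, column 1 does not.
X_toy = [[0.0, 0.3], [0.1, 0.9], [1.0, 0.2], [0.9, 0.8]]
y_toy = [0, 0, 1, 1]
toy_weights = relief_weights(X_toy, y_toy)
```

Features with the largest weights would be candidates for a signature. The selection threshold and any cross-validation scheme are left out here because the abstract does not describe them.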

Results: The hybrid signature performs significantly better than other methods, including the 70-gene signature, clinical markers alone and the St. Gallen consensus criterion. At the 90% sensitivity level, the hybrid signature achieves 67% specificity, as compared to 47% for the 70-gene signature and 48% for the clinical markers. The odds ratio of the hybrid signature for developing distant metastases within five years between the patients with a good prognosis signature and the patients with a bad prognosis is 21.0 (95% CI: 6.5-68.3), far higher than either genetic or clinical markers alone.
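
The two kinds of figures reported above, specificity at a fixed sensitivity and an odds ratio with a confidence interval, can be computed along the following lines. This is a minimal sketch on made-up toy inputs, not the study's data or code. The function names, the threshold search, and the Wald (log-scale) interval are assumptions, since the abstract does not state how the interval was obtained.

```python
import math


def specificity_at_sensitivity(scores, labels, target_sensitivity=0.9):
    """Best specificity over thresholds whose sensitivity meets the target.

    scores: predicted risk (higher = worse prognosis).
    labels: 1 = event (e.g. distant metastasis within five years), 0 = no event.
    A sample is called "poor prognosis" when its score is >= the threshold.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    best = 0.0
    for t in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if s >= t and y == 1)
        tn = sum(1 for s, y in zip(scores, labels) if s < t and y == 0)
        if tp / n_pos >= target_sensitivity:
            best = max(best, tn / n_neg)
    return best


def odds_ratio_wald_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald 95% CI from a 2x2 table with non-zero cells.

    a: predicted poor, event      b: predicted poor, no event
    c: predicted good, event      d: predicted good, no event
    """
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper


# Toy inputs for illustration only (not taken from the study).
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.55, 0.4, 0.3, 0.2, 0.1, 0.05]
toy_labels = [1, 1, 1, 0, 1, 0, 0, 1, 0, 0]
toy_spec = specificity_at_sensitivity(toy_scores, toy_labels, target_sensitivity=0.8)
toy_or = odds_ratio_wald_ci(10, 5, 2, 20)
```

Fixing sensitivity first reflects the clinical priority stated in the motivation: few patients who will relapse should be missed, so methods are compared by how many low-risk patients they correctly spare from treatment at that sensitivity.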

Availability: The breast cancer dataset is available at www.nature.com and Matlab codes are available upon request.

MeSH terms

  • Algorithms
  • Biomarkers, Tumor / analysis*
  • Breast Neoplasms / classification*
  • Breast Neoplasms / diagnosis*
  • Breast Neoplasms / genetics
  • Breast Neoplasms / therapy
  • Computational Biology
  • Female
  • Genetic Markers*
  • Humans
  • Models, Biological*
  • Neoplasm Metastasis / diagnosis
  • Neoplasm Recurrence, Local / classification
  • Neoplasm Recurrence, Local / diagnosis
  • Odds Ratio
  • Oligonucleotide Array Sequence Analysis / methods*
  • Predictive Value of Tests
  • Prognosis
  • ROC Curve
  • Sensitivity and Specificity

Substances

  • Biomarkers, Tumor
  • Genetic Markers