Prognostic gene signatures for non-small-cell lung cancer

Proc Natl Acad Sci U S A. 2009 Feb 24;106(8):2824-8. doi: 10.1073/pnas.0809444106. Epub 2009 Feb 5.

Abstract

Resectable non-small-cell lung cancer (NSCLC) patients have poor prognosis, with 30-50% relapsing within 5 years. Current staging criteria do not fully capture the complexity of this disease. Survival could be improved by identification of those early-stage patients who are most likely to benefit from adjuvant therapy. Molecular classification by using mRNA expression profiles has led to multiple, poorly overlapping signatures. We hypothesized that differing statistical methodologies contribute to this lack of overlap. To test this hypothesis, we analyzed our previously published quantitative RT-PCR dataset with a semisupervised method. A 6-gene signature was identified and validated in 4 independent public microarray datasets that represent a range of tumor histologies and stages. This result demonstrated that at least 2 prognostic signatures can be derived from this single dataset. We next estimated the total number of prognostic signatures in this dataset with a 10-million-signature permutation study. Our 6-gene signature was among the top 0.02% of signatures with maximum verifiability, reaffirming its efficacy. Importantly, this analysis identified 1,789 unique signatures, implying that our dataset contains >500,000 verifiable prognostic signatures for NSCLC. This result appears to rationalize the observed lack of overlap among reported NSCLC prognostic signatures.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Carcinoma, Non-Small-Cell Lung / genetics*
  • Gene Expression Profiling*
  • Humans
  • Lung Neoplasms / genetics*
  • Prognosis
  • RNA, Messenger / genetics
  • Reverse Transcriptase Polymerase Chain Reaction

Substances

  • RNA, Messenger