Genomic and transcriptional alterations in lung adenocarcinoma in relation to smoking history

Clin Cancer Res. 2014 Sep 15;20(18):4912-24. doi: 10.1158/1078-0432.CCR-14-0246. Epub 2014 Jul 18.

Abstract

Purpose: Cigarette smoking is the major pathogenic factor for lung cancer. The precise mechanisms of tobacco-related carcinogenesis and its effect on the genomic and transcriptional landscape in lung cancer are not fully understood.

Experimental design: A total of 1,398 (277 never-smokers and 1,121 smokers) genomic and 1,449 (370 never-smokers and 1,079 smokers) transcriptional profiles were assembled from public lung adenocarcinoma cohorts, including matched next-generation DNA-sequencing data (n = 423). Unsupervised and supervised methods were used to identify smoking-related copy-number alterations (CNAs), predictors of smoking status, and molecular subgroups.

Results: Genomic meta-analyses showed that never-smokers and smokers harbored a similar frequency of total CNAs, although specific regions (5q, 8q, 16p, 19p, and 22q) displayed a 20% to 30% frequency difference between the two groups. Importantly, supervised classification analyses based on CNAs or gene expression could not accurately predict smoking status (balanced accuracies ∼60% to 80%). However, unsupervised multicohort transcriptional profiling stratified adenocarcinomas into distinct molecular subgroups with specific patterns of CNAs, oncogenic mutations, and mutation transversion frequencies that were independent of the smoking status. One subgroup included approximately 55% to 90% of never-smokers and approximately 20% to 40% of smokers (both current and former) with molecular and clinical features of a less aggressive and smoking-unrelated disease. Given the considerable intragroup heterogeneity in smoking-defined subgroups, especially among former smokers, our results emphasize the clinical importance of accurate molecular characterization of lung adenocarcinoma.

Conclusions: The landscape of smoking-related CNAs and transcriptional alterations in adenocarcinomas is complex, heterogeneous, and with moderate differences. Our results support a molecularly distinct less aggressive adenocarcinoma entity, arising in never-smokers and a subset of smokers.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenocarcinoma / etiology*
  • Adenocarcinoma / genetics*
  • Adenocarcinoma of Lung
  • Female
  • Gene Dosage
  • Genomics
  • Humans
  • Lung Neoplasms / etiology*
  • Lung Neoplasms / genetics*
  • Male
  • Oligonucleotide Array Sequence Analysis
  • Smoking / adverse effects*
  • Smoking / genetics*
  • Transcription, Genetic
  • Transcriptome