Tobacco Smoking-Related Mutational Signatures in Classifying Smoking-Associated and Nonsmoking-Associated NSCLC

J Thorac Oncol. 2023 Apr;18(4):487-498. doi: 10.1016/j.jtho.2022.11.030. Epub 2022 Dec 14.

Abstract

Introduction: Patient-reported smoking history is frequently used as a stratification factor in NSCLC-directed clinical research. Nevertheless, this classification does not fully reflect the mutational processes in a tumor. Next-generation sequencing can identify mutational signatures associated with tobacco smoking, such as single-base signature 4 and indel-based signature 3. This provides an opportunity to redefine the classification of smoking- and nonsmoking-associated NSCLC on the basis of individual genomic tumor characteristics and could contribute to reducing the lung cancer stigma.

Methods: Whole genome sequencing data and clinical records were obtained from three prospective cohorts of metastatic NSCLC (N = 316). Relative contributions and absolute counts of single-base signature 4 and indel-based signature 3 were combined with relative contributions of age-related signatures to divide the cohort into smoking-associated ("smoking high") and nonsmoking-associated ("smoking low") clusters.

Results: The smoking high (n = 169) and smoking low (n = 147) clusters differed considerably in tumor mutational burden, signature contribution, and mutational landscape. This signature-based classification overlapped considerably with smoking history. Yet, 26% of patients with an active smoking history were included in the smoking low cluster, of which 52% harbored an EGFR/ALK/RET/ROS1 alteration, and 4% of patients without smoking history were included in the smoking high cluster. These discordant samples had similar genomic contexts to the rest of their respective cluster.

Conclusions: A substantial subset of metastatic NSCLC is differently classified into smoking- and nonsmoking-associated tumors on the basis of smoking-related mutational signatures than on the basis of smoking history. This signature-based classification more accurately classifies patients on the basis of genome-wide context and should therefore be considered as a stratification factor in clinical research.

Keywords: Biomarker; Mutational signature; Non–small cell lung cancer; Tobacco smoking; Whole genome sequencing.

MeSH terms

  • Carcinoma, Non-Small-Cell Lung* / genetics
  • Humans
  • Lung Neoplasms* / genetics
  • Lung Neoplasms* / pathology
  • Mutation
  • Prospective Studies
  • Protein-Tyrosine Kinases / genetics
  • Proto-Oncogene Proteins / genetics
  • Smoking / adverse effects
  • Smoking / genetics
  • Tobacco Smoking / adverse effects
  • Tobacco Smoking / genetics

Substances

  • Protein-Tyrosine Kinases
  • Proto-Oncogene Proteins