Global analysis of somatic structural genomic alterations and their impact on gene expression in diverse human cancers

Proc Natl Acad Sci U S A. 2016 Nov 29;113(48):13768-13773. doi: 10.1073/pnas.1606220113. Epub 2016 Nov 16.

Abstract

Tumor genomes are mosaics of somatic structural variants (SVs) that may contribute to the activation of oncogenes or inactivation of tumor suppressors, for example, by altering gene copy number amplitude. However, there are multiple other ways in which SVs can modulate transcription, but the general impact of such events on tumor transcriptional output has not been systematically determined. Here we use whole-genome sequencing data to map SVs across 600 tumors and 18 cancers, and investigate the relationship between SVs, copy number alterations (CNAs), and mRNA expression. We find that 34% of CNA breakpoints can be clarified structurally and that most amplifications are due to tandem duplications. We observe frequent swapping of strong and weak promoters in the context of gene fusions, and find that this has a measurable global impact on mRNA levels. Interestingly, several long noncoding RNAs were strongly activated by this mechanism. Additionally, SVs were confirmed in telomere reverse transcriptase (TERT) upstream regions in several cancers, associated with elevated TERT mRNA levels. We also highlight high-confidence gene fusions supported by both genomic and transcriptomic evidence, including a previously undescribed paired box 8 (PAX8)-nuclear factor, erythroid 2 like 2 (NFE2L2) fusion in thyroid carcinoma. In summary, we combine SV, CNA, and expression data to provide insights into the structural basis of CNAs as well as the impact of SVs on gene expression in tumors.

Keywords: cancer genomics; gene expression; gene fusion; somatic structural variation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA Copy Number Variations / genetics
  • Gene Expression Regulation, Neoplastic / genetics
  • Genome, Human / genetics*
  • Genomic Structural Variation / genetics*
  • Humans
  • NF-E2-Related Factor 2 / genetics
  • Neoplasms / genetics*
  • Neoplasms / pathology
  • Oncogene Proteins, Fusion / genetics*
  • RNA, Messenger / genetics
  • Telomerase / genetics

Substances

  • NF-E2-Related Factor 2
  • NFE2L2 protein, human
  • Oncogene Proteins, Fusion
  • RNA, Messenger
  • TERT protein, human
  • Telomerase