Investigation of somatic single nucleotide variations in human endogenous retrovirus elements and their potential association with cancer

PLoS One. 2019 Apr 1;14(4):e0213770. doi: 10.1371/journal.pone.0213770. eCollection 2019.

Abstract

Human endogenous retroviruses (HERVs) have been investigated for potential links with human cancer. However, the distribution of somatic nucleotide variations in HERV elements has not been explored in detail. This study aims to identify HERV elements with an over-representation of somatic mutations (hot spots) in cancer patients. Four HERV elements with mutation hotspots were identified that overlap with exons of four human protein coding genes. These hotspots were identified based on the significant over-representation (p<8.62e-4) of non-synonymous single-nucleotide variations (nsSNVs). These genes are TNN (HERV-9/LTR12), OR4K15 (HERV-IP10F/LTR10F), ZNF99 (HERV-W/HERV17/LTR17), and KIR2DL1 (MST/MaLR). In an effort to identify mutations that effect survival, all nsSNVs were further evaluated and it was found that kidney cancer patients with mutation C2270G in ZNF99 have a significantly lower survival rate (hazard ratio = 2.6) compared to those without it. Among HERV elements in the human non-protein coding regions, we found 788 HERVs with significantly elevated numbers of somatic single-nucleotide variations (SNVs) (p<1.60e-5). From this category the top three HERV elements with significantly over-represented SNVs are HERV-H/LTR7, HERV-9/LTR12 and HERV-L/MLT2. Majority of the SNVs in these 788 HERV elements are located in three DNA functional groups: long non-coding RNAs (lncRNAs) (60%), introns (22.2%) and transcriptional factor binding sites (TFBS) (14.8%). This study provides a list of mutational hotspots in HERVs, which could potentially be used as biomarkers and therapeutic targets.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Endogenous Retroviruses / genetics*
  • Exons / genetics
  • Gene Expression Regulation, Neoplastic
  • Genome, Human / genetics*
  • Humans
  • Introns / genetics
  • Kidney Neoplasms / genetics*
  • Kidney Neoplasms / pathology
  • Mutation
  • Polymorphism, Single Nucleotide / genetics*
  • RNA, Long Noncoding / genetics
  • Receptors, KIR2DL1 / genetics
  • Survival Analysis
  • Tenascin / genetics
  • Terminal Repeat Sequences / genetics

Substances

  • KIR2DL1 protein, human
  • RNA, Long Noncoding
  • Receptors, KIR2DL1
  • Tenascin
  • tenascin-W, human

Grants and funding

This work was supported by McCormick Genomic and Proteomic Center at George Washington University. There was no additional external funding received for this study.