HERV-H RNA is abundant in human embryonic stem cells and a precise marker for pluripotency

Retrovirology. 2012 Dec 20:9:111. doi: 10.1186/1742-4690-9-111.

Abstract

Background: Certain post-translational modifications to histones, including H3K4me3, as well as binding sites for the transcription factor STAT1, predict the site of integration of exogenous gamma-retroviruses with great accuracy and cell-type specificity. Statistical methods that were used to identify chromatin features that predict exogenous gamma-retrovirus integration site selection were exploited here to determine whether cell type-specific chromatin markers are enriched in the vicinity of endogenous retroviruses (ERVs).

Results: Among retro-elements in the human genome, the gamma-retrovirus HERV-H was highly associated with H3K4me3, though this association was only observed in embryonic stem (ES) cells (p < 10-300) and, to a lesser extent, in induced pluripotent stem (iPS) cells. No significant association was observed in nearly 40 differentiated cell types, nor was any association observed with other retro-elements. Similar strong association was observed between HERV-H and the binding sites within ES cells for the pluripotency transcription factors NANOG, OCT4, and SOX2. NANOG binding sites were located within the HERV-H 5'LTR itself. OCT4 and SOX2 binding sites were within 1 kB and 2 kB of the 5'LTR, respectively. In keeping with these observations, HERV-H RNA constituted 2% of all poly A RNA in ES cells. As ES cells progressed down a differentiation pathway, the levels of HERV-H RNA decreased progressively. RNA-Seq datasets showed HERV-H transcripts to be over 5 kB in length and to have the structure 5'LTR-gag-pro-3'LTR, with no evidence of splicing and no intact open reading frames.

Conclusion: The developmental regulation of HERV-H expression, the association of HERV-H with binding sites for pluripotency transcription factors, and the extremely high levels of HERV-H RNA in human ES cells suggest that HERV-H contributes to pluripotency in human cells. Proximity of HERV-H to binding sites for pluripotency transcription factors within ES cells might be due to retention of the same chromatin features that determined the site of integration of the ancestral, exogenous, gamma-retrovirus that gave rise to HERV-H in the distant past. Retention of these markers, or, alternatively, recruitment of them to the site of the established provirus, may have acted post-integration to fix the provirus within the germ-line of the host species. Either way, HERV-H RNA provides a specific marker for pluripotency in human cells.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Binding Sites
  • Biomarkers / metabolism
  • Chromatin / chemistry
  • Embryonic Stem Cells / cytology
  • Embryonic Stem Cells / metabolism*
  • Embryonic Stem Cells / virology
  • Endogenous Retroviruses / genetics*
  • Gene Expression Regulation
  • Histones / genetics
  • Histones / metabolism*
  • Homeodomain Proteins / genetics
  • Homeodomain Proteins / metabolism
  • Humans
  • Induced Pluripotent Stem Cells / cytology
  • Induced Pluripotent Stem Cells / metabolism*
  • Induced Pluripotent Stem Cells / virology
  • Nanog Homeobox Protein
  • Octamer Transcription Factor-3 / genetics
  • Octamer Transcription Factor-3 / metabolism
  • Organ Specificity
  • Protein Binding
  • Protein Processing, Post-Translational*
  • RNA, Viral / genetics*
  • SOXB1 Transcription Factors / genetics
  • SOXB1 Transcription Factors / metabolism
  • STAT1 Transcription Factor / genetics
  • STAT1 Transcription Factor / metabolism
  • Signal Transduction
  • Terminal Repeat Sequences

Substances

  • Biomarkers
  • Chromatin
  • Histones
  • Homeodomain Proteins
  • NANOG protein, human
  • Nanog Homeobox Protein
  • Octamer Transcription Factor-3
  • POU5F1 protein, human
  • RNA, Viral
  • SOX2 protein, human
  • SOXB1 Transcription Factors
  • STAT1 Transcription Factor
  • STAT1 protein, human