Conserved RNA secondary structures and long-range interactions in hepatitis C viruses

RNA. 2015 Jul;21(7):1219-32. doi: 10.1261/rna.049338.114. Epub 2015 May 11.

Abstract

Hepatitis C virus (HCV) is a hepatotropic virus with a plus-strand RNA genome of ∼9.600 nt. Due to error-prone replication by its RNA-dependent RNA polymerase (RdRp) residing in nonstructural protein 5B (NS5B), HCV isolates are grouped into seven genotypes with several subtypes. By using whole-genome sequences of 106 HCV isolates and secondary structure alignments of the plus-strand genome and its minus-strand replication intermediate, we established refined secondary structures of the 5' untranslated region (UTR), the cis-acting replication element (CRE) in NS5B, and the 3' UTR. We propose an alternative structure in the 5' UTR, conserved secondary structures of 5B stem-loop (SL)1 and 5BSL2, and four possible structures of the X-tail at the very 3' end of the HCV genome. We predict several previously unknown long-range interactions, most importantly a possible circularization interaction between distinct elements in the 5' and 3' UTR, reminiscent of the cyclization elements of the related flaviviruses. Based on analogy to these viruses, we propose that the 5'-3' UTR base-pairing in the HCV genome might play an important role in viral RNA replication. These results may have important implications for our understanding of the nature of the cis-acting RNA elements in the HCV genome and their possible role in regulating the mutually exclusive processes of viral RNA translation and replication.

Keywords: HCV; RNA long-range interaction prediction; RNA secondary structure prediction; bioinformatics; circularization.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • 3' Untranslated Regions
  • 5' Untranslated Regions
  • Base Sequence
  • Hepacivirus / genetics*
  • Hepacivirus / physiology
  • Molecular Sequence Data
  • Nucleic Acid Conformation*
  • RNA, Viral / chemistry*
  • Virus Replication

Substances

  • 3' Untranslated Regions
  • 5' Untranslated Regions
  • RNA, Viral