Comparative analysis of protein evolution in the genome of pre-epidemic and epidemic Zika virus

Infect Genet Evol. 2017 Jul:51:74-85. doi: 10.1016/j.meegid.2017.03.012. Epub 2017 Mar 14.

Abstract

Zika virus (ZIKV) causes microcephaly in congenital infection, neurological disorders, and poor pregnancy outcome and no vaccine is available for use in humans or approved. Although ZIKV was first discovered in 1947, the exact mechanism of virus replication and pathogenesis remains unknown. Recent outbreaks of Zika virus in the Americas clearly suggest a human-mosquito cycle or urban cycle of transmission. Understanding the conserved and adaptive features in the evolution of ZIKV genome will provide a hint on the mechanism of ZIKV adaptation to a new cycle of transmission. Here, we show comprehensive analysis of protein evolution of ZIKV strains including the current 2015-16 outbreak. To identify the constraints on ZIKV evolution, selection pressure at individual codons, immune epitopes and co-evolving sites were analyzed. Phylogenetic trees show that the ZIKV strains of the Asian genotype form distinct cluster and share a common ancestor with African genotype. The TMRCA (Time to the Most Recent Common Ancestor) for the Asian lineage and the subsequently evolved Asian human strains was calculated at 88 and 34years ago, respectively. The proteome of current 2015/16 epidemic ZIKV strains of Asian genotype was found to be genetically conserved due to genome-wide negative selection, with limited positive selection. We identified a total of 16 amino acid substitutions in the epidemic and pre-epidemic strains from human, mosquito, and monkey hosts. Negatively selected amino acid sites of Envelope protein (E-protein) (positions 69, 166, and 174) and NS5 (292, 345, and 587) were located in central dimerization domains and C-terminal RNA-directed RNA polymerase regions, respectively. The predicted 137 (92 CD4 TCEs; 45 CD8 TCEs) immunogenic peptide chains comprising negatively selected amino acid sites can be considered as suitable target for sub-unit vaccine development, as these sites are less likely to generate immune-escape variants due to strong functional constrains operating on them. The targeted changes at the amino acid level may contribute to better adaptation of ZIKV strains to human-mosquito cycle or urban cycle of transmission.

Keywords: Co-evolving sites; Host adaptation; Immune epitopes; Natural selection; Zika virus.

Publication types

  • Comparative Study

MeSH terms

  • Aedes / virology
  • Africa / epidemiology
  • Americas / epidemiology
  • Amino Acid Substitution
  • Animals
  • Disease Outbreaks*
  • Evolution, Molecular
  • Genome, Viral*
  • Haplorhini
  • Humans
  • India / epidemiology
  • Infant, Newborn
  • Insect Vectors / virology
  • Models, Molecular
  • Phylogeny*
  • Proteome / genetics*
  • Proteome / metabolism
  • RNA-Dependent RNA Polymerase / genetics
  • RNA-Dependent RNA Polymerase / metabolism
  • Selection, Genetic
  • Viral Envelope Proteins / chemistry
  • Viral Envelope Proteins / genetics
  • Viral Envelope Proteins / metabolism
  • Zika Virus / classification
  • Zika Virus / genetics*
  • Zika Virus / isolation & purification
  • Zika Virus Infection / epidemiology*
  • Zika Virus Infection / virology

Substances

  • Proteome
  • Viral Envelope Proteins
  • RNA-Dependent RNA Polymerase