Human cytomegalovirus haplotype reconstruction reveals high diversity due to superinfection and evidence of within-host recombination

Proc Natl Acad Sci U S A. 2019 Mar 19;116(12):5693-5698. doi: 10.1073/pnas.1818130116. Epub 2019 Feb 28.

Abstract

Recent sequencing efforts have led to estimates of human cytomegalovirus (HCMV) genome-wide intrahost diversity that rival those of persistent RNA viruses [Renzette N, Bhattacharjee B, Jensen JD, Gibson L, Kowalik TF (2011) PLoS Pathog 7:e1001344]. Here, we deep sequence HCMV genomes recovered from single and longitudinally collected blood samples from immunocompromised children to show that the observations of high within-host HCMV nucleotide diversity are explained by the frequent occurrence of mixed infections caused by genetically distant strains. To confirm this finding, we reconstructed within-host viral haplotypes from short-read sequence data. We verify that within-host HCMV nucleotide diversity in unmixed infections is no greater than that of other DNA viruses analyzed by the same sequencing and bioinformatic methods and considerably less than that of human immunodeficiency and hepatitis C viruses. By resolving individual viral haplotypes within patients, we reconstruct the timing, likely origins, and natural history of superinfecting strains. We uncover evidence for within-host recombination between genetically distinct HCMV strains, observing the loss of the parental virus containing the nonrecombinant fragment. The data suggest selection for strains containing the recombinant fragment, generating testable hypotheses about HCMV evolution and pathogenesis. These results highlight that high HCMV diversity present in some samples is caused by coinfection with multiple distinct strains and provide reassurance that within the host diversity for single-strain HCMV infections is no greater than for other herpesviruses.

Keywords: diversity; human cytomegalovirus; recombination; superinfection; whole genome sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence / genetics
  • Child
  • Child, Preschool
  • Cytomegalovirus / genetics*
  • Cytomegalovirus Infections / virology
  • DNA, Viral / genetics
  • Female
  • Genetic Variation / genetics
  • Genome, Human / genetics
  • Genome, Viral
  • Haplotypes / genetics
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Immunocompromised Host / genetics
  • Infant
  • Infant, Newborn
  • Male
  • Recombination, Genetic / genetics*
  • Sequence Analysis, DNA / methods
  • Superinfection / genetics*

Substances

  • DNA, Viral