Genomics and proteomics of mycobacteriophage patience, an accidental tourist in the Mycobacterium neighborhood

mBio. 2014 Dec 2;5(6):e02145. doi: 10.1128/mBio.02145-14.

Abstract

Newly emerging human viruses such as Ebola virus, severe acute respiratory syndrome (SARS) virus, and HIV likely originate within an extant population of viruses in nonhuman hosts and acquire the ability to infect and cause disease in humans. Although several mechanisms preventing viral infection of particular hosts have been described, the mechanisms and constraints on viral host expansion are ill defined. We describe here mycobacteriophage Patience, a newly isolated phage recovered using Mycobacterium smegmatis mc(2)155 as a host. Patience has genomic features distinct from its M. smegmatis host, including a much lower GC content (50.3% versus 67.4%) and an abundance of codons that are rarely used in M. smegmatis. Nonetheless, it propagates well in M. smegmatis, and we demonstrate the use of mass spectrometry to show expression of over 75% of the predicted proteins, to identify new genes, to refine the genome annotation, and to estimate protein abundance. We propose that Patience evolved primarily among lower-GC hosts and that the disparities between its genomic profile and that of M. smegmatis presented only a minimal barrier to host expansion. Rapid adaptions to its new host include recent acquisition of higher-GC genes, expression of out-of-frame proteins within predicted genes, and codon selection among highly expressed genes toward the translational apparatus of its new host.

Importance: The mycobacteriophage Patience genome has a notably lower GC content (50.3%) than its Mycobacterium smegmatis host (67.4%) and has markedly different codon usage biases. The viral genome has an abundance of codons that are rare in the host and are decoded by wobble tRNA pairing, although the phage grows well and expression of most of the genes is detected by mass spectrometry. Patience thus has the genomic profile of a virus that evolved primarily in one type of host genetic landscape (moderate-GC bacteria) but has found its way into a distinctly different high-GC environment. Although Patience genes are ill matched to the host expression apparatus, this is of little functional consequence and has not evidently imposed a barrier to migration across the microbial landscape. Interestingly, comparison of expression levels and codon usage profiles reveals evidence of codon selection as the genome evolves and adapts to its new environment.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Composition
  • Codon
  • Genome, Viral*
  • Mass Spectrometry
  • Mycobacteriophages / chemistry*
  • Mycobacteriophages / genetics*
  • Mycobacteriophages / isolation & purification
  • Mycobacteriophages / physiology
  • Mycobacterium smegmatis / virology*
  • Proteome / analysis*
  • Viral Proteins / analysis*
  • Viral Proteins / genetics*
  • Virus Replication

Substances

  • Codon
  • Proteome
  • Viral Proteins