Subtypes of tail spike proteins predicts the host range of Ackermannviridae phages

Comput Struct Biotechnol J. 2021 Aug 21:19:4854-4867. doi: 10.1016/j.csbj.2021.08.030. eCollection 2021.

Abstract

Phages belonging to the Ackermannviridae family encode up to four tail spike proteins (TSPs), each recognizing a specific receptor of their bacterial hosts. Here, we determined the TSPs diversity of 99 Ackermannviridae phages by performing a comprehensive in silico analysis. Based on sequence diversity, we assigned all TSPs into distinctive subtypes of TSP1, TSP2, TSP3 and TSP4, and found each TSP subtype to be specifically associated with the genera (Kuttervirus, Agtrevirus, Limestonevirus, Taipeivirus) of the Ackermannviridae family. Further analysis showed that the N-terminal XD1 and XD2 domains in TSP2 and TSP4, hinging the four TSPs together, are preserved. In contrast, the C-terminal receptor binding modules were only conserved within TSP subtypes, except for some Kuttervirus TSP1s and TSP3s that were similar to specific TSP4s. A conserved motif in TSP1, TSP3 and TSP4 of Kuttervirus phages may allow recombination between receptor binding modules, thus altering host recognition. The receptors for numerous uncharacterized phages expressing TSPs in the same subtypes were predicted using previous host range data. To validate our predictions, we experimentally determined the host recognition of three of the four TSPs expressed by kuttervirus S117. We confirmed that S117 TSP1 and TSP2 bind to their predicted host receptors, and identified the receptor for TSP3, which is shared by 51 other Kuttervirus phages. Kuttervirus phages were thus shown encode a vast genetic diversity of potentially exchangeable TSPs influencing host recognition. Overall, our study demonstrates that comprehensive in silico and host range analysis of TSPs can predict host recognition of Ackermannviridae phages.

Keywords: ANI, Average nucleotide identity; Ackermannviridae family; Bacteriophage; CPS, Capsular polysaccharide; EOP, Efficiency of plating; Escherichia coli O:157; Host range; LB, Luria-Bertani; LPS, Lipopolysaccharide; NCBI, National Center for Biotechnology Information; O-antigen; ORF, Open reading frame; PFU, Plaque formation unit; RBP, Receptor binding protein; Receptor-binding proteins; Salmonella; TSP, Tail spike protein; Tail spike proteins; VriC, Virulence-associated protein.