Identification of distinct genotypes in circulating RSV A strains based on variants on the virus replication-associated genes

bioRxiv [Preprint]. 2024 Apr 23:2024.04.22.590570. doi: 10.1101/2024.04.22.590570.

Abstract

Respiratory syncytial virus (RSV) is a common cause of respiratory infection that often leads to hospitalization of young children and older adults. RSV is classified into two strains, A and B, each with several subgroups or genotypes. One issue with the definition of these subgroups is the lack of a unified method of identification or genotyping. We propose that genotyping strategies based on the genes coding for replication-associated proteins could provide critical information on the replication capacity of the distinct subgroups while clearly distinguishing genotypes. Here, we analyzed the virus replication-associated genes N, P, M2, and L from de novo assembled RSV A sequences obtained from 31 newly sequenced samples from hospitalized patients in Philadelphia and 78 additional publicly available sequences from different geographic locations within the US. In-depth analysis and annotation of the protein variants in L and the other replication-associated proteins N, P, M2-1, and M2-2 identified the polymerase protein L as a robust target for genotyping RSV subgroups. Importantly, our analysis revealed non-synonymous variations in L that were consistently accompanied by conserved changes in its co-factor P or the M2-2 protein, suggesting associations and interactions between specific domains of these proteins. These results highlight L as an alternative to other RSV genotyping targets and demonstrate the value of in-depth analyses and annotations of RSV sequences, which can serve as a foundation for subsequent in vitro and clinical studies on the efficiency of the polymerase and the fitness of different virus isolates.
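
As a rough illustration of the kind of analysis summarized above, the following minimal Python sketch calls amino acid substitutions against a reference, groups samples by their L substitution profile, and lists L substitutions carried by exactly the same samples as a P substitution. It is not the authors' pipeline. The function names, the sample identifiers, and the short peptide strings are all hypothetical toy data invented for this example (they are not real RSV sequences), and the sketch assumes the protein sequences are already aligned to the reference at equal length.

```python
from collections import defaultdict


def call_variants(reference: str, sample: str) -> frozenset:
    """Return substitutions such as 'I4V' (1-based positions) for an aligned pair."""
    if len(reference) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    return frozenset(
        f"{ref}{pos}{alt}"
        for pos, (ref, alt) in enumerate(zip(reference, sample), start=1)
        if ref != alt and alt not in "-X"  # skip gaps and ambiguous residues
    )


def group_by_signature(calls: dict) -> dict:
    """Group sample names by their full set of substitutions (a candidate genotype)."""
    groups = defaultdict(list)
    for name, variants in calls.items():
        groups[variants].append(name)
    return dict(groups)


def carriers(calls: dict) -> dict:
    """Map each substitution to the set of samples that carry it."""
    by_variant = defaultdict(set)
    for name, variants in calls.items():
        for variant in variants:
            by_variant[variant].add(name)
    return by_variant


def co_occurring(l_calls: dict, p_calls: dict) -> list:
    """Return (L substitution, P substitution) pairs carried by identical sample sets."""
    l_car, p_car = carriers(l_calls), carriers(p_calls)
    return sorted(
        (lv, pv)
        for lv, l_samples in l_car.items()
        for pv, p_samples in p_car.items()
        if l_samples == p_samples
    )


# Hypothetical toy peptides, NOT real RSV L or P sequences.
L_REF = "MDPIINGNSA"
L_SAMPLES = {"s1": "MDPIINGNSA", "s2": "MDPVINGNSA", "s3": "MDPVINGNSA", "s4": "MDPIINGKSA"}
P_REF = "MEKFAPEF"
P_SAMPLES = {"s1": "MEKFAPEF", "s2": "MEKLAPEF", "s3": "MEKLAPEF", "s4": "MEKFAPEF"}

l_calls = {name: call_variants(L_REF, seq) for name, seq in L_SAMPLES.items()}
p_calls = {name: call_variants(P_REF, seq) for name, seq in P_SAMPLES.items()}

genotypes = group_by_signature(l_calls)
linked = co_occurring(l_calls, p_calls)
print(genotypes)
print(linked)
```

In this toy data, samples s2 and s3 share the L substitution I4V and also share the P substitution F4L, so the pair is reported as co-occurring. A real analysis would additionally need alignment, handling of insertions and deletions, and a statistical test of association rather than exact set equality.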

Keywords: genotypes; polymerase L; replication-associated genes; respiratory syncytial virus.

Publication types

  • Preprint