Abstract
New SARS-CoV-2 mutants have been continuously indentified with enhanced transmission ever since its outbreak in early 2020. As an RNA virus, SARS-CoV-2 has a high mutation rate due to the low fidelity of RNA polymerase. To study the single nucleotide polymorphisms (SNPs) dynamics of SARS-CoV-2, 158 SNPs with high confidence were identified by deep meta-transcriptomic sequencing, and the most common SNP type was C > T. Analyses of intra-host population diversity revealed that intra-host quasispecies' composition varies with time during the early onset of symptoms, which implicates viral evolution during infection. Network analysis of co-occurring SNPs revealed the most abundant non-synonymous SNP 22,638 in the S glycoprotein RBD region and 28,144 in the ORF8 region. Furthermore, SARS-CoV-2 variations differ in an individual's respiratory tissue (nose, throat, BALF, or sputum), suggesting independent compartmentalization of SARS-CoV-2 populations in patients. The positive selection analysis of the SARS-CoV-2 genome uncovered the positive selected amino acid G251V on ORF3a. Alternative allele frequency spectrum (AAFS) of all variants revealed that ORF8 could bear alternate alleles with high frequency. Overall, the results show the quasispecies' profile of SARS-CoV-2 in the respiratory tract in the first two months after the outbreak.
Keywords:
SARS-CoV-2; Single nucleotide polymorphism (SNP); epitopes; host adaptation; intra-host single nucleotide variation (iSNV); quasispecies; selective pressure.
Publication types
-
Research Support, Non-U.S. Gov't
MeSH terms
-
Adult
-
Aged
-
Aged, 80 and over
-
Alleles
-
COVID-19 / virology
-
Computational Biology
-
Coronavirus Envelope Proteins / chemistry
-
Coronavirus Envelope Proteins / genetics
-
Female
-
Gene Frequency
-
Genome, Viral
-
HEK293 Cells
-
High-Throughput Nucleotide Sequencing
-
Humans
-
Male
-
Middle Aged
-
Phylogeny*
-
Polymorphism, Single Nucleotide*
-
Quasispecies*
-
SARS-CoV-2 / classification*
-
SARS-CoV-2 / genetics*
-
Severity of Illness Index
-
Young Adult
Substances
-
Coronavirus Envelope Proteins
Grants and funding
This work was supported by the Central Charity Fund of Chinese Academy of Medical Science [2020-PT310-009];Guangdong Basic and Applied Basic Research Foundation [2019A1515110643]; National Science Fund for Distinguished Young Scholars [82025022]; Shenzhen Science and Technology Innovation Committee [2020A1111350032, JCYJ20190809112205541]; the emergency grants for prevention and control of SARS-CoV-2 of Guangdong province [2020B111108001, 2020B1111340036, 2020B1111340029]; the China Postdoctoral Science Foundation [2019M660837].