The viridans group streptococci (VGS) are a large consortium of commensal streptococci that colonize the human body. Many species within this group are opportunistic pathogens causing bacteremia and infective endocarditis (IE), yet little is known about why some strains cause invasive disease. Identification of virulence determinants is complicated by the difficulty of distinguishing between the closely related species of this group. Here, we analyzed genomic data from VGS that were isolated from blood cultures in patients with invasive infections and oral swabs of healthy volunteers and then determined the best-performing methods for species identification. Using whole-genome sequence data, we characterized the population structure of a diverse sample of Streptococcus oralis isolates and found evidence of frequent recombination. We used multiple genome-wide association study tools to identify candidate determinants of invasiveness. These tools gave consistent results, leading to the discovery of a single synonymous single nucleotide polymorphism (SNP) that was significantly associated with invasiveness. This SNP was within a previously undescribed gene that was conserved across the majority of VGS species. Using the growth in the presence of human serum and a simulated infective endocarditis vegetation model, we were unable to identify a phenotype for the enriched allele in laboratory assays, suggesting a phenotype may be specific to natural infection. These data highlighted the power of analyzing natural populations for gaining insight into pathogenicity, particularly for organisms with complex population structures like the VGS. IMPORTANCE The viridians group streptococci (VGS) are a large collection of closely related commensal streptococci, with many being opportunistic pathogens causing invasive diseases, such as bacteremia and infective endocarditis. Little is known about virulence determinants in these species, and there is a distinct lack of genomic information available for the VGS. In this study, we collected VGS isolates from invasive infections and healthy volunteers and performed whole-genome sequencing for a suite of downstream analyses. We focused on a diverse sample of Streptococcus oralis genomes and identified high rates of recombination in the population as well as a single genome variant highly enriched in invasive isolates. The variant lies within a previously uncharacterized gene, nrdM, which shared homology with the anaerobic ribonucleoside triphosphate reductase, nrdD, and was highly conserved among VGS. This work increased our knowledge of VGS genomics and indicated that differences in virulence potential among S. oralis isolates were, at least in part, genetically determined.
Keywords: Streptococcus; bacteremia; genomics; infective endocarditis.