Genomic analysis of phylogroup D Escherichia coli strains using novel de-novo reference-based guided assembly

Sci Data. 2023 Sep 1;10(1):573. doi: 10.1038/s41597-023-02444-0.

Abstract

Escherichia coli are highly diverse bacteria with different pathogenic types, serotypes and phylogenetic types/phylotypes. In recent years, infections with E. coli have increased worldwide and so has the emergence of antibiotic resistant strains. In the present study we have assembled, annotated and analysed genome sequences of three strains of the phylogroup D of E. coli. These strains were isolated from the river Yamuna, a prominent anthropogenic urban river of northern India. These strains showed varied antibiotic susceptibilities, one was susceptible to all the antibiotics tested except ampicillin while of the other two, one was multi-β-lactam resistant and the other was multi-drug resistant (resistant to multiple β-lactams, fluoroquinolones and kanamycin). The short-sequence reads were assembled into contigs using the de-novo approach and further, scaffolding of contigs was performed by using the best reference genome for a particular isolate which resulted in a significant increase in the N50 value of each assembly. The bioinformatics assembly approach used in this study could be easily applied to study other bacterial genomes.

Publication types

  • Dataset

MeSH terms

  • Anti-Bacterial Agents
  • Computational Biology
  • Escherichia coli*
  • Genomics*
  • Phylogeny

Substances

  • Anti-Bacterial Agents