Refined annotation of the complete genome of the phytopathogenic and xanthan producing Xanthomonas campestris pv. campestris strain B100 based on RNA sequence data

J Biotechnol. 2017 Jul 10;253:55-61. doi: 10.1016/j.jbiotec.2017.05.009. Epub 2017 May 12.

Abstract

Bioinformatics tools and gene expression data were used to identify new genes and to improve the accuracy of genomic feature predictions for Xanthomonas campestris pv. campestris (Xcc) B100, a pathogen of cruciferous plants and a model strain for the biosynthesis of xanthan, a polysaccharide with many commercial applications as a thickening agent. The analysis drew on results from 5'-end-enriched RNA sequencing (RNA-seq) and total transcriptome RNA-seq experiments. Functional gene annotations were updated where new evidence had emerged, and start codon predictions were improved for 153 protein-coding genes (CDS). In total, 32 novel CDS and 176 novel RNA genes and features were predicted, among them 77 isogenes of the small non-coding RNA sX9. Furthermore, the RNA-seq data enabled the identification of 848 operons comprising a total of 2551 CDS, in addition to 1667 CDS that were monocistronically expressed.

Keywords: Improved genome annotation; New CDS predictions; Non-coding RNA analysis; Operon analysis; Small RNA identification.

MeSH terms

  • Bacterial Proteins / genetics
  • Genes, Bacterial
  • Genome, Bacterial*
  • Polysaccharides, Bacterial / biosynthesis
  • Sequence Analysis, RNA
  • Xanthomonas campestris / genetics*
  • Xanthomonas campestris / metabolism

Substances

  • Bacterial Proteins
  • Polysaccharides, Bacterial
  • xanthan gum