Bioinformatics tools and gene expression data were applied to identify new genes and to enhance the accuracy in genomic feature predictions for Xanthomonas campestris pv. campestris (Xcc) B100, a pathogen of cruciferous plants and model strain for the biosynthesis of xanthan, a polysaccharide with a multitude of commercial applications as a thickening agent. Results from 5'-enriched end RNA sequencing (RNA-seq) and total transcriptome RNA-seq experiments were used for this purpose. Functional gene annotations were updated where new evidence had emerged and start codon predictions were enhanced for 153 protein-coding genes (CDS). In total, 32 novel CDS, and 176 novel RNA genes and features were predicted, among them 77 isogenes of the small non-coding RNA sX9. Furthermore, the RNA-seq data facilitated the identification of 848 operons that included a total of 2551 CDS besides 1667 CDS that were mono-cistronically expressed.
Keywords: Improved genome annotation; New CDS predictions; Non-coding RNA analysis; Operon analysis; Small RNA identification.
Copyright © 2017 Elsevier B.V. All rights reserved.