Tips and tricks for the assembly of a Corynebacterium pseudotuberculosis genome using a semiconductor sequencer

Microb Biotechnol. 2013 Mar;6(2):150-6. doi: 10.1111/1751-7915.12006. Epub 2012 Dec 2.

Abstract

New sequencing platforms have enabled rapid decoding of complete prokaryotic genomes at relatively low cost. The Ion Torrent platform is an example of these technologies, characterized by lower coverage, generating challenges for the genome assembly. One particular problem is the lack of genomes that enable reference-based assembly, such as the one used in the present study, Corynebacterium pseudotuberculosis biovar equi, which causes high economic losses in the US equine industry. The quality treatment strategy incorporated into the assembly pipeline enabled a 16-fold greater use of the sequencing data obtained compared with traditional quality filter approaches. Data preprocessing prior to the de novo assembly enabled the use of known methodologies in the next-generation sequencing data assembly. Moreover, manual curation was proved to be essential for ensuring a quality assembly, which was validated by comparative genomics with other species of the genus Corynebacterium. The present study presents a modus operandi that enables a greater and better use of data obtained from semiconductor sequencing for obtaining the complete genome from a prokaryotic microorganism, C. pseudotuberculosis, which is not a traditional biological model such as Escherichia coli.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Computational Biology / methods*
  • Corynebacterium Infections / microbiology
  • Corynebacterium Infections / veterinary
  • Corynebacterium pseudotuberculosis / genetics*
  • Corynebacterium pseudotuberculosis / isolation & purification
  • DNA, Bacterial / analysis
  • Electrochemical Techniques / instrumentation
  • Electrochemical Techniques / methods
  • Equipment Design
  • Genome, Bacterial / genetics*
  • Genomics / methods
  • Horse Diseases / microbiology
  • Horses
  • Semiconductors*
  • Sequence Analysis, DNA* / instrumentation
  • Sequence Analysis, DNA* / methods
  • Software

Substances

  • DNA, Bacterial