OMSim: a simulator for optical map data

Bioinformatics. 2017 Sep 1;33(17):2740-2742. doi: 10.1093/bioinformatics/btx293.

Abstract

Motivation: The Bionano Genomics platform allows for the optical detection of short sequence patterns in very long DNA molecules (up to 2.5 Mbp). Molecules with overlapping patterns can be assembled to generate a consensus optical map of the entire genome. In turn, these optical maps can be used to validate or improve de novo genome assembly projects or to detect large-scale structural variation in genomes. Simulated optical map data can assist in the development and benchmarking of tools that operate on those data, such as alignment and assembly software. Additionally, it can help to optimize the experimental setup for a genome of interest. Such a simulator is currently not available.

Results: We have developed a simulator, OMSim, that produces synthetic optical map data that mimics real Bionano Genomics data. These simulated data have been tested for compatibility with the Bionano Genomics Irys software system and the Irys-scaffolding scripts. OMSim is capable of handling very large genomes (over 30 Gbp) with high throughput and low memory requirements.

Availability and implementation: The Python simulation tool and a cross-platform graphical user interface are available as open source software under the GNU GPL v2 license ( http://www.bioinformatics.intec.ugent.be/omsim ).

Contact: [email protected].

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Genome, Human*
  • Genomics / methods
  • Humans
  • Sequence Analysis, DNA / methods*
  • Software*