OMBlast: alignment tool for optical mapping using a seed-and-extend approach

Bioinformatics. 2017 Feb 1;33(3):311-319. doi: 10.1093/bioinformatics/btw620.

Abstract

Background: Optical mapping is a technique for capturing fluorescent signal patterns of long DNA molecules (in the range of 0.1–1 Mbp). Recently, it has been complementing the widely used short-read sequencing technology by assisting with scaffolding and detecting large and complex structural variations (SVs). Here, we introduce a fast, robust and accurate tool called OMBlast for aligning optical maps, the set of signal locations on the molecules generated from optical mapping. Our method is based on the seed-and-extend approach from sequence alignment, with modifications specific to optical mapping.

Results: Experiments with both synthetic and our real data demonstrate that OMBlast has higher accuracy and faster mapping speed than existing alignment methods. Our tool also shows significant improvement when aligning data with SVs.

Availability and implementation: OMBlast is implemented for Java 1.7 and is released under a GPL license. OMBlast can be downloaded from https://github.com/aldenleung/OMBlast and run directly on machines equipped with a Java virtual machine.

Contact: [email protected] and [email protected]

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Animals
  • Caenorhabditis elegans / genetics
  • Escherichia coli / genetics
  • Genome*
  • Genomics / methods
  • Humans
  • Optical Restriction Mapping / methods*
  • Saccharomyces cerevisiae / genetics
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*
  • Software*