All Your Base: a fast and accurate probabilistic approach to base calling

Genome Biol. 2012 Feb 29;13(2):R13. doi: 10.1186/gb-2012-13-2-r13.

Abstract

The accuracy of base calls produced by Illumina sequencers is adversely affected by several processes, with laser cross-talk and cluster phasing being prominent. We introduce an explicit statistical model of the sequencing process that generalizes current models of phasing and cross-talk and forms the basis of a base calling method which improves on the best existing base callers, especially when comparing the number of error-free reads. The novel algorithms implemented in All Your Base (AYB) are comparable in speed to other competitive base-calling methods, do not require training data and are designed to be robust to gross errors, producing sensible results where other techniques struggle. AYB is available at http://www.ebi.ac.uk/goldman-srv/AYB/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Research Design
  • Software*