Statistical inference for nanopore sequencing with a biased random walk model

Biophys J. 2015 Apr 21;108(8):1852-5. doi: 10.1016/j.bpj.2015.03.013.

Abstract

Nanopore sequencing promises long read-lengths and single-molecule resolution, but the stochastic motion of the DNA molecule inside the pore is, as of this writing, a barrier to high accuracy reads. We develop a method of statistical inference that explicitly accounts for this error, and demonstrate that high accuracy (>99%) sequence inference is feasible even under highly diffusive motion by using a hidden Markov model to jointly analyze multiple stochastic reads. Using this model, we place bounds on achievable inference accuracy under a range of experimental parameters.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA / chemistry*
  • Models, Statistical*
  • Nanopores*
  • Sequence Analysis, DNA / methods*

Substances

  • DNA