AmpliVar: mutation detection in high-throughput sequence from amplicon-based libraries

Hum Mutat. 2015 Apr;36(4):411-8. doi: 10.1002/humu.22763. Epub 2015 Mar 16.

Abstract

Conventional means of identifying variants in high-throughput sequencing align each read against a reference sequence, and then call variants at each position. Here, we demonstrate an orthogonal means of identifying sequence variation by grouping the reads as amplicons prior to any alignment. We used AmpliVar to make key-value hashes of sequence reads and group reads as individual amplicons using a table of flanking sequences. Low-abundance reads were removed according to a selectable threshold, and reads above this threshold were aligned as groups, rather than as individual reads, permitting the use of sensitive alignment tools. We show that this approach is more sensitive, more specific, and more computationally efficient than comparable methods for the analysis of amplicon-based high-throughput sequencing data. The method can be extended to enable alignment-free confirmation of variants seen in hybridization capture target-enrichment data.

Keywords: amplicon sequencing; grouped reads; mutation detection; next generation sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods
  • DNA Mutational Analysis / methods*
  • Gene Library
  • Genetic Variation
  • Genomics / methods*
  • Genotyping Techniques
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Internet
  • Mutation
  • Nucleic Acid Amplification Techniques
  • Software*