AmpliVar: mutation detection in high-throughput sequence from amplicon-based libraries

Arthur L Hsu; Olga Kondrashova; Sebastian Lunke; Clare J Love; Cliff Meldrum; Renate Marquis-Nicholson; Greg Corboy; Kym Pham; Matthew Wakefield; Paul M Waring; Graham R Taylor

doi:10.1002/humu.22763

AmpliVar: mutation detection in high-throughput sequence from amplicon-based libraries

Hum Mutat. 2015 Apr;36(4):411-8. doi: 10.1002/humu.22763. Epub 2015 Mar 16.

Authors

Arthur L Hsu¹, Olga Kondrashova, Sebastian Lunke, Clare J Love, Cliff Meldrum, Renate Marquis-Nicholson, Greg Corboy, Kym Pham, Matthew Wakefield, Paul M Waring, Graham R Taylor

Affiliation

¹ Department of Pathology, The University of Melbourne, Parkville, Victoria, Australia.

PMID: 25664426
DOI: 10.1002/humu.22763

Abstract

Conventional means of identifying variants in high-throughput sequencing align each read against a reference sequence, and then call variants at each position. Here, we demonstrate an orthogonal means of identifying sequence variation by grouping the reads as amplicons prior to any alignment. We used AmpliVar to make key-value hashes of sequence reads and group reads as individual amplicons using a table of flanking sequences. Low-abundance reads were removed according to a selectable threshold, and reads above this threshold were aligned as groups, rather than as individual reads, permitting the use of sensitive alignment tools. We show that this approach is more sensitive, more specific, and more computationally efficient than comparable methods for the analysis of amplicon-based high-throughput sequencing data. The method can be extended to enable alignment-free confirmation of variants seen in hybridization capture target-enrichment data.

Keywords: amplicon sequencing; grouped reads; mutation detection; next generation sequencing.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Computational Biology / methods
DNA Mutational Analysis / methods*
Gene Library
Genetic Variation
Genomics / methods*
Genotyping Techniques
High-Throughput Nucleotide Sequencing
Humans
Internet
Mutation
Nucleic Acid Amplification Techniques
Software*