Microarrays to characterize single nucleotide polymorphisms (SNPs) provide a cost-effective and rapid method (under 24h) to genotype microbes as an alternative to sequencing. We developed a pipeline for SNP discovery and microarray design that scales to 100's of microbial genomes. Here we tested various SNP probe design strategies against 8 sequenced isolates of Bacillus anthracis to compare sequence and microarray data. The best strategy allowed probe length to vary within 32-40 bp to equalize hybridization free energy. This strategy resulted in a call rate of 99.52% and concordance rate of 99.86% for finished genomes. Other probe design strategies averaged substantially lower call rates (94.65-96.41%) and slightly lower concordance rates (99.64-99.80%). These rates were lower for draft than finished genomes, consistent with higher incidence of sequencing errors and gaps. Highly accurate SNP calls were possible in complex soil and blood backgrounds down to 1000 copies, and moderately accurate SNP calls down to 100 spiked copies. The closest genome to the spiked strain was correctly identified at only 10 spiked copies. Discrepancies between sequence and array data did not alter the SNP-based phylogeny, regardless of the probe design strategy, indicating that SNP arrays can accurately place unsequenced isolates on a phylogeny.
Keywords: Bacillus anthracis; Microbial genotyping; Probe design optimization; SNP detection; SNP microarrays; Strain typing.
© 2013. Published by Elsevier B.V. All rights reserved.