MSQT for choosing SNP assays from multiple DNA alignments

Bioinformatics. 2007 Oct 15;23(20):2784-7. doi: 10.1093/bioinformatics/btm428. Epub 2007 Sep 4.

Abstract

Motivation: One challenging aspect of genotyping and association mapping projects is often the identification of markers that are informative between groups of individuals and to convert these into genotyping assays.

Results: The Multiple SNP Query Tool (MSQT) extracts SNP information from multiple sequence alignments, stores it in a database, provides a web interface to query the database and outputs SNP information in a format directly applicable for SNP-assay design. MSQT was applied to Arabidopsis thaliana sequence data to develop SNP genotyping assays that distinguish a recurrent parent (Col-0) from five other strains. SNPs with intermediate allele frequencies were also identified and developed into markers suitable for efficient genetic mapping among random pairs of wild strains.

Availability: The source code for MSQT is available at http://msqt.weigelworld.org, together with an online instance of MSQT containing data on 1214 sequenced fragments from 96 ecotypes (wild inbred strains) of the reference plant A. thaliana. All SNP genotyping assays are available in several formats for broad community use.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Chromosome Mapping / methods
  • Database Management Systems
  • Databases, Genetic*
  • Genetic Markers / genetics
  • Information Storage and Retrieval / methods
  • Molecular Sequence Data
  • Polymorphism, Single Nucleotide / genetics*
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*
  • Software*
  • User-Computer Interface*

Substances

  • Genetic Markers