TAGster: efficient selection of LD tag SNPs in single or multiple populations

Bioinformatics. 2007 Dec 1;23(23):3254-5. doi: 10.1093/bioinformatics/btm426. Epub 2007 Sep 7.

Abstract

Genetic association studies increasingly rely on the use of linkage disequilibrium (LD) tag SNPs to reduce genotyping costs. We developed a software package TAGster to select, evaluate and visualize LD tag SNPs both for single and multiple populations. We implement several strategies to improve the efficiency of current LD tag SNP selection algorithms: (1) we modify the tag SNP selection procedure of Carlson et al. to improve selection efficiency and further generalize it to multiple populations. (2) We propose a redundant SNP elimination step to speed up the exhaustive tag SNP search algorithm proposed by Qin et al. (3) We present an additional multiple population tag SNP selection algorithm based on the framework of Howie et al., but using our modified exhaustive search procedure. We evaluate these methods using resequenced candidate gene data from the Environmental Genome Project and show improvements in both computational and tagging efficiency.

Availability: The software Package TAGster is freely available at http://www.niehs.nih.gov/research/resources/software/tagster/

Publication types

  • Research Support, N.I.H., Intramural

MeSH terms

  • Algorithms
  • Base Sequence
  • Chromosome Mapping / methods*
  • Expressed Sequence Tags*
  • Genetic Markers / genetics*
  • Genetics, Population*
  • Linkage Disequilibrium / genetics*
  • Molecular Sequence Data
  • Polymorphism, Single Nucleotide / genetics*
  • Sequence Analysis, DNA / methods*
  • Statistics as Topic

Substances

  • Genetic Markers