SNP allele frequency estimation in DNA pools and variance components analysis

Biotechniques. 2004 May;36(5):840-5. doi: 10.2144/04365RR01.

Abstract

The estimation of single nucleotide polymorphism (SNP) allele frequency in pooled DNA samples has been proposed as a cost-effective approach to whole genome association studies. However, the key issue is the allele frequency window in which a genotyping method operates and provides a statistically reliable answer. We assessed the homogeneous mass extend assay and estimated the variance associated with each experimental stage. We report that a relationship between estimated allele frequency and variance might exist, suggesting that high statistical power can be retained at low, as well as high, allele frequencies. Assuming this relationship, the formation of subpools consisting of 100 samples retains an effective sample size greater than 70% of the true sample size, with a savings of 11-fold the cost of an individual genotyping study, regardless of allele frequency.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms*
  • DNA / analysis*
  • DNA / chemistry*
  • DNA / genetics
  • Gene Expression Profiling / methods*
  • Gene Frequency / genetics
  • Genetic Variation
  • Humans
  • Models, Genetic*
  • Models, Statistical
  • Polymorphism, Single Nucleotide / genetics*
  • Principal Component Analysis
  • Reproducibility of Results
  • Sample Size
  • Sensitivity and Specificity
  • Sequence Analysis, DNA / methods*

Substances

  • DNA