Accelerated gene counting for haplotype frequency estimation

Ann Hum Genet. 2003 Nov;67(Pt 6):608-12. doi: 10.1046/j.1529-8817.2003.00054.x.

Abstract

Current implementations of the EM algorithm for estimating haplotype frequencies from genotypes on proximal loci require computational resources that grow as nh2k, where n is the number of individuals genotyped and h is the number of haplotypes possible on k loci. For diallelic loci hk=2k. We present an approach whose computational requirement grows as n2t where t is the largest number of loci at which an individual in the sample is heterozygous. The method is illustrated by haplotype frequency estimation from a sample of 45 individuals genotyped at 26 single nucleotide polymorphisms in the PIK3R1 gene.

Publication types

  • Comparative Study

MeSH terms

  • 1-Phosphatidylinositol 4-Kinase / genetics
  • Algorithms
  • Base Sequence
  • Computational Biology*
  • Gene Frequency
  • Genetic Markers
  • Genotype*
  • Haplotypes / genetics*
  • Humans
  • Likelihood Functions
  • Polymorphism, Genetic
  • White People

Substances

  • Genetic Markers
  • 1-Phosphatidylinositol 4-Kinase