Detecting genome wide haplotype sharing using SNP or microsatellite haplotype data

Hum Genet. 2006 Mar;119(1-2):38-50. doi: 10.1007/s00439-005-0114-9. Epub 2005 Dec 14.

Abstract

Genome wide association studies using high throughput technology are already being conducted despite the significant hurdles that need to be overcome (Nat Rev Genet 6:95-108, 2005; Nat Rev Genet 6:109-118, 2005). Methods for detecting haplotype association signals in genome wide haplotype datasets are as yet very limited. Much methodological research has already been devoted to linkage disequilibrium (LD) fine mapping where the focus is the identification of the disease locus rather than the detection of a disease signal. Applications of these approaches to genome wide scanning are limited by the strong model assumptions of the sharing process, which lead to computational complexity. We describe a new algorithm for the initial identification of disease susceptibility loci in genome wide haplotype association studies. Excess sharing of ancestral haplotypes, which indicates the presence of a disease locus, is detected with a simple, easy to interpret, chi2 based statistic. The method allows genome wide scanning for qualitative traits within reasonable computational timeframes and can serve as a first pass analysis prior to the usage of likelihood based methods, providing candidate regions and inferred susceptibility haplotypes. Our method makes no assumptions regarding the population history or the pattern of background LD. Statistical significance is evaluated with permutation tests. The method is illustrated on simulated and real data where it is applied to simple (cystic fibrosis) and complex disease (multiple sclerosis) examples. The statistic has low type I error and greater power to map disease loci over conventional single marker tests for low to moderate levels of LD.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Chromosome Mapping
  • Computational Biology / methods
  • Computer Simulation
  • Cystic Fibrosis / genetics
  • Genome, Human*
  • Haplotypes / genetics*
  • Humans
  • Linkage Disequilibrium
  • Microsatellite Repeats / genetics*
  • Polymorphism, Single Nucleotide*
  • Reproducibility of Results