HapBoost: a fast approach to boosting haplotype association analyses in genome-wide association studies

IEEE/ACM Trans Comput Biol Bioinform. 2013 Jan-Feb;10(1):207-12. doi: 10.1109/TCBB.2013.6.

Abstract

Genome-wide association study (GWAS) has been successful in identifying genetic variants that are associated with complex human diseases. In GWAS, multilocus association analyses through linkage disequilibrium (LD), named haplotype-based analyses, may have greater power than single-locus analyses for detecting disease susceptibility loci. However, the large number of SNPs genotyped in GWAS poses great computational challenges in the detection of haplotype associations. We present a fast method named HapBoost for finding haplotype associations, which can be applied to quickly screen the whole genome. The effectiveness of HapBoost is demonstrated by using both synthetic and real data sets. The experimental results show that the proposed approach can achieve comparably accurate results while it performs much faster than existing methods.

MeSH terms

  • Cluster Analysis
  • Computational Biology / methods*
  • Databases, Genetic
  • Disease / genetics
  • Genome, Human
  • Genome-Wide Association Study / methods*
  • Haplotypes*
  • Humans
  • Linkage Disequilibrium
  • Markov Chains
  • Polymorphism, Single Nucleotide
  • ROC Curve
  • Software*