Detecting local high-scoring segments: a first-stage approach for genome-wide association studies

Stat Appl Genet Mol Biol. 2006:5:Article22. doi: 10.2202/1544-6115.1192. Epub 2006 Sep 17.

Abstract

Genetic epidemiology aims to identify the biological mechanisms responsible for human diseases. Genome-wide association studies, made possible by recent improvements in genotyping technologies, are now a promising line of investigation. In these studies, common first-stage strategies focus on marginal effects, but they raise multiple-testing problems and cannot capture the possibly complex interplay between genetic factors. We have adapted the local score statistic, which has already been applied successfully to the analysis of long molecular sequences. Via sum statistics, this method captures local and possibly distant dependences between markers. Designed for genome-wide association studies, it is fast to compute, handles large datasets, circumvents the multiple-testing problem, and outlines a set of genomic regions (segments) for further analysis. Applied to simulated and real data, our approach outperforms the classical Bonferroni and FDR corrections for multiple testing. It is implemented in software named LHiSA (Local High-scoring Segments for Association), available at: http://stat.genopole.cnrs.fr/software/lhisa.
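
As a rough illustration of the sum-statistic idea, the sketch below computes a generic local score: the largest sum over any contiguous run of per-marker scores, found in a single pass. It is not the LHiSA implementation. The abstract does not say how per-marker scores are defined or how significance is assessed, so the function name `local_score` and the values in `marker_scores` are hypothetical, and the scores are simply assumed to be negative on average, so that only a run of unusually high values yields a large segment.

```python
from typing import List, Tuple


def local_score(scores: List[float]) -> Tuple[float, int, int]:
    """Return (best_sum, start, end) of the highest-scoring contiguous segment.

    start and end are 0-based inclusive indices into `scores`;
    (0.0, -1, -1) means no segment has a positive sum.
    """
    best, best_start, best_end = 0.0, -1, -1
    running, run_start = 0.0, 0
    for k, x in enumerate(scores):
        running += x
        if running <= 0.0:
            # The running sum is exhausted: restart just after marker k.
            running = 0.0
            run_start = k + 1
        elif running > best:
            best, best_start, best_end = running, run_start, k
    return best, best_start, best_end


# Hypothetical per-marker scores (illustrative values only, not real data).
marker_scores = [-1.0, 0.5, 2.0, -0.5, 1.5, -3.0, 0.5, -1.0]
print(local_score(marker_scores))  # (3.5, 1, 4)
```

Because the running sum is reset whenever it drops to zero or below, the scan is linear in the number of markers, which is consistent with the abstract's claim that the statistic is fast to compute on large datasets.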

MeSH terms

  • Algorithms
  • Computer Simulation
  • Genetic Linkage*
  • Genetic Predisposition to Disease
  • Genome, Human*
  • Humans
  • Lod Score*
  • Models, Genetic
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci
  • Research Design* / statistics & numerical data
  • Schizophrenia / epidemiology