A spatial probit model for fine-scale mapping of disease genes

Genet Epidemiol. 2007 Apr;31(3):252-60. doi: 10.1002/gepi.20206.

Abstract

We present a novel statistical method for linkage disequilibrium (LD) mapping of disease susceptibility loci in case-control studies. Such studies exploit the statistical correlation or LD that exist between variants physically close along the genome to identify those that correlate with disease status and might thus be close to a causative mutation, generally assumed unobserved. LD structure, however, varies markedly over short distances because of variation in local recombination rates, mutation and genetic drift among other factors. We propose a Bayesian multivariate probit model that flexibly accounts for the local spatial correlation between markers. In a case-control setting, we use a retrospective model that properly reflects the sampling scheme and identify regions where single- or multi-locus marker frequencies differ across cases and controls. We formally quantify these differences using information-theoretic distance measures while the fully Bayesian approach naturally accommodates unphased or missing genotype data. We demonstrate our approach on simulated data and on real data from the CYP2D6 region that has a confirmed role in drug metabolism.

MeSH terms

  • Bayes Theorem*
  • Case-Control Studies
  • Chromosome Mapping / methods*
  • Chromosomes, Human, Pair 22*
  • Cytochrome P-450 CYP2D6 / genetics*
  • Gene Frequency
  • Genotype
  • Haplotypes / genetics
  • Humans
  • Inactivation, Metabolic / genetics*
  • Linkage Disequilibrium / genetics*
  • Models, Genetic*

Substances

  • Cytochrome P-450 CYP2D6