Confounding and heterogeneity in genetic association studies with admixed populations

Am J Epidemiol. 2013 Feb 15;177(4):351-60. doi: 10.1093/aje/kws234. Epub 2013 Jan 18.

Abstract

Association studies among admixed populations pose many challenges including confounding of genetic effects due to population substructure and heterogeneity due to different patterns of linkage disequilibrium (LD). We use simulations to investigate controlling for confounding by indicators of global ancestry and the impact of including a covariate for local ancestry. In addition, we investigate the use of an interaction term between a single-nucleotide polymorphism (SNP) and local ancestry to capture heterogeneity in SNP effects. Although adjustment for global ancestry can control for confounding, additional adjustment for local ancestry may increase power when the induced admixture LD is in the opposite direction as the LD in the ancestral population. However, if the induced LD is in the same direction, there is the potential for reduced power because of overadjustment. Furthermore, the inclusion of a SNP by local ancestry interaction term can increase power when there is substantial differential LD between ancestry populations. We examine these approaches in genome-wide data using the University of Southern California's Children's Health Study investigating asthma risk. The analysis highlights rs10519951 (P = 8.5 × 10(-7)), a SNP lacking any evidence of association from a conventional analysis (P = 0.5).

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural

MeSH terms

  • Alleles
  • Asthma / epidemiology
  • Asthma / genetics*
  • California / epidemiology
  • Case-Control Studies
  • Child
  • Computer Simulation
  • Confounding Factors, Epidemiologic*
  • Genetic Heterogeneity*
  • Genetic Predisposition to Disease
  • Genome, Human
  • Genome-Wide Association Study* / methods
  • Genotype
  • Hispanic or Latino / genetics*
  • Hispanic or Latino / statistics & numerical data
  • Humans
  • Linear Models
  • Linkage Disequilibrium
  • Logistic Models
  • Mathematical Computing
  • Multivariate Analysis
  • Polymorphism, Single Nucleotide*
  • White People / genetics*
  • White People / statistics & numerical data