Fast and flexible linear mixed models for genome-wide genetics

PLoS Genet. 2019 Feb 8;15(2):e1007978. doi: 10.1371/journal.pgen.1007978. eCollection 2019 Feb.

Abstract

Linear mixed effect models are powerful tools used to account for population structure in genome-wide association studies (GWASs) and estimate the genetic architecture of complex traits. However, fully-specified models are computationally demanding and common simplifications often lead to reduced power or biased inference. We describe Grid-LMM (https://github.com/deruncie/GridLMM), an extendable algorithm for repeatedly fitting complex linear models that account for multiple sources of heterogeneity, such as additive and non-additive genetic variance, spatial heterogeneity, and genotype-environment interactions. Grid-LMM can compute approximate (yet highly accurate) frequentist test statistics or Bayesian posterior summaries at a genome-wide scale in a fraction of the time compared to existing general-purpose methods. We apply Grid-LMM to two types of quantitative genetic analyses. The first is focused on accounting for spatial variability and non-additive genetic variance while scanning for QTL; and the second aims to identify gene expression traits affected by non-additive genetic variation. In both cases, modeling multiple sources of heterogeneity leads to new discoveries.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Animals
  • Arabidopsis / genetics
  • Arabidopsis / growth & development
  • Bayes Theorem
  • Body Weight / genetics
  • Computer Simulation
  • Flowers / genetics
  • Flowers / growth & development
  • Gene-Environment Interaction
  • Genetic Markers
  • Genetic Variation
  • Genome-Wide Association Study / statistics & numerical data
  • Humans
  • Linear Models*
  • Mice
  • Models, Genetic*
  • Quantitative Trait Loci

Substances

  • Genetic Markers