Fast and flexible linear mixed models for genome-wide genetics

Daniel E Runcie; Lorin Crawford

doi:10.1371/journal.pgen.1007978

Fast and flexible linear mixed models for genome-wide genetics

PLoS Genet. 2019 Feb 8;15(2):e1007978. doi: 10.1371/journal.pgen.1007978. eCollection 2019 Feb.

Authors

Daniel E Runcie¹, Lorin Crawford²

Affiliations

¹ Department of Plant Sciences, University of California Davis, Davis, California, United States of America.
² Department of Biostatistics, Brown University, Providence, Rhode Island, United States of America.

Abstract

Linear mixed effect models are powerful tools used to account for population structure in genome-wide association studies (GWASs) and estimate the genetic architecture of complex traits. However, fully-specified models are computationally demanding and common simplifications often lead to reduced power or biased inference. We describe Grid-LMM (https://github.com/deruncie/GridLMM), an extendable algorithm for repeatedly fitting complex linear models that account for multiple sources of heterogeneity, such as additive and non-additive genetic variance, spatial heterogeneity, and genotype-environment interactions. Grid-LMM can compute approximate (yet highly accurate) frequentist test statistics or Bayesian posterior summaries at a genome-wide scale in a fraction of the time compared to existing general-purpose methods. We apply Grid-LMM to two types of quantitative genetic analyses. The first is focused on accounting for spatial variability and non-additive genetic variance while scanning for QTL; and the second aims to identify gene expression traits affected by non-additive genetic variation. In both cases, modeling multiple sources of heterogeneity leads to new discoveries.

Publication types

Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Algorithms*
Animals
Arabidopsis / genetics
Arabidopsis / growth & development
Bayes Theorem
Body Weight / genetics
Computer Simulation
Flowers / genetics
Flowers / growth & development
Gene-Environment Interaction
Genetic Markers
Genetic Variation
Genome-Wide Association Study / statistics & numerical data
Humans
Linear Models*
Mice
Models, Genetic*
Quantitative Trait Loci

Substances

Genetic Markers

Grants and funding

P20 GM109035/GM/NIGMS NIH HHS/United States