Causal graphs for the analysis of genetic cohort data

Physiol Genomics. 2020 Sep 1;52(9):369-378. doi: 10.1152/physiolgenomics.00115.2019. Epub 2020 Jul 20.

Abstract

The increasing availability of genetic cohort data has led to many genome-wide association studies (GWAS) successfully identifying genetic associations with an ever-expanding list of phenotypic traits. Association, however, does not imply causation, and therefore methods have been developed to study the issue of causality. Under additional assumptions, Mendelian randomization (MR) studies have proved popular in identifying causal effects between two phenotypes, often using GWAS summary statistics. Given the widespread use of these methods, it is more important than ever to understand, and communicate, the causal assumptions upon which they are based, so that methods are transparent, and findings are clinically relevant. Causal graphs can be used to represent causal assumptions graphically and provide insights into the limitations associated with different analysis methods. Here we review GWAS and MR from a causal perspective, to build up intuition for causal diagrams in genetic problems. We also examine issues of confounding by ancestry and comment on approaches for dealing with such confounding, as well as discussing approaches for dealing with selection biases arising from study design.

Keywords: GWAS; Mendelian randomisation; causal graphs.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Causality
  • Cohort Studies
  • Genetic Variation
  • Genome-Wide Association Study / methods*
  • Humans
  • Mendelian Randomization Analysis / methods*
  • Models, Statistical
  • Neoplasms / genetics*
  • Phenotype