Pervasive polygenicity of complex traits inflates false positive rates in transcriptome-wide association studies

bioRxiv [Preprint]. 2024 Nov 15:2023.10.17.562831. doi: 10.1101/2023.10.17.562831.

Abstract

Transcription-wide association studies (TWAS) and related methods (xWAS) have been widely adopted in genetic studies to understand molecular traits as mediators between genetic variation and disease. However, the effect of polygenicity on the validity of these mediator-trait association tests has largely been overlooked. Given the widespread polygenicity of complex traits, it is necessary to assess the validity and accuracy of these mediator-trait association tests. We found that for highly polygenic target traits, the standard test based on linear regression is inflated, leading to greatly increased false positives rates, especially in large sample sizes. Here, we show the extent of the inflation as a function of the underlying GWAS sample size and polygenic heritability of the target trait. To address this inflation, we propose an effective variance control method, similar to genomic control, but which allows for a different correction factor for each gene. Using simulated and real data, as well as theoretical derivations, we show that our method yields calibrated false positive rates, outperforming existing approaches. We further demonstrate that methods analogous to TWAS that associate genetic predictors of mediating traits with target traits suffer from similar inflation issues. We advise developers of genetic predictors for molecular traits (including polygenic risk scores, PRS) to compute and provide the necessary inflation parameters to ensure proper false positive control. Finally, we have updated our PrediXcan software package and resources to facilitate this correction for end users.

Publication types

  • Preprint