Correlation-based tests for the formal comparison of polygenic scores in multiple populations

PLoS Genet. 2024 Apr 26;20(4):e1011249. doi: 10.1371/journal.pgen.1011249. eCollection 2024 Apr.

Abstract

Polygenic scores (PGS) are measures of genetic risk, derived from the results of genome wide association studies (GWAS). Previous work has proposed the coefficient of determination (R2) as an appropriate measure by which to compare PGS performance in a validation dataset. Here we propose correlation-based methods for evaluating PGS performance by adapting previous work which produced a statistical framework and robust test statistics for the comparison of multiple correlation measures in multiple populations. This flexible framework can be extended to a wider variety of hypothesis tests than currently available methods. We assess our proposed method in simulation and demonstrate its utility with two examples, assessing previously developed PGS for low-density lipoprotein cholesterol and height in multiple populations in the All of Us cohort. Finally, we provide an R package 'coranova' with both parametric and nonparametric implementations of the described methods.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Body Height / genetics
  • Cholesterol, LDL / blood
  • Cholesterol, LDL / genetics
  • Computer Simulation
  • Genetic Predisposition to Disease
  • Genetics, Population / methods
  • Genome-Wide Association Study* / methods
  • Humans
  • Models, Genetic
  • Multifactorial Inheritance* / genetics
  • Polymorphism, Single Nucleotide / genetics

Substances

  • Cholesterol, LDL