Inferring disease architecture and predictive ability with LDpred2-auto

Florian Privé; Clara Albiñana; Julyan Arbel; Bogdan Pasaniuc; Bjarni J Vilhjálmsson

doi:10.1016/j.ajhg.2023.10.010

Am J Hum Genet. 2023 Dec 7;110(12):2042-2055. doi: 10.1016/j.ajhg.2023.10.010. Epub 2023 Nov 8.

Authors

Florian Privé¹, Clara Albiñana², Julyan Arbel³, Bogdan Pasaniuc⁴, Bjarni J Vilhjálmsson⁵

Affiliations

¹ National Centre for Register-based Research, Aarhus University, Aarhus, Denmark. Electronic address: [email protected].
² National Centre for Register-based Research, Aarhus University, Aarhus, Denmark.
³ University Grenoble Alpes, Inria, CNRS, Grenoble INP, LJK, Grenoble, France.
⁴ Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, USA; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA; Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA; Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA.
⁵ National Centre for Register-based Research, Aarhus University, Aarhus, Denmark; Bioinformatics Research Centre, Aarhus University, Aarhus, Denmark; Novo Nordisk Foundation Center for Genomic Mechanisms of Disease, Broad Institute, Cambridge, MA, USA.

Abstract

LDpred2 is a widely used Bayesian method for building polygenic scores (PGSs). LDpred2-auto can infer the two parameters from the LDpred model, the SNP heritability h² and polygenicity p, so that it does not require an additional validation dataset to choose best-performing parameters. The main aim of this paper is to properly validate the use of LDpred2-auto for inferring multiple genetic parameters. Here, we present a new version of LDpred2-auto that adds an optional third parameter α to its model, for modeling negative selection. We then validate the inference of these three parameters (or two, when using the previous model). We also show that LDpred2-auto provides per-variant probabilities of being causal that are well calibrated and can therefore be used for fine-mapping purposes. We also introduce a formula to infer the out-of-sample predictive performance r² of the resulting PGS directly from the Gibbs sampler of LDpred2-auto. Finally, we extend the set of HapMap3 variants recommended to use with LDpred2 with 37% more variants to improve the coverage of this set, and we show that this new set of variants captures 12% more heritability and provides 6% more predictive performance, on average, in UK Biobank analyses.
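As a rough illustration of what the two core parameters mean, the sketch below simulates effect sizes under a point-normal prior, which is our understanding of the LDpred model rather than something spelled out in this abstract: each variant is causal with probability p, and causal effects on standardized genotypes are drawn from a normal distribution with variance h²/(M·p), where M is the number of variants. The function name `simulate_effects` and all numeric values are hypothetical. The sketch ignores linkage disequilibrium and the optional α parameter, and it performs no inference, so it is not the LDpred2-auto Gibbs sampler.

```python
import random


def simulate_effects(num_variants, h2, p, seed=0):
    """Draw standardized effect sizes from a point-normal prior.

    Assumption (not stated in the abstract): each variant is causal with
    probability p, and causal effects follow N(0, h2 / (num_variants * p)),
    so the expected sum of squared effects equals h2 when LD is ignored.
    """
    rng = random.Random(seed)
    slab_sd = (h2 / (num_variants * p)) ** 0.5
    return [
        rng.gauss(0.0, slab_sd) if rng.random() < p else 0.0
        for _ in range(num_variants)
    ]


# Hypothetical values: 100,000 variants, h2 = 0.3, 1% of variants causal.
effects = simulate_effects(num_variants=100_000, h2=0.3, p=0.01)
num_causal = sum(b != 0.0 for b in effects)
total_var = sum(b * b for b in effects)

print(f"proportion causal: {num_causal / len(effects):.4f}")
print(f"sum of squared effects: {total_var:.3f}")
```

Under these assumptions, the proportion of nonzero effects should be close to p and the sum of squared effects close to h², which is the sense in which p describes polygenicity and h² describes the variance explained by the variants.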

Keywords: LDpred2; inference.

Publication types

Research Support, Non-U.S. Gov't
Research Support, N.I.H., Extramural

MeSH terms

Bayes Theorem
Genome-Wide Association Study* / methods
Humans
Multifactorial Inheritance* / genetics
Polymorphism, Single Nucleotide / genetics
