Polygenic score distribution differences across European ancestry populations: implications for breast cancer risk prediction

Breast Cancer Res. 2024 Dec 29;26(1):189. doi: 10.1186/s13058-024-01947-x.

Abstract

Background: The 313-variant polygenic risk score (PRS313) provides a promising tool for clinical breast cancer risk prediction. However, evaluation of the PRS313 across different European populations which could influence risk estimation has not been performed.

Methods: We explored the distribution of PRS313 across European populations using genotype data from 94,072 females without breast cancer diagnosis, of European-ancestry from 21 countries participating in the Breast Cancer Association Consortium (BCAC) and 223,316 females without breast cancer diagnosis from the UK Biobank. The mean PRS was calculated by country in the BCAC dataset and by country of birth in the UK Biobank. We explored different approaches to reduce the observed heterogeneity in the mean PRS across the countries, and investigated the implications of the distribution variability in risk prediction.

Results: The mean PRS313 differed markedly across European countries, being highest in individuals from Greece and Italy and lowest in individuals from Ireland. Using the overall European PRS313 distribution to define risk categories, leads to overestimation and underestimation of risk in some individuals from these countries. Adjustment for principal components explained most of the observed heterogeneity in the mean PRS. The mean estimates derived when using an empirical Bayes approach were similar to the predicted means after principal component adjustment.

Conclusions: Our results demonstrate that PRS distribution differs even within European ancestry populations leading to underestimation or overestimation of risk in specific European countries, which could potentially influence clinical management of some individuals if is not appropriately accounted for. Population-specific PRS distributions may be used in breast cancer risk estimation to ensure predicted risks are correctly calibrated across risk categories.

Keywords: Breast cancer; Polygenic risk scores; Risk calibration; Risk prediction.

MeSH terms

  • Breast Neoplasms* / epidemiology
  • Breast Neoplasms* / genetics
  • Europe / epidemiology
  • Female
  • Genetic Predisposition to Disease*
  • Genome-Wide Association Study
  • Genotype
  • Humans
  • Middle Aged
  • Multifactorial Inheritance*
  • Polymorphism, Single Nucleotide
  • Risk Assessment / methods
  • Risk Factors
  • White People* / genetics