Machine learning derived retinal pigment score from ophthalmic imaging shows ethnicity is not biology

Nat Commun. 2025 Jan 2;16(1):60. doi: 10.1038/s41467-024-55198-7.

Abstract

Few metrics exist to describe phenotypic diversity within ophthalmic imaging datasets, and researchers often use ethnicity as a surrogate marker for biological variability. We derived a continuous, measured metric, the retinal pigment score (RPS), that quantifies the degree of pigmentation from a colour fundus photograph of the eye. RPS was validated using two large epidemiological studies with demographic and genetic data (UK Biobank and EPIC-Norfolk Study) and reproduced in a Tanzanian, an Australian, and a Chinese dataset. A genome-wide association study (GWAS) of RPS from UK Biobank identified 20 loci with known associations with skin, iris and hair pigmentation, of which eight were replicated in the EPIC-Norfolk cohort. RPS was strongly associated with ethnicity; however, the RPS distributions of the different ethnicities overlapped substantially. RPS therefore decouples traditional demographic variables from clinical imaging characteristics. RPS may serve as a useful metric to quantify the diversity of the training, validation, and testing datasets used in the development of AI algorithms, helping to ensure adequate inclusion and explainability of model performance, which is critical in evaluating all currently deployed AI models. The code to derive RPS is publicly available at: https://github.com/uw-biomedical-ml/retinal-pigmentation-score
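
The finding that a continuous score is associated with a grouping variable while the groups' score distributions still overlap can be illustrated with a simple overlap coefficient. The following is a minimal sketch only, not the authors' analysis: the function name `overlap_coefficient`, the histogram-based estimate, and the toy values in the usage lines are all assumptions made for illustration and are not taken from the paper or its repository.

```python
from typing import List, Sequence


def overlap_coefficient(a: Sequence[float], b: Sequence[float], bins: int = 20) -> float:
    """Histogram-based overlap of two samples (hypothetical helper).

    Returns a value between 0 (no shared bins) and 1 (identical histograms).
    """
    lo = min(min(a), min(b))
    hi = max(max(a), max(b))
    if hi == lo:
        # Every value is identical, so the two samples coincide completely.
        return 1.0
    width = (hi - lo) / bins

    def normalised_hist(xs: Sequence[float]) -> List[float]:
        counts = [0] * bins
        for x in xs:
            # Clamp the maximum value into the last bin.
            idx = min(int((x - lo) / width), bins - 1)
            counts[idx] += 1
        return [c / len(xs) for c in counts]

    ha = normalised_hist(a)
    hb = normalised_hist(b)
    # Shared probability mass: sum of the bin-wise minimum of the two histograms.
    return sum(min(p, q) for p, q in zip(ha, hb))


# Toy, made-up scores for two hypothetical groups (not real RPS values).
group_a = [0.0, 1.0, 2.0, 3.0]
group_b = [2.0, 3.0, 4.0, 5.0]
shared = overlap_coefficient(group_a, group_b, bins=6)
```

In this toy example the group means differ, yet half of the histogram mass is shared. This is the kind of pattern the abstract describes: a group label can be associated with a measured score without determining it.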

MeSH terms

  • Adult
  • Aged
  • Ethnicity / genetics
  • Female
  • Genome-Wide Association Study*
  • Humans
  • Machine Learning*
  • Male
  • Middle Aged
  • Photography
  • Retina / diagnostic imaging
  • Retina / metabolism
  • UK Biobank