From proteome to pathogenesis: investigating polycystic ovary syndrome with Mendelian randomization analysis

Front Endocrinol (Lausanne). 2024 Sep 9;15:1442483. doi: 10.3389/fendo.2024.1442483. eCollection 2024.

Abstract

Background: Polycystic ovary syndrome (PCOS) is defined by oligo/anovulation, hyperandrogenism, and polycystic ovaries, and its pathogenesis remains uncertain. The proteome represents a substantial source of therapeutic targets, and protein-coding genes may elucidate the mechanisms underlying PCOS. However, reports on the profiles of human plasma protein-coding genes in relation to PCOS are limited. Here, we aimed to investigate novel biomarkers or drug targets for PCOS by integrating genetics and the human plasma proteome.

Methods: We obtained protein quantitative trait loci from deCODE Genetics, covering 4,907 proteins in 35,559 individuals, and PCOS summary statistics from the FinnGen biobank (1,639 cases and 218,970 controls) and the genome-wide association study catalog (797 cases and 140,558 controls). We then sequentially applied two-sample Mendelian randomization (MR) analyses and colocalization to verify the causal link between candidate proteins, their coding genes, and PCOS. Additional PCOS data were downloaded from the Gene Expression Omnibus and Zenodo platforms. Gene expression level analysis, pathway enrichment analysis, immune cell infiltration analysis, and transcription factor prediction were performed to detect specific cell types with enriched expression and to explore potential optimized treatments for PCOS.
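The abstract does not state which two-sample MR estimator was used. As a minimal sketch of the general idea, the example below assumes the widely used Wald ratio (per variant) combined by fixed-effect inverse-variance weighting (IVW). The function names and all numeric inputs are hypothetical illustrations, not values or code from the study.

```python
import math

def wald_ratio(beta_exposure, beta_outcome, se_outcome):
    """Per-variant causal estimate: outcome effect divided by exposure effect.

    The standard error uses the first-order approximation that ignores
    uncertainty in the exposure (pQTL) effect.
    """
    ratio = beta_outcome / beta_exposure
    se = se_outcome / abs(beta_exposure)
    return ratio, se

def ivw_estimate(instruments):
    """Fixed-effect inverse-variance-weighted mean of the Wald ratios.

    `instruments` is a list of (beta_exposure, beta_outcome, se_outcome)
    tuples, one per genetic variant.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for beta_exposure, beta_outcome, se_outcome in instruments:
        ratio, se = wald_ratio(beta_exposure, beta_outcome, se_outcome)
        weight = 1.0 / se ** 2
        weighted_sum += weight * ratio
        weight_total += weight
    estimate = weighted_sum / weight_total
    se = math.sqrt(1.0 / weight_total)
    z = estimate / se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return estimate, se, p_value

# Hypothetical summary statistics for three variants acting on one protein:
# (pQTL effect on protein level, effect on PCOS log-odds, SE of that effect).
example_instruments = [
    (0.50, 0.10, 0.020),
    (0.40, 0.08, 0.020),
    (0.25, 0.05, 0.025),
]

estimate, se, p_value = ivw_estimate(example_instruments)
print(f"IVW estimate (log-odds per unit protein): {estimate:.3f}, SE: {se:.3f}, p: {p_value:.2e}")
```

In practice the exposure effects would come from the pQTL data and the outcome effects from the PCOS GWAS summary statistics, with instruments selected for strength and independence before the estimate is computed.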

Results: MR analysis revealed 243 protein-coding genes with a causal relationship to PCOS risk, of which the 12 with the most significant evidence were prioritized. Colocalization analysis then identified three key genes: CUB domain-containing protein 1 (CDCP1), glutaredoxin 2 (GLRX2), and kirre-like nephrin family adhesion molecule 2 (KIRREL2). Subsequent analyses indicated that the three genes were strongly related to immune function and metabolism. Single-cell analysis was used to explore the expression levels of the genes in ovarian theca cells.

Conclusion: Overall, three protein-coding genes (CDCP1, GLRX2, and KIRREL2) may be related to a higher PCOS risk and may serve as entry points for exploring PCOS pathogenesis and treatment; further clinical investigation is warranted.

Keywords: Mendelian randomization; bioinformatics; metabolism; polycystic ovary syndrome; proteome.

MeSH terms

  • Biomarkers
  • Female
  • Genetic Predisposition to Disease
  • Genome-Wide Association Study*
  • Humans
  • Mendelian Randomization Analysis*
  • Polycystic Ovary Syndrome* / genetics
  • Proteome*
  • Quantitative Trait Loci

Substances

  • Proteome
  • Biomarkers

Grants and funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.