Unveiling the molecular complexity of proliferative diabetic retinopathy through scRNA-seq, AlphaFold 2, and machine learning

Front Endocrinol (Lausanne). 2024 May 10:15:1382896. doi: 10.3389/fendo.2024.1382896. eCollection 2024.

Abstract

Background: Proliferative diabetic retinopathy (PDR), a major cause of blindness, is characterized by complex pathogenesis. This study integrates single-cell RNA sequencing (scRNA-seq), Non-negative Matrix Factorization (NMF), machine learning, and AlphaFold 2 methods to explore the molecular level of PDR.

Methods: We analyzed scRNA-seq data from PDR patients and healthy controls to identify distinct cellular subtypes and gene expression patterns. NMF was used to define specific transcriptional programs in PDR. The oxidative stress-related genes (ORGs) identified within Meta-Program 1 were utilized to construct a predictive model using twelve machine learning algorithms. Furthermore, we employed AlphaFold 2 for the prediction of protein structures, complementing this with molecular docking to validate the structural foundation of potential therapeutic targets. We also analyzed protein-protein interaction (PPI) networks and the interplay among key ORGs.

Results: Our scRNA-seq analysis revealed five major cell types and 14 subcell types in PDR patients, with significant differences in gene expression compared to those in controls. We identified three key meta-programs underscoring the role of microglia in the pathogenesis of PDR. Three critical ORGs (ALKBH1, PSIP1, and ATP13A2) were identified, with the best-performing predictive model demonstrating high accuracy (AUC of 0.989 in the training cohort and 0.833 in the validation cohort). Moreover, AlphaFold 2 predictions combined with molecular docking revealed that resveratrol has a strong affinity for ALKBH1, indicating its potential as a targeted therapeutic agent. PPI network analysis, revealed a complex network of interactions among the hub ORGs and other genes, suggesting a collective role in PDR pathogenesis.

Conclusion: This study provides insights into the cellular and molecular aspects of PDR, identifying potential biomarkers and therapeutic targets using advanced technological approaches.

Keywords: ALKBH1; AlphaFold 2; NMF; PPI; diabetic retinopathy; machine learning; oxidative stress; single-cell analysis.

MeSH terms

  • Case-Control Studies
  • Diabetic Retinopathy* / genetics
  • Diabetic Retinopathy* / metabolism
  • Diabetic Retinopathy* / pathology
  • Female
  • Humans
  • Machine Learning*
  • Male
  • Molecular Docking Simulation
  • Oxidative Stress
  • Protein Interaction Maps
  • RNA-Seq
  • Sequence Analysis, RNA / methods
  • Single-Cell Analysis / methods
  • Single-Cell Gene Expression Analysis

Grants and funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was supported in part by the Shenzhen Science and Technology Program (No. JCYJ20220818102603007, GCZX2015043017281705) and the General Project of the Shenzhen Natural Science Foundation (No. JCYJ20210324113808023 and JCYJ20220530152813030).