Development and validation of preeclampsia predictive models using key genes from bioinformatics and machine learning approaches

Front Immunol. 2024 Oct 31:15:1416297. doi: 10.3389/fimmu.2024.1416297. eCollection 2024.

Abstract

Background: Preeclampsia (PE) poses significant diagnostic and therapeutic challenges. This study aims to identify novel genes for potential diagnostic and therapeutic targets, illuminating the immune mechanisms involved.

Methods: Three GEO datasets were analyzed, merging two for training set, and using the third for external validation. Intersection analysis of differentially expressed genes (DEGs) and WGCNA highlighted candidate genes. These were further refined through LASSO, SVM-RFE, and RF algorithms to identify diagnostic hub genes. Diagnostic efficacy was assessed using ROC curves. A predictive nomogram and fully Connected Neural Network (FCNN) were developed for PE prediction. ssGSEA and correlation analysis were employed to investigate the immune landscape. Further validation was provided by qRT-PCR on human placental samples.

Result: Five biomarkers were identified with validation AUCs: CGB5 (0.663, 95% CI: 0.577-0.750), LEP (0.850, 95% CI: 0.792-0.908), LRRC1 (0.797, 95% CI: 0.728-0.867), PAPPA2 (0.839, 95% CI: 0.775-0.902), and SLC20A1 (0.811, 95% CI: 0.742-0.880), all of which are involved in key biological processes. The nomogram showed strong predictive power (C-index 0.873), while FCNN achieved an optimal AUC of 0.911 (95% CI: 0.732-1.000) in five-fold cross-validation. Immune infiltration analysis revealed the importance of T cell subsets, neutrophils, and NK cells in PE, linking these genes to immune mechanisms underlying PE pathogenesis.

Conclusion: CGB5, LEP, LRRC1, PAPPA2, and SLC20A1 are validated as key diagnostic biomarkers for PE. Nomogram and FCNN could credibly predict PE. Their association with immune infiltration underscores the crucial role of immune responses in PE pathogenesis.

Keywords: bioinformatics; deep learning; immune cell infiltration; machine learning; preeclampsia.

Publication types

  • Validation Study

MeSH terms

  • Biomarkers*
  • Computational Biology* / methods
  • Databases, Genetic
  • Female
  • Gene Expression Profiling
  • Gene Regulatory Networks
  • Humans
  • Machine Learning*
  • Nomograms
  • Pre-Eclampsia* / diagnosis
  • Pre-Eclampsia* / genetics
  • Pre-Eclampsia* / immunology
  • Pregnancy
  • Transcriptome

Substances

  • Biomarkers

Grants and funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was supported by the National Key Research and Development Program of China (2018YFC1002800), the National Natural Science Foundation of China (82171669), Shanghai Jiao Tong University Trans-Med Awards Research (STAR) (Major Project) (grant number 20210201), and the Funds for Outstanding Newcomers, Shanghai Sixth People’s Hospital (X-3664).