Common biomarkers of idiopathic pulmonary fibrosis and systemic sclerosis based on WGCNA and machine learning

Sci Rep. 2025 Jan 3;15(1):610. doi: 10.1038/s41598-024-84820-3.

Abstract

Interstitial lung disease (ILD) is known to be a major complication of systemic sclerosis (SSc) and a leading cause of death in SSc patients. As the most common type of ILD, the pathogenesis of idiopathic pulmonary fibrosis (IPF) has not been fully elucidated. In this study, weighted correlation network analysis (WGCNA), protein‒protein interaction, Kaplan-Meier curve, univariate Cox analysis and machine learning methods were used on datasets from the Gene Expression Omnibus database. CCL2 was identified as a common characteristic gene of IPF and SSc. The genes associated with CCL2 expression in both diseases were enriched mainly in chemokine-related pathways and lipid metabolism-related pathways according to Gene Set Enrichment Analysis. Single-cell RNA sequencing (sc-RNAseq) revealed a significant difference in CCL2 expression in alveolar epithelial type 1/2 cells, mast cells, ciliated cells, club cells, fibroblasts, M1/M2 macrophages, monocytes and plasma cells between IPF patients and healthy donors. Statistical analyses revealed that CCL2 was negatively correlated with lung function in IPF patients and decreased after mycophenolate mofetil (MMF) treatment in SSc patients. Finally, we identified CCL2 as a common biomarker from IPF and SSc, revealing the common mechanism of these two diseases and providing clues for the study of the treatment and mechanism of these two diseases.

Keywords: CCL2; GSEA; Idiopathic pulmonary fibrosis; Systemic sclerosis; WGCNA; scRNA sequencing analysis.

MeSH terms

  • Biomarkers* / metabolism
  • Chemokine CCL2* / genetics
  • Chemokine CCL2* / metabolism
  • Female
  • Gene Expression Profiling
  • Gene Regulatory Networks
  • Humans
  • Idiopathic Pulmonary Fibrosis* / genetics
  • Idiopathic Pulmonary Fibrosis* / metabolism
  • Machine Learning*
  • Male
  • Middle Aged
  • Scleroderma, Systemic* / genetics
  • Scleroderma, Systemic* / metabolism

Substances

  • Biomarkers
  • Chemokine CCL2
  • CCL2 protein, human