Background and objective: The aim of this study was to investigate potential hub genes for dilated cardiomyopathy (DCM).
Methods: Five DCM-related microarray datasets were downloaded from the Gene Expression Omnibus (GEO). Differentially expressed genes (DEGs) were used for identification. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment, disease ontology, gene ontology annotation and protein-protein interaction (PPI) network analysis were then performed, while a random forest was constructed to explore central genes. Artificial neural networks were used to compare with known genes and to develop new diagnostic models. 240 population blood samples were collected and expression of hub genes was verified in these samples using RT-PCR and demonstrated by Nomogram.
Results: After differential analysis, 33 genes were statistically significant (adjusted P < 0.05). Functional enrichment of these differential genes resulted in 85 Gene Ontology (GO) functions identified and 6 pathways enriched for the KEGG pathway. PPI networks and molecular complex assays identified 10 hub genes (adjusted P < 0.05). Random forest identified SMOC2 and SFRP4 as the most important, followed by FCER1G and FRZB. NeuraHF models (SMOC2, SFRP4, FCER1G and FRZB) were selected by artificial neural network model and had better diagnostic efficacy for the onset of DCM, compared with the traditional KG-DCM models (MYH7, ACTC1, TTN and LMNA). Finally, SFRP4 and FRZB were expressed higher in DCM verified by RT-PCR and as a factor for DCM identified by Nomogram.
Conclusions: We performed an integrated analysis and identified SFRP4 and FRZB as a new factor for DCM. But the exact mechanism still needs further experimental verification.
Keywords: Artificial neural network; Dilated cardiomyopathy; GEO analysis; Random forest.
© 2024. The Author(s).