Identification of epithelial-related artificial neural network prognostic models for the prediction of bladder cancer prognosis through comprehensive analysis of single-cell and bulk RNA sequencing

Heliyon. 2024 Jul 17;10(15):e34632. doi: 10.1016/j.heliyon.2024.e34632. eCollection 2024 Aug 15.

Abstract

Background: Bladder cancer (BLCA) presents as a heterogeneous epithelial malignancy. Progress in the early detection and effective treatment of BLCA relies heavily on the identification of novel biomarkers. Therefore, the primary goal of this study is to pinpoint potential biomarkers for BLCA through the fusion of single-cell RNA sequencing and RNA sequencing assessments. Furthermore, the aim is to establish practical clinical prognostic models that can facilitate accurate categorization and individualized therapy for patients.

Methods: In this research, training sets were acquired from the TCGA database, whereas validation sets (GSE32894) and single-cell datasets (GSE135337) were extracted from the GEO database. Single-cell analysis was utilized to obtain characteristic subpopulations along with their associated marker genes. Subsequently, a novel BLCA subtype was identified within TCGA-BLCA. Furthermore, an artificial neural network prognostic model was constructed within the TCGA-BLCA cohort and subsequently verified utilizing a validation set. Two machine learning algorithms were employed to screen hub genes. QRT-qPCR was performed to detect the gene expression levels utilized in the construction of prognostic models across various cell lines. Additionally, the cMAP database and molecular docking were utilized for searching small molecule drugs.

Results: The results of single-cell analysis revealed the presence of epithelial cells in multiple subpopulations, with 1579 marker genes selected for subsequent investigations. Subsequently, four epithelial cell subtypes were identified within the TCGA-BLCA cohort. Notably, cluster A exhibited a significant survival advantage. Concurrently, an artificial neural network prognostic model comprising 17 feature genes was constructed, accurately stratifying patient risk. Patients categorized in the low-risk group demonstrated a considerable survival advantage. The ROC analysis suggested that the model has strong prognostic ability. Furthermore, the findings of the validation group align consistently with those from the training group. Two types of machine learning algorithms screened NFIC as hub genes. Forskolin, a small molecule drug that binds to NFIC, was identified by employing a cMAP database and molecular docking.

Conclusion: The analysis results supplement the research on the role of epithelial cells in BLCA. An artificial neural network prognostic model containing 17 characteristic genes demonstrates the capability to accurately stratify patient risk, thereby potentially improving clinical decision-making and optimizing personalized therapeutic approaches.

Keywords: Artificial neural network prognostic models; Bladder cancer; Machine learning; NFIC; Single-cell analysis.