Using machine learning approach for screening metastatic biomarkers in colorectal cancer and predictive modeling with experimental validation

Sci Rep. 2023 Nov 8;13(1):19426. doi: 10.1038/s41598-023-46633-8.

Abstract

Colorectal cancer (CRC) liver metastasis accounts for the majority of fatalities associated with CRC. Early detection of metastasis is crucial for improving patient outcomes but can be delayed due to a lack of symptoms. In this research, we aimed to investigate CRC metastasis-related biomarkers by employing a machine learning (ML) approach and experimental validation. The gene expression profile of CRC patients with liver metastasis was obtained using the GSE41568 dataset, and the differentially expressed genes between primary and metastatic samples were screened. Subsequently, we carried out feature selection to identify the most relevant DEGs using LASSO and Penalized-SVM methods. DEGs commonly selected by these methods were selected for further analysis. Finally, the experimental validation was done through qRT-PCR. 11 genes were commonly selected by LASSO and P-SVM algorithms, among which seven had prognostic value in colorectal cancer. It was found that the expression of the MMP3 gene decreases in stage IV of colorectal cancer compared to other stages (P value < 0.01). Also, the expression level of the WNT11 gene was observed to increase significantly in this stage (P value < 0.001). It was also found that the expression of WNT5a, TNFSF11, and MMP3 is significantly lower, and the expression level of WNT11 is significantly higher in liver metastasis samples compared to primary tumors. In summary, this study has identified a set of potential biomarkers for CRC metastasis using ML algorithms. The findings of this research may provide new insights into identifying biomarkers for CRC metastasis and may potentially lay the groundwork for innovative therapeutic strategies for treatment of this disease.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers, Tumor / genetics
  • Biomarkers, Tumor / metabolism
  • Colorectal Neoplasms* / diagnosis
  • Colorectal Neoplasms* / genetics
  • Colorectal Neoplasms* / pathology
  • Early Detection of Cancer
  • Gene Expression Profiling / methods
  • Humans
  • Liver Neoplasms* / diagnosis
  • Liver Neoplasms* / genetics
  • Liver Neoplasms* / secondary
  • Matrix Metalloproteinase 3 / genetics

Substances

  • Matrix Metalloproteinase 3
  • Biomarkers, Tumor