A network-based predictive gene expression signature for recurrence risks in stage II colorectal cancer

Cancer Med. 2020 Jan;9(1):179-193. doi: 10.1002/cam4.2642. Epub 2019 Nov 14.

Abstract

The current criteria for defining the recurrence risks of stage II colorectal cancer (CRC) are not robust; therefore, we aimed to explore novel gene signatures to predict recurrence risks and to reveal the underlying mechanisms of stage II CRC. First, the gene expression profiles of 124 patients with stage II CRC from The Cancer Genome Atlas (TCGA) database were obtained to screen differentially expressed genes (DEGs). A total of 202 DEGs, including 128 upregulated and 74 downregulated, were identified in the recurrence group (n = 24) compared to the nonrecurrence group (n = 100). Furthermore, the top 5 DEGs (ZNF561, WFS1, SLC2A1, MFI2, and PTGR1) were identified by random forest variable hunting, and four (ZNF561, WFS1, SLC2A1, and PTGR1) were selected to create a four-gene recurrent model (GRM), with an area under the curve (AUC) of 0.882 according to the receiver operating characteristic curve, and the robust diagnostic effectiveness of the GRM was further validated with another gene expression profiling dataset (GSE12032), with an AUC of 0.943. The diagnostic effectiveness of the GRM regarding recurrence was associated with poor disease-free survival in all stages of CRC. In addition, gene ontology functional annotation and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses revealed 18 enriched functions and 6 enriched pathways. Four genes, ABCG2, CACNA1F, CYP19A1, and TF, were identified as hub genes by the protein-protein interaction network, which further validated that these genes were correlated with a poor pathologic stage and overall survival in all stages of CRC. In conclusion, the GRM can effectively classify stage II CRC into groups of high and low risks of recurrence, thereby making up for the prognostic value of the traditional clinicopathological risk factors defined by the National Comprehensive Cancer Network guidelines. The hub genes may be useful therapeutic targets for recurrence. Thus, the GRM and hub genes could offer clinical value in directing individualized and precision therapeutic regimens for stage II CRC patients.

Keywords: bioinformatics analysis; colorectal cancer; recurrence mechanisms; recurrence risks.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Antineoplastic Agents / pharmacology
  • Antineoplastic Agents / therapeutic use
  • Biomarkers, Tumor / antagonists & inhibitors
  • Biomarkers, Tumor / genetics*
  • Colorectal Neoplasms / genetics*
  • Colorectal Neoplasms / mortality
  • Colorectal Neoplasms / pathology
  • Datasets as Topic
  • Disease-Free Survival
  • Down-Regulation
  • Follow-Up Studies
  • Gene Expression Regulation, Neoplastic
  • Gene Regulatory Networks / drug effects
  • Humans
  • Molecular Targeted Therapy / methods
  • Neoplasm Recurrence, Local / epidemiology*
  • Neoplasm Recurrence, Local / genetics
  • Neoplasm Recurrence, Local / prevention & control
  • Neoplasm Staging
  • Precision Medicine / methods
  • Prognosis
  • Protein Interaction Maps / drug effects
  • Protein Interaction Maps / genetics
  • RNA-Seq
  • ROC Curve
  • Risk Assessment / methods
  • Transcriptome / drug effects
  • Transcriptome / genetics*
  • Up-Regulation

Substances

  • Antineoplastic Agents
  • Biomarkers, Tumor