Integrative Analysis of DNA Methylation and Gene Expression Profiles Identifies Colorectal Cancer-Related Diagnostic Biomarkers

Pathol Oncol Res. 2021 Jul 21:27:1609784. doi: 10.3389/pore.2021.1609784. eCollection 2021.

Abstract

Background: Colorectal cancer (CRC) is a common human malignancy worldwide. The prognosis of patients is largely frustrated by delayed diagnosis or misdiagnosis. DNA methylation alterations have been previously proved to be involved in CRC carcinogenesis. Methods: In this study, we proposed to identify CRC-related diagnostic biomarkers by analyzing DNA methylation and gene expression profiles. TCGA-COAD datasets downloaded from the Cancer Genome Atlas (TCGA) were used as the training set to screen differential expression genes (DEGs) and methylation CpG sites (dmCpGs) in CRC samples. A logistic regression model was constructed based on hyper-methylated CpG sites which were located in downregulated genes for CRC diagnosis. Another two independent datasets from the Gene Expression Omnibus (GEO) were used as a testing set to evaluate the performance of the model in CRC diagnosis. Results: We found that CpG island methylator phenotype (CIMP) was a potential signature of poor prognosis by dividing CRC samples into CIMP and noCIMP groups based on a set of CpG sites with methylation standard deviation (sd) > 0.2 among CRC samples and low methylation levels (mean β < 0.05) in adjacent samples. Hyper-methylated CpGs tended to be more closed to CpG island (CGI) and transcription start site (TSS) relative to hypo-methylated CpGs (p-value < 0.05, Fisher exact test). A logistic regression model was finally constructed based on two hyper-methylated CpGs, which had an area under receiver operating characteristic curve of 0.98 in the training set, and 0.85 and 0.95 in the two independent testing sets. Conclusions: In conclusion, our study identified promising DNA methylation biomarkers for CRC diagnosis.

Keywords: Colorectal cancer; CpG island methylator phenotype; DNA methylation; Gene Expression Omnibus; The Cancer Genome Atlas; diagnosis; logistic regression model.

MeSH terms

  • Biomarkers, Tumor / genetics*
  • Case-Control Studies
  • Colorectal Neoplasms / diagnosis*
  • Colorectal Neoplasms / genetics
  • CpG Islands
  • DNA Methylation*
  • Gene Expression Profiling
  • Gene Expression Regulation, Neoplastic*
  • Humans
  • Phenotype*
  • Prognosis
  • Survival Rate
  • Transcriptome*

Substances

  • Biomarkers, Tumor