Identifying the hub genes in non-small cell lung cancer by integrated bioinformatics methods and analyzing the prognostic values

Pathol Res Pract. 2021 Dec:228:153654. doi: 10.1016/j.prp.2021.153654. Epub 2021 Oct 13.

Abstract

Background: Lung cancer, a malignant tumor, has the highest mortality and second most common morbidity worldwide. Non-small cell lung cancer (NSCLC) is the most common pathological subtype of lung cancer. This study aimed to identify the gene signature associated with the NSCLC prognosis using bioinformatics analysis.

Materials and methods: The dataset GSE103512 was utilized to construct co-expression networks using weighted gene co-expression network analysis (WGCNA). Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses were performed using Database for Annotation, Visualization, and Integrated Discovery. Gene set enrichment analysis was conducted to ascertain the function of the hub genes more accurately. The relationship between the hub genes and immune infiltration was investigated using a single sample gene set enrichment analysis. Hub genes were screened and validated by other datasets and online websites.

Results: The results of WGCNA demonstrated that the blue module was most significantly related to tumor progression in NSCLC. Functional enrichment analysis showed that the blue module was associated with DNA replication, cell division, mitotic nuclear division, and cell cycle. A total of five hub genes (RFC5, UBE2S, CHAF1A, FANCI, and TMEM194A) were chosen to be identified and validated at transcriptional and translational levels. Receiver operating characteristic curve verified that the mRNA levels of these five genes can excellently discriminate between normal and tumor tissues. Survival analysis was also performed. Additionally, the protein levels of these five genes were also significantly different between tumor and normal tissues. Immune infiltration analysis showed that the expression levels of the hub genes had a negative correlation with the infiltration levels of many cells related to innate immune response, antigen-presenting process, humoral immune response, or T cell-mediated immune responses.

Conclusions: We identified five hub genes associated with the NSCLC tumorigenesis. NSCLC patients with higher expressions of each hub gene had a worse prognosis than those with lower expressions. Moreover, the hub genes might serve as biomarkers and therapeutic targets for precise diagnosis, target therapy, and immunotherapy of NSCLC in the future.

Keywords: Immune infiltration analysis; Non-small cell lung cancer; Prognostic; WGCNA; hub Gene.

MeSH terms

  • Carcinoma, Non-Small-Cell Lung / genetics*
  • Computational Biology / methods*
  • Gene Expression Profiling / methods
  • Gene Regulatory Networks
  • Humans
  • Lung Neoplasms / genetics*
  • Prognosis
  • Transcriptome*