A computational study of gene expression patterns in head and neck squamous cell carcinoma using TCGA data

Future Sci OA. 2024 Dec 31;10(1):2380590. doi: 10.1080/20565623.2024.2380590. Epub 2024 Aug 14.

Abstract

Aim: Head and neck squamous cell carcinoma (HNSCC) is the second most prevalent cancer in Pakistan. Methods: Gene expression data from TCGA, with GTEx data as the normal reference, were used to identify differentially expressed genes (DEGs). The data were further investigated using the Enrichr tool to perform Gene Ontology (GO) analysis. Results: Our analysis identified the most significantly differentially expressed genes and explored their established cellular functions as well as their potential involvement in tumor development. We found that Keratin family and S100A9 genes were highly expressed. The under-expressed genes KRT4 and KRT13 provide instructions for the production of keratin proteins. Conclusion: Our study suggests that factors such as poor oral hygiene and smokeless tobacco use can result in oral stress and cellular damage, which can in turn cause cancer.

Keywords: Big Data; DEG; Gene Ontology; HNSCC; KRT13; TCGA.

Plain language summary

The Cancer Genome Atlas (TCGA) holds vast amounts of cancer data that are processed with powerful computers and cloud technology. This enables new bioinformatics approaches for better cancer diagnosis, treatment, and prevention. In Southeast Asia, head and neck squamous cell carcinoma (HNSCC) is prevalent. We used TCGA and GTEx data to study gene expression. Highly expressed Keratin and S100A9 genes help fight cellular damage under stress, while the under-expressed KRT4 and KRT13 genes help shape cell structure. Poor oral care and smokeless tobacco could induce cell damage, triggering cancer-causing mutations. Uncovering the mechanisms of HNSCC may guide targeted treatments and preventive strategies.
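
As a rough illustration of what comparing tumor and normal gene expression involves, the sketch below labels genes as over- or under-expressed by their log2 fold change. It is a minimal sketch under stated assumptions, not the pipeline used in the study: the gene names, expression values, pseudocount, and threshold are all hypothetical, and a real analysis would also apply a statistical test with multiple-testing correction.

```python
import math
from statistics import mean


def log2_fold_change(tumor, normal, pseudocount=1.0):
    """log2 ratio of mean tumor expression to mean normal expression.

    The pseudocount (an assumption here) avoids division by zero.
    """
    return math.log2((mean(tumor) + pseudocount) / (mean(normal) + pseudocount))


def classify_genes(tumor_expr, normal_expr, threshold=1.0):
    """Label each gene 'up', 'down', or 'unchanged' by its log2 fold change."""
    calls = {}
    for gene in tumor_expr:
        lfc = log2_fold_change(tumor_expr[gene], normal_expr[gene])
        if lfc >= threshold:
            calls[gene] = ("up", lfc)
        elif lfc <= -threshold:
            calls[gene] = ("down", lfc)
        else:
            calls[gene] = ("unchanged", lfc)
    return calls


# Made-up toy values for illustration only; NOT data from the study.
tumor_expr = {
    "GENE_A": [80.0, 95.0, 110.0],
    "GENE_B": [5.0, 7.0, 6.0],
    "GENE_C": [20.0, 22.0, 21.0],
}
normal_expr = {
    "GENE_A": [10.0, 12.0, 11.0],
    "GENE_B": [60.0, 55.0, 65.0],
    "GENE_C": [19.0, 23.0, 21.0],
}

calls = classify_genes(tumor_expr, normal_expr)
for gene, (label, lfc) in calls.items():
    print(f"{gene}: {label} (log2FC = {lfc:.2f})")
```

A threshold of 1.0 on the log2 scale corresponds to a two-fold change in either direction, which is a common but arbitrary cutoff.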