Integrating machine learning algorithms and multiple immunohistochemistry validation to unveil novel diagnostic markers based on costimulatory molecules for predicting immune microenvironment status in triple-negative breast cancer

Front Immunol. 2024 Jun 28:15:1424259. doi: 10.3389/fimmu.2024.1424259. eCollection 2024.

Abstract

Introduction: Costimulatory molecules are putative novel targets or potential additions to current available immunotherapy, but their expression patterns and clinical value in triple-negative breast cancer (TNBC) are to be clarified.

Methods: The gene expression profiles datasets of TNBC patients were obtained from The Cancer Genome Atlas and the Gene Expression Omnibus databases. Diagnostic biomarkers for stratifying individualized tumor immune microenvironment (TIME) were identified using the Least Absolute Shrinkage and Selection Operator (LASSO) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) algorithms. Additionally, we explored their associations with response to immunotherapy via the multiplex immunohistochemistry (mIHC).

Results: A total of 60 costimulatory molecule genes (CMGs) were obtained, and we determined two different TIME subclasses ("hot" and "cold") through the K-means clustering method. The "hot" tumors presented a higher infiltration of activated immune cells, i.e., CD4 memory-activated T cells, resting NK cells, M1 macrophages, and CD8 T cells, thereby enriched in the B cell and T cell receptor signaling pathways. LASSO and SVM-RFE algorithms identified three CMGs (CD86, TNFRSF17 and TNFRSF1B) as diagnostic biomarkers. Following, a novel diagnostic nomogram was constructed for predicting individualized TIME status and was validated with good predictive accuracy in TCGA, GSE76250 and GSE58812 databases. Further mIHC conformed that TNBC patients with high CD86, TNFRSF17 and TNFRSF1B levels tended to respond to immunotherapy.

Conclusion: This study supplemented evidence about the value of CMGs in TNBC. In addition, CD86, TNFRSF17 and TNFRSF1B were found as potential biomarkers, significantly promoting TNBC patient selection for immunotherapeutic guidance.

Keywords: costimulatory molecules; diagnostic biomarker; machine learning algorithm; triple-negative breast cancer; tumor immune microenvironment.

MeSH terms

  • Algorithms
  • Biomarkers, Tumor*
  • Female
  • Gene Expression Profiling
  • Humans
  • Immunohistochemistry*
  • Immunotherapy
  • Lymphocytes, Tumor-Infiltrating / immunology
  • Lymphocytes, Tumor-Infiltrating / metabolism
  • Machine Learning*
  • Transcriptome
  • Triple Negative Breast Neoplasms* / diagnosis
  • Triple Negative Breast Neoplasms* / immunology
  • Tumor Microenvironment* / immunology

Substances

  • Biomarkers, Tumor

Grants and funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was supported by the cultivation foundation for the junior teachers in Sun Yat-sen University (No.20ykpy164).