A2HTL: An Automated Hybrid Transformer-Based Learning for Predicting Survival of Esophageal Cancer Using CT Images

IEEE Trans Nanobioscience. 2024 Oct;23(4):548-555. doi: 10.1109/TNB.2024.3441533. Epub 2024 Oct 15.

Abstract

Esophageal cancer is a common malignant tumor, precisely predicting survival of esophageal cancer is crucial for personalized treatment. However, current region of interest (ROI) based methodologies not only necessitate prior medical knowledge for tumor delineation, but may also cause the model to be overly sensitive to ROI. To address these challenges, we develop an automated Hybrid Transformer based learning that integrates a Hybrid Transformer size-aware U-Net with a ranked survival prediction network to enable automatic survival prediction for esophageal cancer. Specifically, we first incorporate the Transformer with shifted windowing multi-head self-attention mechanism (SW-MSA) into the base of the U-Net encoder to capture the long-range dependency in CT images. Furthermore, to alleviate the imbalance between the ROI and the background in CT images, we devise a size-aware coefficient for the segmentation loss. Finally, we also design a ranked pair sorting loss to more comprehensively capture the ranked information inherent in CT images. We evaluate our proposed method on a dataset comprising 759 samples with esophageal cancer. Experimental results demonstrate the superior performance of our proposed method in survival prediction, even without ROI ground truth.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Esophageal Neoplasms* / diagnostic imaging
  • Humans
  • Machine Learning
  • Radiographic Image Interpretation, Computer-Assisted / methods
  • Survival Analysis
  • Tomography, X-Ray Computed* / methods