Gastrointestinal (GI) cancer is a malignancy affecting the digestive organs. During radiation therapy, the radiation oncologist must precisely aim the X-ray beam at the tumor while avoiding unaffected areas of the stomach and intestines. Consequently, accurate, automated GI image segmentation is urgently needed in clinical practice. While the fully convolutional network (FCN) and U-Net framework have shown impressive results in medical image segmentation, their ability to model long-range dependencies is constrained by the convolutional kernel's restricted receptive field. The transformer has a robust capacity for global modeling owing to its inherent global self-attention mechanism. The TransUnet model leverages the strengths of both the convolutional neural network (CNN) and transformer models through a hybrid CNN-transformer encoder. However, the concatenation of high- and low-level features in the decoder is ineffective in fusing global and local information. To overcome this limitation, we propose an innovative transformer-based medical image segmentation architecture called BiFTransNet, which introduces a BiFusion module into the decoder stage, enabling effective fusion of global and local features through feature integration across modules. Further, a multilevel loss (ML) strategy is introduced to supervise the learning of each decoder layer and optimize the use of globally and locally fused contextual features at different scales. Our method achieved a Dice score of 89.51% and an intersection-over-union (IoU) score of 86.54% on the UW-Madison Gastrointestinal Segmentation dataset. Moreover, our method attained a Dice score of 78.77% and a Hausdorff distance (HD) of 27.94 on the Synapse Multi-organ Segmentation dataset. Compared with state-of-the-art methods, our proposed method achieves superior segmentation performance in gastrointestinal segmentation tasks.
More significantly, our method can be easily extended to medical segmentation in different modalities such as CT and MRI. Our method enables multimodal medical segmentation in clinical settings and provides decision support for clinical radiotherapy planning.
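The Dice and IoU scores reported above are standard overlap metrics for segmentation masks. A minimal NumPy sketch of how they are typically computed for binary masks follows; the function names and the smoothing term `eps` are illustrative, not taken from the paper's implementation.

```python
import numpy as np

def dice_score(pred: np.ndarray, target: np.ndarray, eps: float = 1e-7) -> float:
    """Dice coefficient for binary masks: 2|A ∩ B| / (|A| + |B|)."""
    pred = pred.astype(bool)
    target = target.astype(bool)
    intersection = np.logical_and(pred, target).sum()
    # eps avoids division by zero when both masks are empty
    return (2.0 * intersection + eps) / (pred.sum() + target.sum() + eps)

def iou_score(pred: np.ndarray, target: np.ndarray, eps: float = 1e-7) -> float:
    """Intersection over union: |A ∩ B| / |A ∪ B|."""
    pred = pred.astype(bool)
    target = target.astype(bool)
    intersection = np.logical_and(pred, target).sum()
    union = np.logical_or(pred, target).sum()
    return (intersection + eps) / (union + eps)
```

For multi-class settings such as the Synapse multi-organ task, these metrics are usually computed per class and then averaged.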
Keywords: Deep learning; Feature fusion; Medical image; Medical image segmentation; Multi-level loss.
Copyright © 2023 Elsevier Ltd. All rights reserved.