A lung nodule segmentation model based on the transformer with multiple thresholds and coordinate attention

Tianjiao Hu; Yihua Lan; Yingqi Zhang; Jiashu Xu; Shuai Li; Chih-Cheng Hung

doi:10.1038/s41598-024-82877-8

A lung nodule segmentation model based on the transformer with multiple thresholds and coordinate attention

Sci Rep. 2024 Dec 30;14(1):31743. doi: 10.1038/s41598-024-82877-8.

Authors

Tianjiao Hu¹, Yihua Lan^{2

3}, Yingqi Zhang¹, Jiashu Xu¹, Shuai Li¹, Chih-Cheng Hung⁴

Affiliations

¹ School of Artificial Intelligence and Software Engineering, Nanyang Normal University, Nanyang, 473061, China.
² School of Artificial Intelligence and Software Engineering, Nanyang Normal University, Nanyang, 473061, China. [email protected].
³ Henan Engineering Research Center of Intelligent Processing for Big Data of Digital Image, Nanyang, 473061, China. [email protected].
⁴ Laboratory for Machine Vision and Security Research, Kennesaw State University-Marietta Campus, Marietta, USA.

Abstract

Accurate lung nodule segmentation is fundamental for the early detection of lung cancer. With the rapid development of deep learning, lung nodule segmentation models based on the encoder-decoder structure have become the mainstream research approach. However, during the encoding process, most models have limitations in extracting edge and semantic information and in capturing long-range dependencies. To address these problems, we propose a new lung nodule segmentation model, abbreviated as MCAT-Net. In this model, we construct a multi-threshold feature separation module to capture edge and texture features from different levels and specified intensities of the input image. Secondly, we introduce the coordinate attention mechanism, which allows the model to better recognize and utilize spatial information when handling long-range dependencies, enabling the deep network to maintain its sensitivity to nodule positions. Thirdly, we use the transformer to fully capture the long-range dependencies, further enhancing the global information integration of the network. The proposed method was verified on the LIDC-IDRI and LNDb datasets. The Dice similarity coefficient (DSC) values achieved were 88.29% and 78.51%, and the sensitivities were 86.33% and 75.05%, respectively. The experimental results demonstrated its high practical value for the early diagnosis of lung cancer.

MeSH terms

Algorithms
Deep Learning
Early Detection of Cancer / methods
Humans
Image Processing, Computer-Assisted / methods
Lung Neoplasms* / diagnostic imaging
Lung Neoplasms* / pathology
Radiographic Image Interpretation, Computer-Assisted / methods
Solitary Pulmonary Nodule* / diagnostic imaging
Tomography, X-Ray Computed / methods

Abstract

MeSH terms

Grants and funding