VM-UNET-V2 Rethinking Vision Mamba UNet for Medical Image Segmentation

Zhang, Mingya; Yu, Yue; Gu, Limei; Lin, Tingsheng; Tao, Xianping

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2403.09157 (eess)

[Submitted on 14 Mar 2024]

Abstract: In medical image segmentation, models based on CNNs and Transformers have been thoroughly investigated. However, CNNs have limited capacity to model long-range dependencies, which makes it difficult to fully exploit the semantic information within images. Transformers, on the other hand, are hampered by their quadratic computational complexity. Recently, State Space Models (SSMs) such as Mamba have been recognized as a promising alternative: they not only demonstrate superior performance in modeling long-range interactions but also preserve linear computational complexity. Inspired by the Mamba architecture, we propose Vision Mamba-UNetV2 (VM-UNetV2), in which the Visual State Space (VSS) Block is introduced to capture extensive contextual information and the Semantics and Detail Infusion (SDI) module is introduced to augment the infusion of low-level and high-level features. We conduct comprehensive experiments on the ISIC17, ISIC18, CVC-300, CVC-ClinicDB, Kvasir, CVC-ColonDB and ETIS-LaribPolypDB public datasets. The results indicate that VM-UNetV2 exhibits competitive performance in medical image segmentation tasks. Our code is available at this https URL.
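
To make the linear-complexity claim concrete, the following is a minimal sketch of a scalar discrete state space recurrence, not the authors' VSS Block and not Mamba's selective scan. The function names (`ssm_scan`, `ssm_direct`) and the fixed scalar parameters `a`, `b`, `c` are assumptions chosen for illustration. The single-pass scan touches each input once, while the equivalent direct sum revisits every earlier input at each step and therefore scales quadratically.

```python
from typing import List, Sequence


def ssm_scan(x: Sequence[float], a: float, b: float, c: float) -> List[float]:
    """Run a scalar discrete state space model over x in one pass.

    h_t = a * h_{t-1} + b * x_t,   y_t = c * h_t,   with initial state 0.
    One multiply-add per step, so the cost grows linearly with len(x).
    """
    h = 0.0
    ys: List[float] = []
    for x_t in x:
        h = a * h + b * x_t  # state update carries past inputs forward
        ys.append(c * h)     # readout of the current state
    return ys


def ssm_direct(x: Sequence[float], a: float, b: float, c: float) -> List[float]:
    """Same outputs via an explicit sum over all earlier inputs (quadratic cost)."""
    ys: List[float] = []
    for t in range(len(x)):
        # Unrolled recurrence: y_t = sum_s c * a^(t-s) * b * x_s for s <= t.
        ys.append(sum(c * (a ** (t - s)) * b * x[s] for s in range(t + 1)))
    return ys


if __name__ == "__main__":
    signal = [1.0, 0.0, 0.0, 2.0, -1.0]
    fast = ssm_scan(signal, a=0.5, b=1.0, c=2.0)
    slow = ssm_direct(signal, a=0.5, b=1.0, c=2.0)
    # Both formulations agree; only their cost differs.
    assert all(abs(f - s) < 1e-9 for f, s in zip(fast, slow))
    print(fast)
```

In this toy setting the parameters are constants; Mamba-style models instead make them depend on the input, which this sketch does not attempt to reproduce.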

Comments:	12 pages, 4 figures
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.09157 [eess.IV]
	(or arXiv:2403.09157v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2403.09157

Submission history

From: Mingya Zhang [view email]
[v1] Thu, 14 Mar 2024 08:12:39 UTC (2,409 KB)
