Application of visual transformer in renal image analysis

Yuwei Yin; Zhixian Tang; Huachun Weng

doi:10.1186/s12938-024-01209-z

Application of visual transformer in renal image analysis

Biomed Eng Online. 2024 Mar 5;23(1):27. doi: 10.1186/s12938-024-01209-z.

Authors

Yuwei Yin^{1

2}, Zhixian Tang³, Huachun Weng^{4

5}

Affiliations

¹ The College of Health Sciences and Engineering, University of Shanghai for Science and Technology, 516 Jungong Highway, Yangpu Area, Shanghai, 200093, China.
² The College of Medical Technology, Shanghai University of Medicine & Health Sciences, 279 Zhouzhu Highway, Pudong New Area, Shanghai, 201318, China.
³ The College of Medical Technology, Shanghai University of Medicine & Health Sciences, 279 Zhouzhu Highway, Pudong New Area, Shanghai, 201318, China. [email protected].
⁴ The College of Health Sciences and Engineering, University of Shanghai for Science and Technology, 516 Jungong Highway, Yangpu Area, Shanghai, 200093, China. [email protected].
⁵ The College of Medical Technology, Shanghai University of Medicine & Health Sciences, 279 Zhouzhu Highway, Pudong New Area, Shanghai, 201318, China. [email protected].

Abstract

Deep Self-Attention Network (Transformer) is an encoder-decoder architectural model that excels in establishing long-distance dependencies and is first applied in natural language processing. Due to its complementary nature with the inductive bias of convolutional neural network (CNN), Transformer has been gradually applied to medical image processing, including kidney image processing. It has become a hot research topic in recent years. To further explore new ideas and directions in the field of renal image processing, this paper outlines the characteristics of the Transformer network model and summarizes the application of the Transformer-based model in renal image segmentation, classification, detection, electronic medical records, and decision-making systems, and compared with CNN-based renal image processing algorithm, analyzing the advantages and disadvantages of this technique in renal image processing. In addition, this paper gives an outlook on the development trend of Transformer in renal image processing, which provides a valuable reference for a lot of renal image analysis.

Keywords: Attention mechanism; Convolutional neural network; Deep learning; Kidney disease; Transformer.

Publication types

Review

MeSH terms

Algorithms*
Electronic Health Records*
Image Processing, Computer-Assisted
Kidney / diagnostic imaging
Natural Language Processing

Grants and funding

82170728/National Natural Science Foundation of China