Application of visual transformer in renal image analysis

Biomed Eng Online. 2024 Mar 5;23(1):27. doi: 10.1186/s12938-024-01209-z.

Abstract

Deep Self-Attention Network (Transformer) is an encoder-decoder architectural model that excels in establishing long-distance dependencies and is first applied in natural language processing. Due to its complementary nature with the inductive bias of convolutional neural network (CNN), Transformer has been gradually applied to medical image processing, including kidney image processing. It has become a hot research topic in recent years. To further explore new ideas and directions in the field of renal image processing, this paper outlines the characteristics of the Transformer network model and summarizes the application of the Transformer-based model in renal image segmentation, classification, detection, electronic medical records, and decision-making systems, and compared with CNN-based renal image processing algorithm, analyzing the advantages and disadvantages of this technique in renal image processing. In addition, this paper gives an outlook on the development trend of Transformer in renal image processing, which provides a valuable reference for a lot of renal image analysis.

Keywords: Attention mechanism; Convolutional neural network; Deep learning; Kidney disease; Transformer.

Publication types

  • Review

MeSH terms

  • Algorithms*
  • Electronic Health Records*
  • Image Processing, Computer-Assisted
  • Kidney / diagnostic imaging
  • Natural Language Processing