FPT: Fine-grained Prompt Tuning for Parameter and Memory Efficient Fine Tuning in High-resolution Medical Image Classification

Huang, Yijin; Cheng, Pujin; Tam, Roger; Tang, Xiaoying

Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.07576v1 (cs)

[Submitted on 12 Mar 2024 (this version), latest version 2 Jul 2024 (v4)]

Title:FPT: Fine-grained Prompt Tuning for Parameter and Memory Efficient Fine Tuning in High-resolution Medical Image Classification

Authors:Yijin Huang, Pujin Cheng, Roger Tam, Xiaoying Tang

View PDF HTML (experimental)

Abstract:Parameter-efficient fine-tuning (PEFT) is proposed as a cost-effective way to transfer pre-trained models to downstream tasks, avoiding the high cost of updating entire large-scale pre-trained models (LPMs). In this work, we present Fine-grained Prompt Tuning (FPT), a novel PEFT method for medical image classification. FPT significantly reduces memory consumption compared to other PEFT methods, especially in high-resolution contexts. To achieve this, we first freeze the weights of the LPM and construct a learnable lightweight side network. The frozen LPM takes high-resolution images as input to extract fine-grained features, while the side network is fed low-resolution images to reduce memory usage. To allow the side network to access pre-trained knowledge, we introduce fine-grained prompts that summarize information from the LPM through a fusion module. Important tokens selection and preloading techniques are employed to further reduce training cost and memory requirements. We evaluate FPT on four medical datasets with varying sizes, modalities, and complexities. Experimental results demonstrate that FPT achieves comparable performance to fine-tuning the entire LPM while using only 1.8% of the learnable parameters and 13% of the memory costs of an encoder ViT-B model with a 512 x 512 input resolution.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.07576 [cs.CV]
	(or arXiv:2403.07576v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.07576

Submission history

From: Yijin Huang [view email]
[v1] Tue, 12 Mar 2024 12:05:43 UTC (413 KB)
[v2] Tue, 26 Mar 2024 10:55:51 UTC (413 KB)
[v3] Tue, 25 Jun 2024 16:15:54 UTC (414 KB)
[v4] Tue, 2 Jul 2024 05:28:03 UTC (233 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:FPT: Fine-grained Prompt Tuning for Parameter and Memory Efficient Fine Tuning in High-resolution Medical Image Classification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:FPT: Fine-grained Prompt Tuning for Parameter and Memory Efficient Fine Tuning in High-resolution Medical Image Classification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators