Knowledge-enhanced Visual-Language Pretraining for Computational Pathology

Zhou, Xiao; Zhang, Xiaoman; Wu, Chaoyi; Zhang, Ya; Xie, Weidi; Wang, Yanfeng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2404.09942 (cs)

[Submitted on 15 Apr 2024]

Title:Knowledge-enhanced Visual-Language Pretraining for Computational Pathology

Authors:Xiao Zhou, Xiaoman Zhang, Chaoyi Wu, Ya Zhang, Weidi Xie, Yanfeng Wang

View PDF HTML (experimental)

Abstract:In this paper, we consider the problem of visual representation learning for computational pathology, by exploiting large-scale image-text pairs gathered from public resources, along with the domain specific knowledge in pathology. Specifically, we make the following contributions: (i) We curate a pathology knowledge tree that consists of 50,470 informative attributes for 4,718 diseases requiring pathology diagnosis from 32 human tissues. To our knowledge, this is the first comprehensive structured pathology knowledge base; (ii) We develop a knowledge-enhanced visual-language pretraining approach, where we first project pathology-specific knowledge into latent embedding space via language model, and use it to guide the visual representation learning; (iii) We conduct thorough experiments to validate the effectiveness of our proposed components, demonstrating significant performance improvement on various downstream tasks, including cross-modal retrieval, zero-shot classification on pathology patches, and zero-shot tumor subtyping on whole slide images (WSIs). All codes, models and the pathology knowledge tree will be released to the research community

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2404.09942 [cs.CV]
	(or arXiv:2404.09942v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2404.09942

Submission history

From: Xiao Zhou [view email]
[v1] Mon, 15 Apr 2024 17:11:25 UTC (4,185 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Knowledge-enhanced Visual-Language Pretraining for Computational Pathology

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Knowledge-enhanced Visual-Language Pretraining for Computational Pathology

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators