Self-supervised based clustering for retinal optical coherence tomography images

Yilong Luo; Tian Lin; Aidi Lin; Xiaoting Mai; Haoyu Chen

doi:10.1038/s41433-024-03444-z

Eye (Lond). 2024 Oct 28. doi: 10.1038/s41433-024-03444-z. Online ahead of print.

Authors

Yilong Luo¹, Tian Lin¹, Aidi Lin¹, Xiaoting Mai¹, Haoyu Chen²

Affiliations

¹ Joint Shantou International Eye Center, Shantou University & the Chinese University of Hong Kong, Shantou, China.
² Joint Shantou International Eye Center, Shantou University & the Chinese University of Hong Kong, Shantou, China.

PMID: 39468266
DOI: 10.1038/s41433-024-03444-z

Abstract

Background: Manual analysis cannot keep pace with the rising demand for interpreting retinal optical coherence tomography (OCT) images. In response, a self-supervised learning-based clustering model was implemented.

Methods: A public dataset of 83,484 OCT images was used, covering four categories: choroidal neovascularization (CNV), diabetic macular edema (DME), drusen, and normal fundus. This study employed the Semantic Pseudo Labeling for Image Clustering (SPICE) framework, a self-supervised learning-based method, to cluster unlabeled OCT images into two and into four categories, and its performance was compared with that of baseline models. We also analysed the feature distribution using t-SNE and explored the cluster centers, attention maps, and misclassified images. In addition, the DME and CNV subsets were each clustered into two groups, and the results were interpreted by two retinal specialists.

Results: SPICE outperformed the baseline models in both the binary and the four-category tasks, achieving accuracies of 0.886 and 0.846, respectively. In the t-SNE analysis, the four image types separated into distinct groups. The cluster centers corresponded to the human labels, and the attention maps revealed that the model focused on important biomarkers. The misclassified images shared features with the classes to which they were wrongly assigned. The model also split the DME and CNV subsets into two distinct groups each.
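
Because a clustering model outputs arbitrary cluster IDs rather than named classes, an accuracy figure requires some mapping from clusters to human labels. The abstract does not state how this mapping was done; a common convention, assumed here, is to take the one-to-one mapping that maximizes agreement. The minimal sketch below does this by brute force over permutations, which is practical for two or four classes. The function name `clustering_accuracy` and the toy labels are hypothetical and are not taken from the study.

```python
from itertools import permutations


def clustering_accuracy(true_labels, cluster_ids, n_classes):
    """Accuracy under the best one-to-one mapping of cluster IDs to class labels.

    Assumption: labels and cluster IDs are integers in range(n_classes).
    Brute force over permutations; fine for small n_classes such as 2 or 4.
    """
    if len(true_labels) != len(cluster_ids):
        raise ValueError("true_labels and cluster_ids must have equal length")
    if not true_labels:
        raise ValueError("at least one sample is required")

    # counts[c][t] = number of images in cluster c whose human label is t
    counts = [[0] * n_classes for _ in range(n_classes)]
    for t, c in zip(true_labels, cluster_ids):
        counts[c][t] += 1

    # Try every assignment of clusters to labels and keep the best agreement.
    best = max(
        sum(counts[c][mapping[c]] for c in range(n_classes))
        for mapping in permutations(range(n_classes))
    )
    return best / len(true_labels)


# Toy example with four classes (made-up data, not the OCT dataset).
true_labels = [0, 0, 1, 1, 2, 2, 3, 3]
cluster_ids = [2, 2, 0, 0, 3, 1, 1, 1]
acc = clustering_accuracy(true_labels, cluster_ids, n_classes=4)
print(acc)
```

For larger numbers of clusters, the same matching is usually solved with the Hungarian algorithm instead of enumerating permutations.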

Conclusions: Self-supervised clustering effectively distinguished between diseases and revealed features they have in common, with a notable capability to detect heterogeneity within a disease through biomarkers.