Performance of a Deep Learning System and Performance of Optometrists for the Detection of Glaucomatous Optic Neuropathy Using Colour Retinal Photographs

Catherine L Jan; Algis Vingrys; Jacqueline Henwood; Xianwen Shang; Christian Davey; Peter van Wijngaarden; George Y X Kong; Jennifer C Fan Gaskin; Bernardo P Soares Bezerra; Randall S Stafford; Mingguang He

doi:10.3390/bioengineering11111139

Performance of a Deep Learning System and Performance of Optometrists for the Detection of Glaucomatous Optic Neuropathy Using Colour Retinal Photographs

Bioengineering (Basel). 2024 Nov 13;11(11):1139. doi: 10.3390/bioengineering11111139.

Authors

Catherine L Jan^{1

2

3}, Algis Vingrys^{1

4}, Jacqueline Henwood¹, Xianwen Shang^{1

2}, Christian Davey⁵, Peter van Wijngaarden^{1

2}, George Y X Kong^{1

2}, Jennifer C Fan Gaskin^{1

2}, Bernardo P Soares Bezerra^{1

2}, Randall S Stafford⁶, Mingguang He^{1

2

7

8

9}

Affiliations

¹ Centre for Eye Research Australia, Royal Victorian Eye and Ear Hospital, East Melbourne, VIC 3002, Australia.
² Ophthalmology, Department of Surgery, The University of Melbourne, Melbourne, VIC 3010, Australia.
³ Lost Child's Vision Project, Sydney, NSW 2000, Australia.
⁴ Department of Optometry and Vision Sciences, The University of Melbourne, Melbourne, VIC 3053, Australia.
⁵ School of Mathematics and Statistics, University of Melbourne, Melbourne, VIC 3010, Australia.
⁶ Stanford Prevention Research Center, Stanford University School of Medicine, Stanford, CA 94304, USA.
⁷ School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong.
⁸ Research Centre for SHARP Vision (RCSV), The Hong Kong Polytechnic University, Kowloon, Hong Kong.
⁹ Centre for Eye and Vision Research (CEVR), 17W Hong Kong Science Park, Hong Kong.

Abstract

Background/objectives: Glaucoma is the leading cause of irreversible blindness, with a significant proportion of cases remaining undiagnosed globally. The interpretation of optic disc and retinal nerve fibre layer images poses challenges for optometrists and ophthalmologists, often leading to misdiagnosis. AI has the potential to improve diagnosis. This study aims to validate an AI system (a convolutional neural network based on the Inception-v3 architecture) for detecting glaucomatous optic neuropathy (GON) using colour fundus photographs from a UK population and to compare its performance against Australian optometrists.

Methods: A retrospective external validation study was conducted, comparing AI's performance with that of 11 AHPRA-registered optometrists in Australia on colour retinal photographs, evaluated against a reference (gold) standard established by a panel of glaucoma specialists. Statistical analyses were performed using sensitivity, specificity, and area under the receiver operating characteristic curve (AUROC).

Results: For referable GON, the sensitivity of the AI (33.3% [95%CI: 32.4-34.3) was significantly lower than that of optometrists (65.1% [95%CI: 64.1-66.0]), p < 0.0001, although with significantly higher specificity (AI: 97.4% [95%CI: 97.0-97.7]; optometrists: 85.5% [95%CI: 84.8-86.2], p < 0.0001). The optometrists demonstrated significantly higher AUROC (0.753 [95%CI: 0.744-0.762]) compared to AI (0.654 [95%CI: 0.645-0.662], p < 0.0001).

Conclusion: The AI system exhibited lower performance than optometrists in detecting referable glaucoma. Our findings suggest that while AI can serve as a screening tool, both AI and optometrists have suboptimal performance for the nuanced diagnosis of glaucoma using fundus photographs alone. Enhanced training with diverse populations for AI is essential for improving GON detection and addressing the significant challenge of undiagnosed cases.

Keywords: artificial intelligence; deep learning; glaucoma detection; primary care.

Abstract

Grants and funding