Deep Learning and Glaucoma Specialists: The Relative Importance of Optic Disc Features to Predict Glaucoma Referral in Fundus Photographs

Sonia Phene; R Carter Dunn; Naama Hammel; Yun Liu; Jonathan Krause; Naho Kitade; Mike Schaekermann; Rory Sayres; Derek J Wu; Ashish Bora; Christopher Semturs; Anita Misra; Abigail E Huang; Arielle Spitze; Felipe A Medeiros; April Y Maa; Monica Gandhi; Greg S Corrado; Lily Peng; Dale R Webster

doi:10.1016/j.ophtha.2019.07.024

Ophthalmology. 2019 Dec;126(12):1627-1639. doi: 10.1016/j.ophtha.2019.07.024. Epub 2019 Sep 24.

Authors

Sonia Phene¹, R Carter Dunn¹, Naama Hammel², Yun Liu¹, Jonathan Krause¹, Naho Kitade¹, Mike Schaekermann¹, Rory Sayres¹, Derek J Wu¹, Ashish Bora¹, Christopher Semturs¹, Anita Misra¹, Abigail E Huang¹, Arielle Spitze³, Felipe A Medeiros⁴, April Y Maa⁵, Monica Gandhi⁶, Greg S Corrado¹, Lily Peng¹, Dale R Webster¹

Affiliations

¹ Google Health, Google LLC, Mountain View, California.
² Google Health, Google LLC, Mountain View, California.
³ Virginia Ophthalmology Associates, Norfolk, Virginia; Department of Ophthalmology, Eastern Virginia Medical School, Norfolk, Virginia.
⁴ Department of Ophthalmology, Duke University, Durham, North Carolina.
⁵ Department of Ophthalmology, Emory University School of Medicine, Atlanta, Georgia; Ophthalmology Section, Atlanta Veterans Affairs Medical Center, Atlanta, Georgia.
⁶ Dr. Shroff's Charity Eye Hospital, New Delhi, India.

PMID: 31561879
DOI: 10.1016/j.ophtha.2019.07.024

Abstract

Purpose: To develop and validate a deep learning (DL) algorithm that predicts referable glaucomatous optic neuropathy (GON) and optic nerve head (ONH) features from color fundus images, to determine the relative importance of these features in referral decisions by glaucoma specialists (GSs) and the algorithm, and to compare the performance of the algorithm with eye care providers.

Design: Development and validation of an algorithm.

Participants: Fundus images from screening programs, studies, and a glaucoma clinic.

Methods: A DL algorithm was trained using a retrospective dataset of 86 618 images, assessed for glaucomatous ONH features and referable GON (defined as ONH appearance worrisome enough to justify referral for comprehensive examination) by 43 graders. The algorithm was validated using 3 datasets: dataset A (1205 images, 1 image/patient; 18.1% referable), images adjudicated by panels of GSs; dataset B (9642 images, 1 image/patient; 9.2% referable), images from a diabetic teleretinal screening program; and dataset C (346 images, 1 image/patient; 81.7% referable), images from a glaucoma clinic.

Main outcome measures: The algorithm was evaluated using the area under the receiver operating characteristic curve (AUC), sensitivity, and specificity for referable GON and glaucomatous ONH features.
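For readers less familiar with these outcome measures, the following is a minimal sketch of how AUC, sensitivity, and specificity can be computed from binary reference labels and continuous algorithm scores. It is an illustration only and not the authors' evaluation code: the function names, the toy labels and scores, and the 0.5 operating threshold are all assumptions introduced here.

```python
from typing import Sequence, Tuple


def auc_rank(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a randomly chosen positive case
    scores higher than a randomly chosen negative case (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def sensitivity_specificity(
    labels: Sequence[int], scores: Sequence[float], threshold: float
) -> Tuple[float, float]:
    """Sensitivity and specificity when scores >= threshold are called referable."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    tn = sum(1 for y, s in zip(labels, scores) if y == 0 and s < threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical toy data: 1 = referable GON, 0 = not referable.
toy_labels = [1, 1, 1, 0, 0, 0, 0, 1]
toy_scores = [0.9, 0.8, 0.4, 0.3, 0.5, 0.2, 0.1, 0.7]

toy_auc = auc_rank(toy_labels, toy_scores)
toy_sens, toy_spec = sensitivity_specificity(toy_labels, toy_scores, threshold=0.5)
print(f"AUC={toy_auc:.4f} sensitivity={toy_sens:.2f} specificity={toy_spec:.2f}")
```

AUC summarizes discrimination across all thresholds, whereas sensitivity and specificity describe a single operating point, which is why the two kinds of measure are reported separately.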

Results: The algorithm's AUC for referable GON was 0.945 (95% confidence interval [CI], 0.929-0.960) in dataset A, 0.855 (95% CI, 0.841-0.870) in dataset B, and 0.881 (95% CI, 0.838-0.918) in dataset C. Algorithm AUCs ranged between 0.661 and 0.973 for glaucomatous ONH features. The algorithm showed significantly higher sensitivity than 7 of 10 graders not involved in determining the reference standard, including 2 of 3 GSs, and showed higher specificity than 3 graders (including 1 GS), while remaining comparable to others. For both GSs and the algorithm, the most crucial features related to referable GON were: presence of vertical cup-to-disc ratio of 0.7 or more, neuroretinal rim notching, retinal nerve fiber layer defect, and bared circumlinear vessels.
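The abstract does not state how the 95% confidence intervals were obtained. One common approach is a nonparametric percentile bootstrap, sketched below under that assumption. The synthetic data, the function names, and the parameters (number of resamples, seed) are hypothetical and chosen only for illustration.

```python
import random
from typing import List, Sequence, Tuple


def auc_rank(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC via pairwise comparison of positive and negative scores."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_auc_ci(
    labels: Sequence[int],
    scores: Sequence[float],
    n_boot: int = 500,
    alpha: float = 0.05,
    seed: int = 0,
) -> Tuple[float, float]:
    """Percentile bootstrap CI: resample images with replacement,
    recompute the AUC each time, and take the empirical quantiles."""
    rng = random.Random(seed)
    n = len(labels)
    stats: List[float] = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        # Skip resamples that contain only one class (AUC undefined).
        if 0 < sum(ys) < n:
            stats.append(auc_rank(ys, [scores[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


# Synthetic example (assumed): about 20% positives, higher scores for positives.
data_rng = random.Random(1)
sim_labels = [1 if data_rng.random() < 0.2 else 0 for _ in range(200)]
sim_scores = [data_rng.gauss(1.5 if y else 0.0, 1.0) for y in sim_labels]

point_auc = auc_rank(sim_labels, sim_scores)
ci_low, ci_high = bootstrap_auc_ci(sim_labels, sim_scores)
print(f"AUC={point_auc:.3f} (95% CI, {ci_low:.3f}-{ci_high:.3f})")
```

Interval width shrinks as the number of images grows, which is consistent with the narrower interval reported for dataset B (9642 images) than for dataset C (346 images).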

Conclusions: A DL algorithm trained on fundus images alone can detect referable GON with higher sensitivity than, and specificity comparable to, that of eye care providers. The algorithm maintained good performance on an independent dataset with diagnoses based on a full glaucoma workup.

Publication types

Comparative Study
Research Support, Non-U.S. Gov't
Validation Study

MeSH terms

Aged
Area Under Curve
Datasets as Topic
Deep Learning*
Female
Glaucoma, Open-Angle / diagnosis*
Humans
Male
Middle Aged
Nerve Fibers / pathology
Ophthalmologists*
Optic Disk / pathology*
Optic Nerve Diseases / diagnosis*
ROC Curve
Referral and Consultation
Retinal Ganglion Cells / pathology
Retrospective Studies
Sensitivity and Specificity
Specialization*