Longitudinal Screening for Diabetic Retinopathy in a Nationwide Screening Program: Comparing Deep Learning and Human Graders

Jirawut Limwattanayingyong; Variya Nganthavee; Kasem Seresirikachorn; Tassapol Singalavanija; Ngamphol Soonthornworasiri; Varis Ruamviboonsuk; Chetan Rao; Rajiv Raman; Andrzej Grzybowski; Mike Schaekermann; Lily H Peng; Dale R Webster; Christopher Semturs; Jonathan Krause; Rory Sayres; Fred Hersch; Richa Tiwari; Yun Liu; Paisan Ruamviboonsuk

doi:10.1155/2020/8839376

J Diabetes Res. 2020 Dec 15:2020:8839376. doi: 10.1155/2020/8839376. eCollection 2020.

Authors

Jirawut Limwattanayingyong¹, Variya Nganthavee¹, Kasem Seresirikachorn¹, Tassapol Singalavanija², Ngamphol Soonthornworasiri³, Varis Ruamviboonsuk⁴, Chetan Rao⁵, Rajiv Raman⁵, Andrzej Grzybowski⁶ ⁷, Mike Schaekermann⁸, Lily H Peng⁸, Dale R Webster⁸, Christopher Semturs⁸, Jonathan Krause⁸, Rory Sayres⁸, Fred Hersch⁸, Richa Tiwari⁹, Yun Liu⁸, Paisan Ruamviboonsuk¹

Affiliations

¹ Department of Ophthalmology, College of Medicine, Rangsit University, Rajavithi Hospital, Bangkok, Thailand.
² Department of Ophthalmology, Chulabhorn Hospital, HRH Princess Chulabhorn College of Medical Science, Chulabhorn Royal Academy, Bangkok, Thailand.
³ Department of Tropical Hygiene, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand.
⁴ Department of Biochemistry, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.
⁵ Shri Bhagwan Mahavir Vitreoretinal Services, Sankara Nethralaya, Chennai, Tamil Nadu, India.
⁶ Department of Ophthalmology, University of Warmia and Mazury, Olsztyn, Poland.
⁷ Institute for Research in Ophthalmology, Foundation for Ophthalmology Development, Poznan, Poland.
⁸ Google Health, Palo Alto, CA, USA.
⁹ Work done at Google via Optimum Solutions Pte Ltd, Singapore.

Abstract

Objective: To evaluate diabetic retinopathy (DR) screening via deep learning (DL) and trained human graders (HG) in a longitudinal cohort, as case spectrum shifts based on treatment referral and new-onset DR.

Methods: We randomly selected patients with diabetes screened twice, two years apart within a nationwide screening program. The reference standard was established via adjudication by retina specialists. Each patient's color fundus photographs were graded, and a patient was considered as having sight-threatening DR (STDR) if the worse eye had severe nonproliferative DR, proliferative DR, or diabetic macular edema. We compared DR screening via two modalities: DL and HG. For each modality, we simulated treatment referral by excluding patients with detected STDR from the second screening using that modality.
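The patient-level STDR rule and the simulated referral step described in the Methods can be sketched roughly as follows. This is a minimal illustration, not the study's code: the `Patient` record, the finding labels, and the toy grades are all hypothetical, and ungradable images are ignored.

```python
from dataclasses import dataclass

# Per-eye findings that make a patient STDR, following the Methods definition.
# The label strings themselves are hypothetical.
STDR_FINDINGS = {"severe_npdr", "pdr", "dme"}


@dataclass
class Patient:
    patient_id: str     # hypothetical identifier
    eye_findings: list  # one set of findings per eye, as graded by one modality


def has_stdr(patient):
    """True if either eye (and therefore the worse eye) shows a qualifying finding."""
    return any(findings & STDR_FINDINGS for findings in patient.eye_findings)


def second_screening_cohort(first_round):
    """Simulate referral: drop patients the modality flagged as STDR in round one."""
    return [p for p in first_round if not has_stdr(p)]


# Toy first-round grades from a single modality (made-up data, not from the study).
first_round = [
    Patient("A", [set(), {"moderate_npdr"}]),
    Patient("B", [{"pdr"}, set()]),
    Patient("C", [{"mild_npdr"}, {"dme"}]),
    Patient("D", [set(), set()]),
]

remaining = [p.patient_id for p in second_screening_cohort(first_round)]
print(remaining)
```

Because the exclusion uses each modality's own detections rather than the reference standard, STDR cases that a modality misses in the first round would stay in its second-round cohort under this sketch.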

Results: There were 5,738 patients (12.3% STDR) in the first screening. DL and HG captured different numbers of STDR cases, and after simulated referral and excluding ungradable cases, 4,148 and 4,263 patients remained in the second screening, respectively. The STDR prevalence at the second screening was 5.1% and 6.8% for DL- and HG-based screening, respectively. Along with the prevalence decrease, the sensitivity for both modalities decreased from the first to the second screening (DL: from 95% to 90%, p = 0.008; HG: from 74% to 57%, p < 0.001). At both the first and second screenings, the rate of false negatives for DL was roughly a fifth that of HG (0.5-0.6% vs. 2.9-3.2%).
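As a rough consistency check, the reported false negative rates can be approximated from the prevalence and sensitivity figures above. The sketch assumes the false negative rate is expressed as missed STDR cases over all screened patients, i.e. prevalence × (1 − sensitivity). The abstract does not state this definition, and the first-screening prevalence may refer to a slightly different denominator than the sensitivity estimates, so the result is only approximate.

```python
# Figures quoted in the abstract: (STDR prevalence, sensitivity) per screening round.
reported = {
    ("DL", "first"): (0.123, 0.95),
    ("DL", "second"): (0.051, 0.90),
    ("HG", "first"): (0.123, 0.74),
    ("HG", "second"): (0.068, 0.57),
}


def false_negative_rate(prevalence, sensitivity):
    """Missed STDR cases as a fraction of all screened patients (assumed definition)."""
    return prevalence * (1.0 - sensitivity)


fn_rates = {key: false_negative_rate(*values) for key, values in reported.items()}

for (modality, screening), rate in fn_rates.items():
    print(f"{modality} {screening} screening: {rate:.2%}")
```

Under this assumption the DL values land at about 0.5-0.6% and the HG values at about 2.9-3.2%, in line with the ranges reported in the Results.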

Conclusion: On 2-year longitudinal follow-up of a DR screening cohort, STDR prevalence decreased for both DL- and HG-based screening. Follow-up screenings in longitudinal DR screening can be more difficult and yield lower sensitivity for both DL and HG, though the false negative rate was substantially lower for DL. Our data may be useful for health-economics analyses of longitudinal screening settings.

Publication types

Comparative Study

MeSH terms

Aged
Cell Proliferation
Deep Learning*
Diabetic Retinopathy / diagnostic imaging*
Diabetic Retinopathy / epidemiology
Female
Fundus Oculi*
Humans
Image Interpretation, Computer-Assisted*
Incidence
Longitudinal Studies
Macular Edema / diagnostic imaging*
Macular Edema / epidemiology
Male
Mass Screening*
Middle Aged
National Health Programs
Photography*
Predictive Value of Tests
Prevalence
Reproducibility of Results
Severity of Illness Index
Thailand / epidemiology