Aims: Periodical screening for diabetic retinopathy (DR) by an ophthalmologist is expensive and demanding. Automated DR image evaluation with Artificial Intelligence tools may represent a clinical and cost-effective alternative for the detection of retinopathy. We aimed to evaluate the accuracy and reliability of a machine learning algorithm.
Methods: This was an observational diagnostic precision study that compared human grader classification with that of DAIRET®, an algorithm nested in an electronic medical record powered by Retmarker SA. Retinal images were taken from 637 consecutive patients attending a routine annual diabetic visit between June 2021 and February 2023. They were manually graded by an ophthalmologist following the International Clinical Diabetic Retinopathy Severity Scale and the results were compared with those of the AI responses. The main outcome measures were screening performance, such as sensitivity and specificity and diagnostic accuracy by 95% confidence intervals.
Results: The rate of cases classified as ungradable was 1.2%, a figure consistent with the literature. DAIRET® sensitivity in the detection of cases of referable DR (moderate and above, "sight-threatening" forms of retinopathy) was equal to 1 (100%). The specificity, that is the true negative rate of absence of DR, was 80 ± 0.04.
Conclusions: DAIRET® achieved excellent sensitivity for referable retinopathy compared with that of human graders. This is undoubtedly the key finding of the study and translates into the certainty that no patient in need of the ophthalmologist is misdiagnosed as negative. It also had sufficient specificity to represent a cost-effective alternative to manual grade alone.
Keywords: Artificial Intelligence; Diabetic Retinopathy Screening; Machine Learning; Ophthalmologist referral; Sensitivity Specificity study.
© 2023. Springer-Verlag Italia S.r.l., part of Springer Nature.