A case study in applying artificial intelligence-based named entity recognition to develop an automated ophthalmic disease registry

Carmelo Z Macri; Sheng Chieh Teoh; Stephen Bacchi; Ian Tan; Robert Casson; Michelle T Sun; Dinesh Selva; WengOnn Chan

doi:10.1007/s00417-023-06190-2

A case study in applying artificial intelligence-based named entity recognition to develop an automated ophthalmic disease registry

Graefes Arch Clin Exp Ophthalmol. 2023 Nov;261(11):3335-3344. doi: 10.1007/s00417-023-06190-2. Epub 2023 Aug 3.

Authors

Carmelo Z Macri^{1

2}, Sheng Chieh Teoh³, Stephen Bacchi^{4

3}, Ian Tan³, Robert Casson^{4

3}, Michelle T Sun^{4

3}, Dinesh Selva^{4

3}, WengOnn Chan^{4

3}

Affiliations

¹ Discipline of Ophthalmology and Visual Sciences, The University of Adelaide, Adelaide, South Australia, Australia. [email protected].
² Department of Ophthalmology, The Royal Adelaide Hospital, Adelaide, South Australia, Australia. [email protected].
³ Department of Ophthalmology, The Royal Adelaide Hospital, Adelaide, South Australia, Australia.
⁴ Discipline of Ophthalmology and Visual Sciences, The University of Adelaide, Adelaide, South Australia, Australia.

Abstract

Purpose: Advances in artificial intelligence (AI)-based named entity extraction (NER) have improved the ability to extract diagnostic entities from unstructured, narrative, free-text data in electronic health records. However, there is a lack of ready-to-use tools and workflows to encourage the use among clinicians who often lack experience and training in AI. We sought to demonstrate a case study for developing an automated registry of ophthalmic diseases accompanied by a ready-to-use low-code tool for clinicians.

Methods: We extracted deidentified electronic clinical records from a single centre's adult outpatient ophthalmology clinic from November 2019 to May 2022. We used a low-code annotation software tool (Prodigy) to annotate diagnoses and train a bespoke spaCy NER model to extract diagnoses and create an ophthalmic disease registry.

Results: A total of 123,194 diagnostic entities were extracted from 33,455 clinical records. After decapitalisation and removal of non-alphanumeric characters, there were 5070 distinct extracted diagnostic entities. The NER model achieved a precision of 0.8157, recall of 0.8099, and F score of 0.8128.

Conclusion: We presented a case study using low-code artificial intelligence-based NLP tools to produce an automated ophthalmic disease registry. The workflow created a NER model with a moderate overall ability to extract diagnoses from free-text electronic clinical records. We have produced a ready-to-use tool for clinicians to implement this low-code workflow in their institutions and encourage the uptake of artificial intelligence methods for case finding in electronic health records.

Keywords: Application; Artificial intelligence; Case study; Electronic health records; Named entity recognition; Registry; Tool.