We demonstrate a novel method of using unstructured health data for infectious disease surveillance. A model incorporating the dynamics of documentation of a test diagnosis (UTI) in free text, without using grammatical or syntactic analysis, achieved performance comparable to ICD-10 codes (sensitivity 57.3, positive predictive value 69.5%, negative predictive value 95.9%) and detected missed cases (15% of total).
Keywords: Detection; Electronic.
Copyright © 2020 Association for Professionals in Infection Control and Epidemiology, Inc. Published by Elsevier Inc. All rights reserved.