Broadening the Capture of Natural Products Mentioned in FAERS Using Fuzzy String-Matching and a Siamese Neural Network

Res Sq [Preprint]. 2023 Aug 23:rs.3.rs-3283654. doi: 10.21203/rs.3.rs-3283654/v1.

Abstract

Increased sales of natural products (NPs) in the US and growing safety concerns highlight the need for NP pharmacovigilance. A challenge for NP pharmacovigilance is ambiguity when referring to NPs in spontaneous reporting systems. We used a combination of fuzzy string-matching and a neural network to reduce this ambiguity. We aim to increase the capture of reports involving NPs in the US Food and Drug Administration Adverse Event Reporting System (FAERS). Gestalt pattern-matching (GPM) and Siamese neural network (SM) were used to identify potential mentions of NPs of interest in 389,386 FAERS reports with unmapped drug names. We refined the identified candidates through manual review and annotation by health professionals. After adjudication, GPM identified 595 unique NP names and SM 504. There was little overlap between candidates identified by the approaches (Non-overlapping: GPM 347, SM 248). In total, 686 novel NP names were identified in the unmapped FAERS reports. Including these names in the FAERS collection yielded 3,486 additional reports mentioning NPs.

Keywords: Adverse Events; Gestalt Pattern-Matching; Natural Products; Pharmacovigilance; Siamese Model Similarity.

Publication types

  • Preprint