Post-marketing surveillance of anticancer drugs using natural language processing of electronic medical records

Yoshimasa Kawazoe; Kiminori Shimamoto; Tomohisa Seki; Masami Tsuchiya; Emiko Shinohara; Shuntaro Yada; Shoko Wakamiya; Shungo Imai; Satoko Hori; Eiji Aramaki

doi:10.1038/s41746-024-01323-1

Post-marketing surveillance of anticancer drugs using natural language processing of electronic medical records

NPJ Digit Med. 2024 Nov 9;7(1):315. doi: 10.1038/s41746-024-01323-1.

Authors

Affiliations

¹ Artificial Intelligence and Digital Twin in Healthcare, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan. [email protected].
² Artificial Intelligence and Digital Twin in Healthcare, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.
³ Department of Healthcare Information Management, The University of Tokyo Hospital, Tokyo, Japan.
⁴ Division of Drug Informatics, Keio University Faculty of Pharmacy, Tokyo, Japan.
⁵ Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, Nara, Japan.

Abstract

This study demonstrates that adverse events (AEs) extracted using natural language processing (NLP) from clinical texts reflect the known frequencies of AEs associated with anticancer drugs. Using data from 44,502 cancer patients at a single hospital, we identified cases prescribed anticancer drugs (platinum, PLT; taxane, TAX; pyrimidine, PYA) and compared them to non-treatment (NTx) group using propensity score matching. Over 365 days, AEs (peripheral neuropathy, PN; oral mucositis, OM; taste abnormality, TA; appetite loss, AL) were extracted from clinical text using an NLP tool. The hazard ratios (HRs) for the anticancer drugs were: PN, 1.15-1.95; OM, 3.11-3.85; TA, 3.48-4.71; and AL, 1.98-3.84; the HRs were significantly higher than that of the NTx group. Sensitivity analysis revealed that the HR for TA may have been underestimated; however, the remaining three types of AEs extracted from clinical text by NLP were consistently associated with the three anticancer drugs.

Abstract

Grants and funding