Predicting Adverse Drug Reactions on Distributed Health Data using Federated Learning

Olivia Choudhury; Yoonyoung Park; Theodoros Salonidis; Aris Gkoulalas-Divanis; Issa Sylla; Amar K Das

Predicting Adverse Drug Reactions on Distributed Health Data using Federated Learning

AMIA Annu Symp Proc. 2020 Mar 4:2019:313-322. eCollection 2019.

Authors

Olivia Choudhury¹, Yoonyoung Park¹, Theodoros Salonidis², Aris Gkoulalas-Divanis³, Issa Sylla¹, Amar K Das¹

Affiliations

¹ IBM Research Cambridge, Massachusetts, USA.
² IBM T.J. Watson Research Center, Yorktown Heights, NY, USA.
³ IBM Watson Health, Cambridge, Massachusetts, USA.

PMID: 32308824
PMCID: PMC7153050

Abstract

Using electronic health data to predict adverse drug reaction (ADR) incurs practical challenges, such as lack of adequate data from any single site for rare ADR detection, resource constraints on integrating data from multiple sources, and privacy concerns with creating a centralized database from person-specific, sensitive data. We introduce a federated learning framework that can learn a global ADR prediction model from distributed health data held locally at different sites. We propose two novel methods of local model aggregation to improve the predictive capability of the global model. Through comprehensive experimental evaluation using real-world health data from 1 million patients, we demonstrate the effectiveness of our proposed approach in achieving comparable performance to centralized learning and outperforming localized learning models for two types of ADRs. We also demonstrate that, for varying data distributions, our aggregation methods outperform state-of-the-art techniques, in terms of precision, recall, and accuracy.

MeSH terms

Adverse Drug Reaction Reporting Systems*
Databases, Factual
Drug-Related Side Effects and Adverse Reactions*
Electronic Health Records*
Humans
Logistic Models
Machine Learning*
Support Vector Machine