Sociodemographic bias in clinical machine learning models: A scoping review of algorithmic bias instances and mechanisms

Michael Colacci; Yu Qing Huang; Gemma Postill; Pavel Zhelnov; Orna Fennelly; Amol Verma; Sharon Straus; Andrea C Tricco

doi:10.1016/j.jclinepi.2024.111606

Sociodemographic bias in clinical machine learning models: A scoping review of algorithmic bias instances and mechanisms

J Clin Epidemiol. 2024 Nov 10:111606. doi: 10.1016/j.jclinepi.2024.111606. Online ahead of print.

Authors

Michael Colacci¹, Yu Qing Huang², Gemma Postill³, Pavel Zhelnov⁴, Orna Fennelly⁴, Amol Verma⁵, Sharon Straus⁵, Andrea C Tricco²

Affiliations

¹ St. Michael's Hospital, Unity Health Toronto, Toronto, Canada; Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, Canada. Electronic address: [email protected].
² St. Michael's Hospital, Unity Health Toronto, Toronto, Canada; Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, Canada.
³ Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, Canada; Temerty Faculty of Medicine, University of Toronto, Toronto, Canada.
⁴ St. Michael's Hospital, Unity Health Toronto, Toronto, Canada.
⁵ St. Michael's Hospital, Unity Health Toronto, Toronto, Canada; Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, Canada; Temerty Faculty of Medicine, University of Toronto, Toronto, Canada.

PMID: 39532254
DOI: 10.1016/j.jclinepi.2024.111606

Abstract

Background: Clinical machine learning (ML) technologies can sometimes be biased and their use could exacerbate health disparities. The extent to which bias is present, the groups who most frequently experience bias, and the mechanism through which bias is introduced in clinical ML applications is not well described. The objective of this study was to examine instances of bias in clinical ML models. We identified the sociodemographic subgroups (using the PROGRESS-Plus framework) that experienced bias and the reported mechanisms of bias introduction METHODS: We searched MEDLINE, EMBASE, PsycINFO and Web of Science for all studies that evaluated bias on sociodemographic factors within ML algorithms created for the purpose of facilitating clinical care. The scoping review was conducted according to the JBI guide and reported using the PRISMA extension for scoping reviews.

Results: We identified 6448 articles, of which 760 reported on a clinical ML model and 91 (12.0%) completed a bias evaluation and met all inclusion criteria. Most studies evaluated a single sociodemographic factor (n=56, 61.5%). The most frequently evaluated sociodemographic factor was race (n=59, 64.8%), followed by sex/gender (n=41, 45.1%), and age (n=24, 26.4%), with one study (1.1%) evaluating intersectional factors. Of all studies, 74.7% (n=68) reported that bias was present, 18.7% (n=17) reported bias was not present, and 6.6% (n=6) did not state whether bias was present. When present, 87% of studies reported bias against groups with socioeconomic disadvantage.

Conclusion: Most ML algorithms that were evaluated for bias demonstrated bias on sociodemographic factors. Furthermore, most bias evaluations concentrated on race, sex/gender, and age, while other sociodemographic factors and their intersection were infrequently assessed. Given potential health equity implications, bias assessments should be completed for all clinical ML models.