Discrimination-aware classification methods aim to remedy socioeconomic disparities that machine learning systems can exacerbate. In this paper, we propose a novel data pre-processing technique that assigns weights to training instances in order to reduce discrimination without changing any inputs or labels. Whereas the existing reweighing approach considers only sensitive attributes, we refine the weights using both sensitive and insensitive attributes, formulating the weight assignment as a linear programming problem. The resulting weights can be plugged directly into any classification model that accepts instance weights. We demonstrate three advantages of our approach on synthetic and benchmark datasets. First, discrimination reduction comes at a small cost in accuracy. Second, our method is more scalable than most other pre-processing methods. Third, the trade-off between fairness and accuracy can be explicitly monitored by model users. Code is available at https://github.com/frnliang/refined_reweighing.
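To make the setting concrete, the sketch below illustrates the *baseline* reweighing scheme the abstract contrasts against (Kamiran and Calders' approach, which weights each instance by P(A=a)P(Y=y)/P(A=a,Y=y) so that the sensitive attribute and the label become statistically independent under the weighted distribution), and shows how such weights feed into any classifier that accepts instance weights. This is not the paper's refined LP formulation; the synthetic data and variable names are illustrative assumptions.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Toy data (illustrative): sensitive attribute a, one insensitive feature x, label y.
rng = np.random.default_rng(0)
n = 200
a = rng.integers(0, 2, n)
x = rng.normal(size=n) + a  # feature correlated with the sensitive attribute
y = (x + rng.normal(scale=0.5, size=n) > 0.5).astype(int)

# Classic reweighing baseline: weight each instance in cell (A=a, Y=y) by
# P(A=a) * P(Y=y) / P(A=a, Y=y), making A and Y independent when reweighted.
weights = np.empty(n)
for av in (0, 1):
    for yv in (0, 1):
        cell = (a == av) & (y == yv)
        if cell.any():
            weights[cell] = (a == av).mean() * (y == yv).mean() / cell.mean()

# The weights plug straight into any model that supports instance weights.
X = np.column_stack([x, a])
clf = LogisticRegression().fit(X, y, sample_weight=weights)
```

Under the weighted distribution, the positive rate P(Y=1 | A=a) is equalized across groups by construction; the paper's contribution is to refine such weights with insensitive attributes as well, via a linear program.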
Copyright: © 2024 Liang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.