Testing Occupational Gender Bias in Language Models: Towards Robust Measurement and Zero-Shot Debiasing

Chen, Yuen; Raghuram, Vethavikashini Chithrra; Mattern, Justus; Sachan, Mrinmaya; Mihalcea, Rada; Schölkopf, Bernhard; Jin, Zhijing

Computer Science > Computation and Language

arXiv:2212.10678 (cs)

[Submitted on 20 Dec 2022 (v1), last revised 15 Jul 2024 (this version, v2)]

Title:Testing Occupational Gender Bias in Language Models: Towards Robust Measurement and Zero-Shot Debiasing

Authors:Yuen Chen, Vethavikashini Chithrra Raghuram, Justus Mattern, Mrinmaya Sachan, Rada Mihalcea, Bernhard Schölkopf, Zhijing Jin

View PDF HTML (experimental)

Abstract:Generated texts from large language models (LLMs) have been shown to exhibit a variety of harmful, human-like biases against various demographics. These findings motivate research efforts aiming to understand and measure such effects. Prior works have proposed benchmarks for identifying and techniques for mitigating these stereotypical associations. However, as recent research pointed out, existing benchmarks lack a robust experimental setup, hindering the inference of meaningful conclusions from their evaluation metrics. In this paper, we introduce a list of desiderata for robustly measuring biases in generative language models. Building upon these design principles, we propose a benchmark called OCCUGENDER, with a bias-measuring procedure to investigate occupational gender bias. We then use this benchmark to test several state-of-the-art open-source LLMs, including Llama, Mistral, and their instruction-tuned versions. The results show that these models exhibit substantial occupational gender bias. We further propose prompting techniques to mitigate these biases without requiring fine-tuning. Finally, we validate the effectiveness of our methods through experiments on the same set of models.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2212.10678 [cs.CL]
	(or arXiv:2212.10678v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2212.10678

Submission history

From: Yuen Chen [view email]
[v1] Tue, 20 Dec 2022 22:41:24 UTC (486 KB)
[v2] Mon, 15 Jul 2024 15:10:45 UTC (143 KB)

Computer Science > Computation and Language

Title:Testing Occupational Gender Bias in Language Models: Towards Robust Measurement and Zero-Shot Debiasing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Testing Occupational Gender Bias in Language Models: Towards Robust Measurement and Zero-Shot Debiasing

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators