ChatGPT Based Data Augmentation for Improved Parameter-Efficient Debiasing of LLMs

Han, Pengrui; Kocielnik, Rafal; Saravanan, Adhithya; Jiang, Roy; Sharir, Or; Anandkumar, Anima

Computer Science > Computation and Language

arXiv:2402.11764 (cs)

[Submitted on 19 Feb 2024]

Title:ChatGPT Based Data Augmentation for Improved Parameter-Efficient Debiasing of LLMs

Authors:Pengrui Han, Rafal Kocielnik, Adhithya Saravanan, Roy Jiang, Or Sharir, Anima Anandkumar

View PDF

Abstract:Large Language models (LLMs), while powerful, exhibit harmful social biases. Debiasing is often challenging due to computational costs, data constraints, and potential degradation of multi-task language capabilities. This work introduces a novel approach utilizing ChatGPT to generate synthetic training data, aiming to enhance the debiasing of LLMs. We propose two strategies: Targeted Prompting, which provides effective debiasing for known biases but necessitates prior specification of bias in question; and General Prompting, which, while slightly less effective, offers debiasing across various categories. We leverage resource-efficient LLM debiasing using adapter tuning and compare the effectiveness of our synthetic data to existing debiasing datasets. Our results reveal that: (1) ChatGPT can efficiently produce high-quality training data for debiasing other LLMs; (2) data produced via our approach surpasses existing datasets in debiasing performance while also preserving internal knowledge of a pre-trained LLM; and (3) synthetic data exhibits generalizability across categories, effectively mitigating various biases, including intersectional ones. These findings underscore the potential of synthetic data in advancing the fairness of LLMs with minimal retraining cost.

Comments:	Accepted to EACL 2024 Workshop on Language Technology for Equality, Diversity, Inclusion (LT-EDI-2024)
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
MSC classes:	68T50
ACM classes:	I.2.7; K.4.1
Cite as:	arXiv:2402.11764 [cs.CL]
	(or arXiv:2402.11764v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2402.11764

Submission history

From: Rafal Kocielnik [view email]
[v1] Mon, 19 Feb 2024 01:28:48 UTC (2,124 KB)

Computer Science > Computation and Language

Title:ChatGPT Based Data Augmentation for Improved Parameter-Efficient Debiasing of LLMs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:ChatGPT Based Data Augmentation for Improved Parameter-Efficient Debiasing of LLMs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators