MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models

Aggarwal, Divyanshu; Sathe, Ashutosh; Watts, Ishaan; Sitaram, Sunayana

Computer Science > Computation and Language

arXiv:2401.07598 (cs)

[Submitted on 15 Jan 2024 (v1), last revised 22 Jul 2024 (this version, v3)]

Title:MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models

Authors:Divyanshu Aggarwal, Ashutosh Sathe, Ishaan Watts, Sunayana Sitaram

View PDF HTML (experimental)

Abstract:Parameter Efficient Finetuning (PEFT) has emerged as a viable solution for improving the performance of Large Language Models (LLMs) without requiring massive resources and compute. Prior work on multilingual evaluation has shown that there is a large gap between the performance of LLMs on English and other languages. Further, there is also a large gap between the performance of smaller open-source models and larger LLMs. Finetuning can be an effective way to bridge this gap and make language models more equitable. In this work, we finetune the LLama-2-7B and Mistral-7B models on two synthetic multilingual instruction tuning datasets to determine its effect on model performance on six downstream tasks covering forty languages in all. Additionally, we experiment with various parameters, such as rank for low-rank adaptation and values of quantisation to determine their effects on downstream performance and find that higher rank and higher quantisation values benefit low-resource languages. We find that PEFT of smaller open-source models sometimes bridges the gap between the performance of these models and the larger ones, however, English performance can take a hit. We also find that finetuning sometimes improves performance on low-resource languages, while degrading performance on high-resource languages.

Comments:	46 pages, 23 figures, 45 tables. Accepted in ACL 2024 findings
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2401.07598 [cs.CL]
	(or arXiv:2401.07598v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2401.07598

Submission history

From: Divyanshu Aggarwal [view email]
[v1] Mon, 15 Jan 2024 11:06:43 UTC (2,900 KB)
[v2] Tue, 20 Feb 2024 08:06:39 UTC (1,576 KB)
[v3] Mon, 22 Jul 2024 11:13:54 UTC (2,486 KB)

Computer Science > Computation and Language

Title:MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators