Efficient Fine-Tuning of BERT Models on the Edge

Vucetic, Danilo; Tayaranian, Mohammadreza; Ziaeefard, Maryam; Clark, James J.; Meyer, Brett H.; Gross, Warren J.

doi:10.1109/ISCAS48785.2022.9937567

Computer Science > Machine Learning

arXiv:2205.01541 (cs)

[Submitted on 3 May 2022]

Title:Efficient Fine-Tuning of BERT Models on the Edge

Authors:Danilo Vucetic, Mohammadreza Tayaranian, Maryam Ziaeefard, James J. Clark, Brett H. Meyer, Warren J. Gross

View PDF

Abstract:Resource-constrained devices are increasingly the deployment targets of machine learning applications. Static models, however, do not always suffice for dynamic environments. On-device training of models allows for quick adaptability to new scenarios. With the increasing size of deep neural networks, as noted with the likes of BERT and other natural language processing models, comes increased resource requirements, namely memory, computation, energy, and time. Furthermore, training is far more resource intensive than inference. Resource-constrained on-device learning is thus doubly difficult, especially with large BERT-like models. By reducing the memory usage of fine-tuning, pre-trained BERT models can become efficient enough to fine-tune on resource-constrained devices. We propose Freeze And Reconfigure (FAR), a memory-efficient training regime for BERT-like models that reduces the memory usage of activation maps during fine-tuning by avoiding unnecessary parameter updates. FAR reduces fine-tuning time on the DistilBERT model and CoLA dataset by 30%, and time spent on memory operations by 47%. More broadly, reductions in metric performance on the GLUE and SQuAD datasets are around 1% on average.

Comments:	4 pages, 2 figures, 3 tables. To be published in ISCAS 2022 and made available on IEEE Xplore
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:2205.01541 [cs.LG]
	(or arXiv:2205.01541v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2205.01541
Related DOI:	https://doi.org/10.1109/ISCAS48785.2022.9937567

Submission history

From: Danilo Vucetic [view email]
[v1] Tue, 3 May 2022 14:51:53 UTC (4,549 KB)

Computer Science > Machine Learning

Title:Efficient Fine-Tuning of BERT Models on the Edge

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Efficient Fine-Tuning of BERT Models on the Edge

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators