Lifelong Sequence Generation with Dynamic Module Expansion and Adaptation

Qin, Chengwei; Chen, Chen; Joty, Shafiq

Computer Science > Computation and Language

arXiv:2310.09886 (cs)

[Submitted on 15 Oct 2023 (v1), last revised 22 Nov 2023 (this version, v4)]

Title:Lifelong Sequence Generation with Dynamic Module Expansion and Adaptation

Authors:Chengwei Qin, Chen Chen, Shafiq Joty

View PDF

Abstract:Lifelong sequence generation (LSG), a problem in continual learning, aims to continually train a model on a sequence of generation tasks to learn constantly emerging new generation patterns while avoiding the forgetting of previous knowledge. Existing LSG methods mainly focus on maintaining old knowledge while paying little attention to knowledge transfer across tasks. In contrast, humans can better learn new tasks by leveraging previously acquired knowledge from similar tasks. Inspired by the learning paradigm of humans, we propose Dynamic Module Expansion and Adaptation (DMEA), which enables the model to dynamically determine the architecture for acquiring new knowledge based on task correlation and select the most similar previous tasks to facilitate adaptation to new tasks. In addition, as the learning process can easily be biased towards the current task which might cause more severe forgetting of previously learned knowledge, we propose dynamic gradient scaling to balance the learning of the current task and replayed tasks. With extensive experiments, we demonstrate that DMEA can consistently outperform existing methods in different LSG settings.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2310.09886 [cs.CL]
	(or arXiv:2310.09886v4 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2310.09886

Submission history

From: Chengwei Qin [view email]
[v1] Sun, 15 Oct 2023 16:51:11 UTC (7,947 KB)
[v2] Fri, 20 Oct 2023 07:37:48 UTC (7,946 KB)
[v3] Sun, 19 Nov 2023 13:52:19 UTC (7,948 KB)
[v4] Wed, 22 Nov 2023 06:44:16 UTC (7,948 KB)

Computer Science > Computation and Language

Title:Lifelong Sequence Generation with Dynamic Module Expansion and Adaptation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Lifelong Sequence Generation with Dynamic Module Expansion and Adaptation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators