Compositional Zero-Shot Domain Transfer with Text-to-Text Models

Liu, Fangyu; Liu, Qianchu; Bannur, Shruthi; Pérez-García, Fernando; Usuyama, Naoto; Zhang, Sheng; Naumann, Tristan; Nori, Aditya; Poon, Hoifung; Alvarez-Valle, Javier; Oktay, Ozan; Hyland, Stephanie L.

Computer Science > Computation and Language

arXiv:2303.13386 (cs)

[Submitted on 23 Mar 2023]

Title:Compositional Zero-Shot Domain Transfer with Text-to-Text Models

Authors:Fangyu Liu, Qianchu Liu, Shruthi Bannur, Fernando Pérez-García, Naoto Usuyama, Sheng Zhang, Tristan Naumann, Aditya Nori, Hoifung Poon, Javier Alvarez-Valle, Ozan Oktay, Stephanie L. Hyland

View PDF

Abstract:Label scarcity is a bottleneck for improving task performance in specialised domains. We propose a novel compositional transfer learning framework (DoT5 - domain compositional zero-shot T5) for zero-shot domain transfer. Without access to in-domain labels, DoT5 jointly learns domain knowledge (from MLM of unlabelled in-domain free text) and task knowledge (from task training on more readily available general-domain data) in a multi-task manner. To improve the transferability of task training, we design a strategy named NLGU: we simultaneously train NLG for in-domain label-to-data generation which enables data augmentation for self-finetuning and NLU for label prediction. We evaluate DoT5 on the biomedical domain and the resource-lean subdomain of radiology, focusing on NLI, text summarisation and embedding learning. DoT5 demonstrates the effectiveness of compositional transfer learning through multi-task learning. In particular, DoT5 outperforms the current SOTA in zero-shot transfer by over 7 absolute points in accuracy on RadNLI. We validate DoT5 with ablations and a case study demonstrating its ability to solve challenging NLI examples requiring in-domain expertise.

Comments:	Accepted at TACL, pre-MIT Press publication version. 16 pages, 4 figures
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2303.13386 [cs.CL]
	(or arXiv:2303.13386v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2303.13386

Submission history

From: Stephanie L. Hyland [view email]
[v1] Thu, 23 Mar 2023 15:58:41 UTC (365 KB)

Computer Science > Computation and Language

Title:Compositional Zero-Shot Domain Transfer with Text-to-Text Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Compositional Zero-Shot Domain Transfer with Text-to-Text Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators