Label Semantic Aware Pre-training for Few-shot Text Classification

Mueller, Aaron; Krone, Jason; Romeo, Salvatore; Mansour, Saab; Mansimov, Elman; Zhang, Yi; Roth, Dan

arXiv:2204.07128 (cs)

[Submitted on 14 Apr 2022 (v1), last revised 29 May 2022 (this version, v2)]

Abstract: In text classification tasks, useful information is encoded in the label names. Label semantic aware systems have leveraged this information for improved text classification performance during fine-tuning and prediction. However, the use of label semantics during pre-training has not been extensively explored. We therefore propose Label Semantic Aware Pre-training (LSAP) to improve the generalization and data efficiency of text classification systems. LSAP incorporates label semantics into pre-trained generative models (T5 in our case) by performing secondary pre-training on labeled sentences from a variety of domains. As domain-general pre-training requires large amounts of data, we develop a filtering and labeling pipeline to automatically create sentence-label pairs from unlabeled text. We perform experiments on intent (ATIS, Snips, TOPv2) and topic classification (AG News, Yahoo! Answers). LSAP obtains significant accuracy improvements over state-of-the-art models for few-shot text classification while maintaining performance comparable to state of the art in high-resource settings.
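
The abstract does not give the exact input and output format used for secondary pre-training, so the following is only a minimal sketch of how sentence-label pairs might be cast as text-to-text examples for a generative model such as T5. The function names, the `classify: ` prefix, the label normalization rules, and the sample labels are all assumptions made for illustration, not the authors' pipeline.

```python
import re
from typing import Tuple


def label_to_text(label: str) -> str:
    """Turn an identifier-style label into plain lowercase words.

    Hypothetical normalization: the paper may process label names differently.
    """
    # Replace common separators (underscore, hyphen, dot, slash, colon) with spaces.
    words = re.sub(r"[_\-./:]+", " ", label)
    # Insert a space at camelCase boundaries, e.g. "PlayMusic" -> "Play Music".
    words = re.sub(r"(?<=[a-z])(?=[A-Z])", " ", words)
    # Lowercase and collapse repeated whitespace.
    return " ".join(words.lower().split())


def to_text_to_text(sentence: str, label: str,
                    prefix: str = "classify: ") -> Tuple[str, str]:
    """Build an (input, target) string pair for a generative classifier.

    The target is the label written out in natural language, so the model
    is trained to generate the label's words rather than a class index.
    The prefix is an assumed task marker, not one taken from the paper.
    """
    return prefix + sentence.strip(), label_to_text(label)


if __name__ == "__main__":
    # Illustrative sentence-label pairs; not drawn from the paper's data.
    pairs = [
        ("show me flights from boston to denver", "atis_flight"),
        ("play some jazz", "PlayMusic"),
    ]
    for sentence, label in pairs:
        print(to_text_to_text(sentence, label))
```

Writing the target as words instead of an opaque class ID is what lets a generative model draw on the meaning of the label name, which is the information the abstract argues is useful.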

Comments:	Accepted at ACL 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2204.07128 [cs.CL]
	(or arXiv:2204.07128v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2204.07128

Submission history

From: Aaron Mueller
[v1] Thu, 14 Apr 2022 17:33:34 UTC (6,178 KB)
[v2] Sun, 29 May 2022 18:48:54 UTC (6,178 KB)
