Learning to Abstract with Nonparametric Variational Information Bottleneck

Behjati, Melika; Fehr, Fabio; Henderson, James

Computer Science > Computation and Language

arXiv:2310.17284v1 (cs)

[Submitted on 26 Oct 2023]

Title:Learning to Abstract with Nonparametric Variational Information Bottleneck

Authors:Melika Behjati, Fabio Fehr, James Henderson

View PDF

Abstract:Learned representations at the level of characters, sub-words, words and sentences, have each contributed to advances in understanding different NLP tasks and linguistic phenomena. However, learning textual embeddings is costly as they are tokenization specific and require different models to be trained for each level of abstraction. We introduce a novel language representation model which can learn to compress to different levels of abstraction at different layers of the same model. We apply Nonparametric Variational Information Bottleneck (NVIB) to stacked Transformer self-attention layers in the encoder, which encourages an information-theoretic compression of the representations through the model. We find that the layers within the model correspond to increasing levels of abstraction and that their representations are more linguistically informed. Finally, we show that NVIB compression results in a model which is more robust to adversarial perturbations.

Comments:	Accepted to Findings of EMNLP 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2310.17284 [cs.CL]
	(or arXiv:2310.17284v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2310.17284

Submission history

From: Fabio Fehr [view email]
[v1] Thu, 26 Oct 2023 10:04:31 UTC (9,008 KB)

Computer Science > Computation and Language

Title:Learning to Abstract with Nonparametric Variational Information Bottleneck

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Learning to Abstract with Nonparametric Variational Information Bottleneck

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators