Patterns of Lexical Ambiguity in Contextualised Language Models

Haber, Janosch; Poesio, Massimo

Computer Science > Computation and Language

arXiv:2109.13032 (cs)

[Submitted on 27 Sep 2021 (v1), last revised 29 Sep 2021 (this version, v2)]

Title:Patterns of Lexical Ambiguity in Contextualised Language Models

Authors:Janosch Haber, Massimo Poesio

View PDF

Abstract:One of the central aspects of contextualised language models is that they should be able to distinguish the meaning of lexically ambiguous words by their contexts. In this paper we investigate the extent to which the contextualised embeddings of word forms that display multiplicity of sense reflect traditional distinctions of polysemy and homonymy. To this end, we introduce an extended, human-annotated dataset of graded word sense similarity and co-predication acceptability, and evaluate how well the similarity of embeddings predicts similarity in meaning. Both types of human judgements indicate that the similarity of polysemic interpretations falls in a continuum between identity of meaning and homonymy. However, we also observe significant differences within the similarity ratings of polysemes, forming consistent patterns for different types of polysemic sense alternation. Our dataset thus appears to capture a substantial part of the complexity of lexical ambiguity, and can provide a realistic test bed for contextualised embeddings. Among the tested models, BERT Large shows the strongest correlation with the collected word sense similarity ratings, but struggles to consistently replicate the observed similarity patterns. When clustering ambiguous word forms based on their embeddings, the model displays high confidence in discerning homonyms and some types of polysemic alternations, but consistently fails for others.

Comments:	Accepted at Findings of EMNLP 2021. Data available at this https URL . 9 pages, 4 figure, 4 tables. Includes appendix with 3 figures
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2109.13032 [cs.CL]
	(or arXiv:2109.13032v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.13032

Submission history

From: Janosch Haber [view email]
[v1] Mon, 27 Sep 2021 13:11:44 UTC (1,265 KB)
[v2] Wed, 29 Sep 2021 12:40:45 UTC (1,265 KB)

Computer Science > Computation and Language

Title:Patterns of Lexical Ambiguity in Contextualised Language Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Patterns of Lexical Ambiguity in Contextualised Language Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators