Survey of Hallucination in Natural Language Generation

Ji, Ziwei; Lee, Nayeon; Frieske, Rita; Yu, Tiezheng; Su, Dan; Xu, Yan; Ishii, Etsuko; Bang, Yejin; Madotto, Andrea; Fung, Pascale

Computer Science > Computation and Language

arXiv:2202.03629v1 (cs)

[Submitted on 8 Feb 2022 (this version), latest version 14 Jul 2024 (v7)]

Title:Survey of Hallucination in Natural Language Generation

Authors:Ziwei Ji, Nayeon Lee, Rita Frieske, Tiezheng Yu, Dan Su, Yan Xu, Etsuko Ishii, Yejin Bang, Andrea Madotto, Pascale Fung

View PDF

Abstract:Natural Language Generation (NLG) has improved exponentially in recent years thanks to the development of deep learning technologies such as Transformer-based language models. This advancement has led to more fluent and coherent natural language generation, naturally leading to development in downstream tasks such as abstractive summarization, dialogue generation and data-to-text generation. However, it is also investigated that such generation includes hallucinated texts, which makes the performances of text generation fail to meet users' expectations in many real-world scenarios. In order to address this issue, studies in evaluation and mitigation methods of hallucinations have been presented in various tasks, but have not been reviewed in a combined manner. In this survey, we provide a broad overview of the research progress and challenges in the hallucination problem of NLG. The survey is organized into two big divisions: (i) a general overview of metrics, mitigation methods, and future directions; (ii) task-specific research progress for hallucinations in a large set of downstream tasks: abstractive summarization, dialogue generation, generative question answering, data-to-text generation, and machine translation. This survey could facilitate collaborative efforts among researchers in these tasks.

Subjects:	Computation and Language (cs.CL)
ACM classes:	A.1
Cite as:	arXiv:2202.03629 [cs.CL]
	(or arXiv:2202.03629v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2202.03629

Submission history

From: Ziwei Ji [view email]
[v1] Tue, 8 Feb 2022 03:55:01 UTC (947 KB)
[v2] Tue, 22 Feb 2022 05:42:44 UTC (175 KB)
[v3] Tue, 5 Apr 2022 09:25:33 UTC (170 KB)
[v4] Tue, 10 May 2022 05:02:49 UTC (169 KB)
[v5] Mon, 7 Nov 2022 14:51:16 UTC (3,548 KB)
[v6] Mon, 19 Feb 2024 14:13:08 UTC (3,589 KB)
[v7] Sun, 14 Jul 2024 12:40:59 UTC (3,635 KB)

Computer Science > Computation and Language

Title:Survey of Hallucination in Natural Language Generation

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Survey of Hallucination in Natural Language Generation

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators