Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration

Dai, Sunhao; Liu, Weihao; Zhou, Yuqi; Pang, Liang; Ruan, Rongju; Wang, Gang; Dong, Zhenhua; Xu, Jun; Wen, Ji-Rong

arXiv:2405.16546 (cs)

[Submitted on 26 May 2024 (v1), last revised 2 Jul 2024 (this version, v2)]

Abstract: The proliferation of Large Language Models (LLMs) has led to an influx of AI-generated content (AIGC) on the internet, transforming the corpus of Information Retrieval (IR) systems from solely human-written to a coexistence with LLM-generated content. The impact of this surge in AIGC on IR systems remains an open question, with the primary challenge being the lack of a dedicated benchmark for researchers. In this paper, we introduce Cocktail, a comprehensive benchmark tailored for evaluating IR models in this mixed-sourced data landscape of the LLM era. Cocktail consists of 16 diverse datasets with mixed human-written and LLM-generated corpora across various text retrieval tasks and domains. Additionally, to avoid the potential bias from previously included dataset information in LLMs, we also introduce an up-to-date dataset, named NQ-UTD, with queries derived from recent events. Through conducting over 1,000 experiments to assess state-of-the-art retrieval models against the benchmarked datasets in Cocktail, we uncover a clear trade-off between ranking performance and source bias in neural retrieval models, highlighting the necessity for a balanced approach in designing future IR systems. We hope Cocktail can serve as a foundational resource for IR research in the LLM era, with all data and code publicly available at this https URL.

Comments:	Accepted by Findings of ACL 2024; Datasets Link: this https URL
Subjects:	Information Retrieval (cs.IR); Computation and Language (cs.CL)
Cite as:	arXiv:2405.16546 [cs.IR]
	(or arXiv:2405.16546v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2405.16546

Submission history

From: Sunhao Dai
[v1] Sun, 26 May 2024 12:30:20 UTC (766 KB)
[v2] Tue, 2 Jul 2024 12:23:37 UTC (799 KB)
