diff History for Neural Language Agents

Piterbarg, Ulyana; Pinto, Lerrel; Fergus, Rob

Computer Science > Artificial Intelligence

arXiv:2312.07540 (cs)

[Submitted on 12 Dec 2023 (v1), last revised 11 Jun 2024 (this version, v3)]

Title:diff History for Neural Language Agents

Authors:Ulyana Piterbarg, Lerrel Pinto, Rob Fergus

View PDF HTML (experimental)

Abstract:Neural Language Models (LMs) offer an exciting solution for general-purpose embodied control. However, a key technical issue arises when using an LM-based controller: environment observations must be converted to text, which coupled with history, results in long and verbose textual prompts. As a result, prior work in LM agents is limited to restricted domains with small observation size as well as minimal needs for interaction history or instruction tuning. In this paper, we introduce diff history, a simple and highly effective solution to these issues. By applying the Unix diff command on consecutive text observations in the interaction histories used to prompt LM policies, we can both abstract away redundant information and focus the content of textual inputs on the salient changes in the environment. On NetHack, an unsolved video game that requires long-horizon reasoning for decision-making, LMs tuned with diff history match state-of-the-art performance for neural agents while needing 1800x fewer training examples compared to prior work. Even on the simpler BabyAI-Text environment with concise text observations, we find that although diff history increases the length of prompts, the representation it provides offers a 25% improvement in the efficiency of low-sample instruction tuning. Further, we show that diff history scales favorably across different tuning dataset sizes. We open-source our code and data to this https URL.

Comments:	ICML 2024 version
Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2312.07540 [cs.AI]
	(or arXiv:2312.07540v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2312.07540

Submission history

From: Ulyana Piterbarg [view email]
[v1] Tue, 12 Dec 2023 18:59:30 UTC (2,792 KB)
[v2] Wed, 14 Feb 2024 18:59:41 UTC (1,607 KB)
[v3] Tue, 11 Jun 2024 17:57:15 UTC (1,609 KB)

Computer Science > Artificial Intelligence

Title:diff History for Neural Language Agents

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:diff History for Neural Language Agents

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators