Alchemy: A benchmark and analysis toolkit for meta-reinforcement learning agents

Wang, Jane X.; King, Michael; Porcel, Nicolas; Kurth-Nelson, Zeb; Zhu, Tina; Deck, Charlie; Choy, Peter; Cassin, Mary; Reynolds, Malcolm; Song, Francis; Buttimore, Gavin; Reichert, David P.; Rabinowitz, Neil; Matthey, Loic; Hassabis, Demis; Lerchner, Alexander; Botvinick, Matthew

arXiv:2102.02926 (cs)

[Submitted on 4 Feb 2021 (v1), last revised 20 Oct 2021 (this version, v3)]

Authors: Jane X. Wang, Michael King, Nicolas Porcel, Zeb Kurth-Nelson, Tina Zhu, Charlie Deck, Peter Choy, Mary Cassin, Malcolm Reynolds, Francis Song, Gavin Buttimore, David P. Reichert, Neil Rabinowitz, Loic Matthey, Demis Hassabis, Alexander Lerchner, Matthew Botvinick

Abstract: There has been rapidly growing interest in meta-learning as a method for increasing the flexibility and sample efficiency of reinforcement learning. One problem in this area of research, however, has been a scarcity of adequate benchmark tasks. In general, the structure underlying past benchmarks has either been too simple to be inherently interesting, or too ill-defined to support principled analysis. In the present work, we introduce a new benchmark for meta-RL research, emphasizing transparency and potential for in-depth analysis as well as structural richness. Alchemy is a 3D video game, implemented in Unity, which involves a latent causal structure that is resampled procedurally from episode to episode, affording structure learning, online inference, hypothesis testing and action sequencing based on abstract domain knowledge. We evaluate a pair of powerful RL agents on Alchemy and present an in-depth analysis of one of these agents. Results clearly indicate a frank and specific failure of meta-learning, providing validation for Alchemy as a challenging benchmark for meta-RL. Concurrent with this report, we are releasing Alchemy as a public resource, together with a suite of analysis tools and sample agent trajectories.

Comments:	Published in Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 2021
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2102.02926 [cs.LG]
	(or arXiv:2102.02926v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2102.02926

Submission history

From: Jane Wang
[v1] Thu, 4 Feb 2021 23:40:44 UTC (8,345 KB)
[v2] Sun, 17 Oct 2021 10:59:41 UTC (16,245 KB)
[v3] Wed, 20 Oct 2021 12:33:37 UTC (16,244 KB)
