On the importance of data collection for training general goal-reaching policies

Jacq, Alexis; Orsini, Manu; Dulac-Arnold, Gabriel; Pietquin, Olivier; Geist, Matthieu; Bachem, Olivier

Computer Science > Artificial Intelligence

arXiv:2211.03521v2 (cs)

[Submitted on 7 Nov 2022 (v1), last revised 20 Feb 2023 (this version, v2)]

Title:On the importance of data collection for training general goal-reaching policies

Authors:Alexis Jacq, Manu Orsini, Gabriel Dulac-Arnold, Olivier Pietquin, Matthieu Geist, Olivier Bachem

View PDF

Abstract:Recent advances in ML suggest that the quantity of data available to a model is one of the primary bottlenecks to high performance. Although for language-based tasks there exist almost unlimited amounts of reasonably coherent data to train from, this is generally not the case for Reinforcement Learning, especially when dealing with a novel environment. In effect, even a relatively trivial continuous environment has an almost limitless number of states, but simply sampling random states and actions will likely not provide transitions that are interesting or useful for any potential downstream task. How should one generate massive amounts of useful data given only an MDP with no indication of downstream tasks? Are the quantity and quality of data truly transformative to the performance of a general controller? We propose to answer both of these questions. First, we introduce a principled unsupervised exploration method, ChronoGEM, which aims to achieve uniform coverage over the manifold of achievable states, which we believe is the most reasonable goal given no prior task information. Secondly, we investigate the effects of both data quantity and data quality on the training of a downstream goal-achievement policy, and show that both large quantities and high-quality of data are essential to train a general controller: a high-precision pose-achievement policy capable of attaining a large number of poses over numerous continuous control embodiments including humanoid.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2211.03521 [cs.AI]
	(or arXiv:2211.03521v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2211.03521

Submission history

From: Alexis Jacq [view email]
[v1] Mon, 7 Nov 2022 13:02:40 UTC (3,981 KB)
[v2] Mon, 20 Feb 2023 14:28:14 UTC (4,223 KB)

Computer Science > Artificial Intelligence

Title:On the importance of data collection for training general goal-reaching policies

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:On the importance of data collection for training general goal-reaching policies

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators