Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks

Dasagi, Vibhavari; Lee, Robert; Bruce, Jake; Leitner, Jürgen

Computer Science > Machine Learning

arXiv:1911.08666 (cs)

[Submitted on 20 Nov 2019]

Title:Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks

Authors:Vibhavari Dasagi, Robert Lee, Jake Bruce, Jürgen Leitner

View PDF

Abstract:Deep reinforcement learning has been shown to solve challenging tasks where large amounts of training experience is available, usually obtained online while learning the task. Robotics is a significant potential application domain for many of these algorithms, but generating robot experience in the real world is expensive, especially when each task requires a lengthy online training procedure. Off-policy algorithms can in principle learn arbitrary tasks from a diverse enough fixed dataset. In this work, we evaluate popular exploration methods by generating robotics datasets for the purpose of learning to solve tasks completely offline without any further interaction in the real world. We present results on three popular continuous control tasks in simulation, as well as continuous control of a high-dimensional real robot arm. Code documenting all algorithms, experiments, and hyper-parameters is available at this https URL.

Subjects:	Machine Learning (cs.LG); Robotics (cs.RO); Machine Learning (stat.ML)
Cite as:	arXiv:1911.08666 [cs.LG]
	(or arXiv:1911.08666v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1911.08666

Submission history

From: Vibhavari Dasagi [view email]
[v1] Wed, 20 Nov 2019 02:03:35 UTC (3,198 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-11

Change to browse by:

cs
cs.RO
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Vibhavari Dasagi
Robert Lee
Jake Bruce
Jürgen Leitner

export BibTeX citation

Computer Science > Machine Learning

Title:Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators