Google Scholar

Scaling data-driven robotics with reward sketching and batch reinforcement learning

S Cabi, SG Colmenarejo, A Novikov… - arXiv preprint arXiv …, 2019 - arxiv.org

S Cabi, SG Colmenarejo, A Novikov, K Konyushkova, S Reed, R Jeong, K Zolna, Y Aytar…

arXiv preprint arXiv:1909.12200, 2019•arxiv.org

We present a framework for data-driven robotics that makes use of a large dataset of recorded robot experience and scales to several tasks using learned reward functions. We show how to apply this framework to accomplish three different object manipulation tasks on a real robot platform. Given demonstrations of a task together with task-agnostic recorded experience, we use a special form of human annotation as supervision to learn a reward function, which enables us to deal with real-world tasks where the reward signal cannot be acquired directly. Learned rewards are used in combination with a large dataset of experience from different tasks to learn a robot policy offline using batch RL. We show that using our approach it is possible to train agents to perform a variety of challenging manipulation tasks including stacking rigid objects and handling cloth.

arxiv.org

Show moreShow less

Speichern Sie Cite Cited by 141 Related articles All 2 versions View as HTML

Cite

Advanced search

Saved to My library

Scaling data-driven robotics with reward sketching and batch reinforcement learning