Google Scholar

Visual semantic planning using deep successor representations

Y Zhu, D Gordon, E Kolve, D Fox… - Proceedings of the …, 2017 - openaccess.thecvf.com

Y Zhu, D Gordon, E Kolve, D Fox, L Fei-Fei, A Gupta, R Mottaghi, A Farhadi

Proceedings of the IEEE international conference on computer …, 2017•openaccess.thecvf.com

Abstract

A crucial capability of real-world intelligent agents is their ability to plan a sequence of actions to achieve their goals in the visual world. In this work, we address the problem of visual semantic planning: the task of predicting a sequence of actions from visual observations that transform a dynamic environment from an initial state to a goal state. Doing so entails knowledge about objects and their affordances, as well as actions and their preconditions and effects. We propose learning these through interacting with a visual and dynamic environment. Our proposed solution involves bootstrapping reinforcement learning with imitation learning. To ensure cross task generalization, we develop a deep predictive model based on successor representations. Our experimental results show near optimal results across a wide range of tasks in the challenging THOR environment. The supplementary video can be accessed at the following link: https://goo. gl/vXsbQP.

openaccess.thecvf.com

Show moreShow less

Speichern Sie Cite Cited by 166 Related articles All 4 versions View as HTML

Showing the best result for this search. See all results

Cite

Advanced search

Saved to My library

Visual semantic planning using deep successor representations