Imitation by Predicting Observations

Jaegle, Andrew; Sulsky, Yury; Ahuja, Arun; Bruce, Jake; Fergus, Rob; Wayne, Greg

arXiv:2107.03851 (cs)

[Submitted on 8 Jul 2021]

Abstract: Imitation learning enables agents to reuse and adapt the hard-won expertise of others, offering a solution to several key challenges in learning behavior. Although it is easy to observe behavior in the real world, the underlying actions may not be accessible. We present a new method for imitation solely from observations that achieves performance comparable to experts on challenging continuous control tasks while also exhibiting robustness in the presence of observations unrelated to the task. Our method, which we call FORM (for "Future Observation Reward Model"), is derived from an inverse RL objective and imitates using a model of expert behavior learned by generative modelling of the expert's observations, without needing ground-truth actions. We show that FORM performs comparably to a strong baseline IRL method (GAIL) on the DeepMind Control Suite benchmark, while outperforming GAIL in the presence of task-irrelevant features.
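
The abstract does not spell out the reward itself, so the following is only an illustrative sketch of the general idea of rewarding an agent with a generative model of expert observations, not the authors' implementation. It assumes (this is an assumption, not stated above) that the reward for a transition is the log-likelihood of the next observation under a model fitted to expert observations, minus its log-likelihood under a model fitted to the agent's own observations. The 1-D linear-Gaussian models, the toy dynamics, and every function name are hypothetical choices made for brevity.

```python
import math
import random


def fit_gaussian_transition(pairs):
    """Fit x_next ~ Normal(a * x + b, sigma^2) by least squares on (x, x_next) pairs."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    var_x = sum((x - mean_x) ** 2 for x, _ in pairs)
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    a = cov_xy / var_x
    b = mean_y - a * mean_x
    mse = sum((y - (a * x + b)) ** 2 for x, y in pairs) / n
    sigma = math.sqrt(max(mse, 1e-6))  # floor keeps the log-density finite
    return a, b, sigma


def log_prob(model, x, x_next):
    """Log-density of x_next given x under a fitted linear-Gaussian model."""
    a, b, sigma = model
    z = (x_next - (a * x + b)) / sigma
    return -0.5 * z * z - math.log(sigma) - 0.5 * math.log(2.0 * math.pi)


def observation_reward(expert_model, agent_model, x, x_next):
    """Assumed reward: log-likelihood ratio of the observed transition.

    Only observations are used; no actions appear anywhere.
    """
    return log_prob(expert_model, x, x_next) - log_prob(agent_model, x, x_next)


def sample_transitions(decay, count, rng, noise=0.05):
    """Toy observation-only data: x_next = decay * x + Gaussian noise."""
    pairs = []
    for _ in range(count):
        x = rng.uniform(-1.0, 1.0)
        pairs.append((x, decay * x + rng.gauss(0.0, noise)))
    return pairs


rng = random.Random(0)
# Hypothetical setup: the expert's observations decay slowly, the agent's quickly.
expert_model = fit_gaussian_transition(sample_transitions(0.9, 500, rng))
agent_model = fit_gaussian_transition(sample_transitions(0.5, 500, rng))

# A transition that looks like the expert's earns a positive reward,
# one that looks like the agent's current behavior earns a negative reward.
reward_expert_like = observation_reward(expert_model, agent_model, 1.0, 0.9)
reward_agent_like = observation_reward(expert_model, agent_model, 1.0, 0.5)
print(reward_expert_like > 0, reward_agent_like < 0)
```

In a full imitation loop, a reward of this kind would be handed to a standard RL algorithm and the agent-side model refitted as the policy changes; both steps are omitted here.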

Comments:	ICML 2021
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2107.03851 [cs.LG]
	(or arXiv:2107.03851v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2107.03851

Submission history

From: Andrew Jaegle
[v1] Thu, 8 Jul 2021 14:09:30 UTC (10,728 KB)
