Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Sendonaris, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:1704.03732  [pdf, ps, other

    cs.AI cs.LG

    Deep Q-learning from Demonstrations

    Authors: Todd Hester, Matej Vecerik, Olivier Pietquin, Marc Lanctot, Tom Schaul, Bilal Piot, Dan Horgan, John Quan, Andrew Sendonaris, Gabriel Dulac-Arnold, Ian Osband, John Agapiou, Joel Z. Leibo, Audrunas Gruslys

    Abstract: Deep reinforcement learning (RL) has achieved several high profile successes in difficult decision-making problems. However, these algorithms typically require a huge amount of data before they reach reasonable performance. In fact, their performance during learning can be extremely poor. This may be acceptable for a simulator, but it severely limits the applicability of deep RL to many real-world… ▽ More

    Submitted 22 November, 2017; v1 submitted 12 April, 2017; originally announced April 2017.

    Comments: Published at AAAI 2018. Previously on arxiv as "Learning from Demonstrations for Real World Reinforcement Learning"