
Showing 1–11 of 11 results for author: Hester, T

Searching in archive cs.
  1. arXiv:2108.11482  [pdf, other]

    cs.LG cs.AI cs.SI

    ETA Prediction with Graph Neural Networks in Google Maps

    Authors: Austin Derrow-Pinion, Jennifer She, David Wong, Oliver Lange, Todd Hester, Luis Perez, Marc Nunkesser, Seongjae Lee, Xueying Guo, Brett Wiltshire, Peter W. Battaglia, Vishal Gupta, Ang Li, Zhongwen Xu, Alvaro Sanchez-Gonzalez, Yujia Li, Petar Veličković

    Abstract: Travel-time prediction constitutes a task of high importance in transportation networks, with web mapping services like Google Maps regularly serving vast quantities of travel time queries from users and enterprises alike. Further, such a task requires accounting for complex spatiotemporal interactions (modelling both the topological properties of the road network and anticipating events -- such a…

    Submitted 25 August, 2021; originally announced August 2021.

    Comments: To appear at CIKM 2021 (Applied Research Track). 10 pages, 4 figures

  2. arXiv:2003.11881  [pdf, other]

    cs.LG cs.AI

    An empirical investigation of the challenges of real-world reinforcement learning

    Authors: Gabriel Dulac-Arnold, Nir Levine, Daniel J. Mankowitz, Jerry Li, Cosmin Paduraru, Sven Gowal, Todd Hester

    Abstract: Reinforcement learning (RL) has proven its worth in a series of artificial domains, and is beginning to show some successes in real-world scenarios. However, much of the research advances in RL are hard to leverage in real-world systems due to a series of assumptions that are rarely satisfied in practice. In this work, we identify and formalize a series of independent challenges that embody the di…

    Submitted 4 March, 2021; v1 submitted 24 March, 2020; originally announced March 2020.

    Comments: arXiv admin note: text overlap with arXiv:1904.12901

  3. arXiv:1906.07516  [pdf, other]

    cs.LG cs.AI stat.ML

    Robust Reinforcement Learning for Continuous Control with Model Misspecification

    Authors: Daniel J. Mankowitz, Nir Levine, Rae Jeong, Yuanyuan Shi, Jackie Kay, Abbas Abdolmaleki, Jost Tobias Springenberg, Timothy Mann, Todd Hester, Martin Riedmiller

    Abstract: We provide a framework for incorporating robustness -- to perturbations in the transition dynamics which we refer to as model misspecification -- into continuous control Reinforcement Learning (RL) algorithms. We specifically focus on incorporating robustness into a state-of-the-art continuous control RL algorithm called Maximum a-posteriori Policy Optimization (MPO). We achieve this by learning a…

    Submitted 11 February, 2020; v1 submitted 18 June, 2019; originally announced June 2019.

  4. arXiv:1904.12901  [pdf, ps, other]

    cs.LG cs.AI cs.RO stat.ML

    Challenges of Real-World Reinforcement Learning

    Authors: Gabriel Dulac-Arnold, Daniel Mankowitz, Todd Hester

    Abstract: Reinforcement learning (RL) has proven its worth in a series of artificial domains, and is beginning to show some successes in real-world scenarios. However, much of the research advances in RL are often hard to leverage in real-world systems due to a series of assumptions that are rarely satisfied in practice. We present a set of nine unique challenges that must be addressed to productionize RL t…

    Submitted 29 April, 2019; originally announced April 2019.

  5. arXiv:1810.01531  [pdf, other]

    cs.RO

    A Practical Approach to Insertion with Variable Socket Position Using Deep Reinforcement Learning

    Authors: Mel Vecerik, Oleg Sushkov, David Barker, Thomas Rothörl, Todd Hester, Jon Scholz

    Abstract: Insertion is a challenging haptic and visual control problem with significant practical value for manufacturing. Existing approaches in the model-based robotics community can be highly effective when task geometry is known, but are complex and cumbersome to implement, and must be tailored to each individual problem by a qualified engineer. Within the learning community there is a long history of i…

    Submitted 8 October, 2018; v1 submitted 2 October, 2018; originally announced October 2018.

  6. arXiv:1805.11593  [pdf, other]

    cs.LG cs.AI stat.ML

    Observe and Look Further: Achieving Consistent Performance on Atari

    Authors: Tobias Pohlen, Bilal Piot, Todd Hester, Mohammad Gheshlaghi Azar, Dan Horgan, David Budden, Gabriel Barth-Maron, Hado van Hasselt, John Quan, Mel Večerík, Matteo Hessel, Rémi Munos, Olivier Pietquin

    Abstract: Despite significant advances in the field of deep Reinforcement Learning (RL), today's algorithms still fail to learn human-level policies consistently over a set of diverse tasks such as Atari 2600 games. We identify three key challenges that any algorithm needs to master in order to perform well on all games: processing diverse reward distributions, reasoning over long time horizons, and explori…

    Submitted 29 May, 2018; originally announced May 2018.

  7. arXiv:1801.08757  [pdf, other]

    cs.AI

    Safe Exploration in Continuous Action Spaces

    Authors: Gal Dalal, Krishnamurthy Dvijotham, Matej Vecerik, Todd Hester, Cosmin Paduraru, Yuval Tassa

    Abstract: We address the problem of deploying a reinforcement learning (RL) agent on a physical system such as a datacenter cooling unit or robot, where critical constraints must never be violated. We show how to exploit the typically smooth dynamics of these systems and enable RL algorithms to never violate constraints during learning. Our technique is to directly add to the policy a safety layer that anal…

    Submitted 26 January, 2018; originally announced January 2018.

  8. arXiv:1707.08817  [pdf, other]

    cs.AI

    Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards

    Authors: Mel Vecerik, Todd Hester, Jonathan Scholz, Fumin Wang, Olivier Pietquin, Bilal Piot, Nicolas Heess, Thomas Rothörl, Thomas Lampe, Martin Riedmiller

    Abstract: We propose a general and model-free approach for Reinforcement Learning (RL) on real robotics with sparse rewards. We build upon the Deep Deterministic Policy Gradient (DDPG) algorithm to use demonstrations. Both demonstrations and actual interactions are used to fill a replay buffer and the sampling ratio between demonstrations and transitions is automatically tuned via a prioritized replay mecha…

    Submitted 8 October, 2018; v1 submitted 27 July, 2017; originally announced July 2017.

  9. arXiv:1704.03732  [pdf, ps, other]

    cs.AI cs.LG

    Deep Q-learning from Demonstrations

    Authors: Todd Hester, Matej Vecerik, Olivier Pietquin, Marc Lanctot, Tom Schaul, Bilal Piot, Dan Horgan, John Quan, Andrew Sendonaris, Gabriel Dulac-Arnold, Ian Osband, John Agapiou, Joel Z. Leibo, Audrunas Gruslys

    Abstract: Deep reinforcement learning (RL) has achieved several high profile successes in difficult decision-making problems. However, these algorithms typically require a huge amount of data before they reach reasonable performance. In fact, their performance during learning can be extremely poor. This may be acceptable for a simulator, but it severely limits the applicability of deep RL to many real-world…

    Submitted 22 November, 2017; v1 submitted 12 April, 2017; originally announced April 2017.

    Comments: Published at AAAI 2018. Previously on arXiv as "Learning from Demonstrations for Real World Reinforcement Learning"

  10. arXiv:1612.09465  [pdf, other]

    cs.LG cs.AI stat.ML

    Adaptive Lambda Least-Squares Temporal Difference Learning

    Authors: Timothy A. Mann, Hugo Penedones, Shie Mannor, Todd Hester

    Abstract: Temporal Difference learning or TD($λ$) is a fundamental algorithm in the field of reinforcement learning. However, setting TD's $λ$ parameter, which controls the timescale of TD updates, is generally left up to the practitioner. We formalize the $λ$ selection problem as a bias-variance trade-off where the solution is the value of $λ$ that leads to the smallest Mean Squared Value Error (MSVE). To…

    Submitted 30 December, 2016; originally announced December 2016.

  11. arXiv:1105.1749  [pdf, ps, other]

    cs.AI cs.RO cs.SE

    A Real-Time Model-Based Reinforcement Learning Architecture for Robot Control

    Authors: Todd Hester, Michael Quinlan, Peter Stone

    Abstract: Reinforcement Learning (RL) is a method for learning decision-making tasks that could enable robots to learn and adapt to their situation on-line. For an RL algorithm to be practical for robotic control tasks, it must learn in very few actions, while continually taking those actions in real-time. Existing model-based RL methods learn in relatively few actions, but typically take too much time betw…

    Submitted 21 May, 2011; v1 submitted 9 May, 2011; originally announced May 2011.

    Comments: Added a reference. Presents a real-time parallel architecture for model-based reinforcement learning methods.

    ACM Class: D.2.11; I.2.6