Zum Hauptinhalt springen

Showing 1–13 of 13 results for author: Dasagi, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.01704  [pdf, other

    cs.CL cs.AI cs.GT

    States as Strings as Strategies: Steering Language Models with Game-Theoretic Solvers

    Authors: Ian Gemp, Yoram Bachrach, Marc Lanctot, Roma Patel, Vibhavari Dasagi, Luke Marris, Georgios Piliouras, Siqi Liu, Karl Tuyls

    Abstract: Game theory is the study of mathematical models of strategic interactions among rational agents. Language is a key medium of interaction for humans, though it has historically proven difficult to model dialogue and its strategic motivations mathematically. A suitable model of the players, strategies, and payoffs associated with linguistic interactions (i.e., a binding to the conventional symbolic… ▽ More

    Submitted 6 February, 2024; v1 submitted 24 January, 2024; originally announced February 2024.

    Comments: 32 pages, 8 figures, code available @ https://github.com/google-deepmind/open_spiel/blob/master/open_spiel/python/games/chat_game.py

  2. arXiv:2301.07608  [pdf, other

    cs.LG cs.AI cs.NE

    Human-Timescale Adaptation in an Open-Ended Task Space

    Authors: Adaptive Agent Team, Jakob Bauer, Kate Baumli, Satinder Baveja, Feryal Behbahani, Avishkar Bhoopchand, Nathalie Bradley-Schmieg, Michael Chang, Natalie Clay, Adrian Collister, Vibhavari Dasagi, Lucy Gonzalez, Karol Gregor, Edward Hughes, Sheleem Kashem, Maria Loks-Thompson, Hannah Openshaw, Jack Parker-Holder, Shreya Pathak, Nicolas Perez-Nieves, Nemanja Rakicevic, Tim Rocktäschel, Yannick Schroecker, Jakub Sygnowski, Karl Tuyls , et al. (3 additional authors not shown)

    Abstract: Foundation models have shown impressive adaptation and scalability in supervised and self-supervised learning problems, but so far these successes have not fully translated to reinforcement learning (RL). In this work, we demonstrate that training an RL agent at scale leads to a general in-context learning algorithm that can adapt to open-ended novel embodied 3D problems as quickly as humans. In a… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

  3. arXiv:2209.10958  [pdf, ps, other

    cs.MA cs.AI

    Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments

    Authors: Ian Gemp, Thomas Anthony, Yoram Bachrach, Avishkar Bhoopchand, Kalesha Bullard, Jerome Connor, Vibhavari Dasagi, Bart De Vylder, Edgar Duenez-Guzman, Romuald Elie, Richard Everett, Daniel Hennes, Edward Hughes, Mina Khan, Marc Lanctot, Kate Larson, Guy Lever, Siqi Liu, Luke Marris, Kevin R. McKee, Paul Muller, Julien Perolat, Florian Strub, Andrea Tacchetti, Eugene Tarassov , et al. (2 additional authors not shown)

    Abstract: The Game Theory & Multi-Agent team at DeepMind studies several aspects of multi-agent learning ranging from computing approximations to fundamental concepts in game theory to simulating social dilemmas in rich spatial environments and training 3-d humanoids in difficult team coordination tasks. A signature aim of our group is to use the resources and expertise made available to us at DeepMind in d… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: Published in AI Communications 2022

  4. arXiv:2201.11861  [pdf, other

    cs.LG

    The Challenges of Exploration for Offline Reinforcement Learning

    Authors: Nathan Lambert, Markus Wulfmeier, William Whitney, Arunkumar Byravan, Michael Bloesch, Vibhavari Dasagi, Tim Hertweck, Martin Riedmiller

    Abstract: Offline Reinforcement Learning (ORL) enablesus to separately study the two interlinked processes of reinforcement learning: collecting informative experience and inferring optimal behaviour. The second step has been widely studied in the offline setting, but just as critical to data-efficient RL is the collection of informative data. The task-agnostic setting for data collection, where the task is… ▽ More

    Submitted 18 February, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

  5. arXiv:2112.05299  [pdf, other

    cs.RO cs.AI

    Zero-Shot Uncertainty-Aware Deployment of Simulation Trained Policies on Real-World Robots

    Authors: Krishan Rana, Vibhavari Dasagi, Jesse Haviland, Ben Talbot, MIchael Milford, Niko Sünderhauf

    Abstract: While deep reinforcement learning (RL) agents have demonstrated incredible potential in attaining dexterous behaviours for robotics, they tend to make errors when deployed in the real world due to mismatches between the training and execution environments. In contrast, the classical robotics community have developed a range of controllers that can safely operate across most states in the real worl… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: Accepted for a poster and spotlight presentation at Neurips 2021 Workshop on Deployable Decision Making in Embodied Systems (DDM). arXiv admin note: substantial text overlap with arXiv:2107.09822

  6. arXiv:2109.08603  [pdf, other

    cs.LG cs.NE cs.RO

    Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration

    Authors: Oliver Groth, Markus Wulfmeier, Giulia Vezzani, Vibhavari Dasagi, Tim Hertweck, Roland Hafner, Nicolas Heess, Martin Riedmiller

    Abstract: Curiosity-based reward schemes can present powerful exploration mechanisms which facilitate the discovery of solutions for complex, sparse or long-horizon tasks. However, as the agent learns to reach previously unexplored spaces and the objective adapts to reward new areas, many behaviours emerge only to disappear due to being overwritten by the constantly shifting objective. We argue that merely… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: 14 pages, 7 figures, 2 tables

    ACM Class: I.2.6; I.2.9

  7. arXiv:2107.09822  [pdf, other

    cs.RO cs.AI eess.SY

    Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics

    Authors: Krishan Rana, Vibhavari Dasagi, Jesse Haviland, Ben Talbot, Michael Milford, Niko Sünderhauf

    Abstract: We present Bayesian Controller Fusion (BCF): a hybrid control strategy that combines the strengths of traditional hand-crafted controllers and model-free deep reinforcement learning (RL). BCF thrives in the robotics domain, where reliable but suboptimal control priors exist for many tasks, but RL from scratch remains unsafe and data-inefficient. By fusing uncertainty-aware distributional outputs f… ▽ More

    Submitted 3 April, 2023; v1 submitted 20 July, 2021; originally announced July 2021.

    Comments: The International Journal of Robotics Research (IJRR), 2023. Project page: https://krishanrana.github.io/bcf

  8. arXiv:2010.03209  [pdf, other

    cs.RO

    Learning Arbitrary-Goal Fabric Folding with One Hour of Real Robot Experience

    Authors: Robert Lee, Daniel Ward, Akansel Cosgun, Vibhavari Dasagi, Peter Corke, Jurgen Leitner

    Abstract: Manipulating deformable objects, such as fabric, is a long standing problem in robotics, with state estimation and control posing a significant challenge for traditional methods. In this paper, we show that it is possible to learn fabric folding skills in only an hour of self-supervised real robot experience, without human supervision or simulation. Our approach relies on fully convolutional netwo… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

  9. arXiv:2003.05117  [pdf, other

    cs.RO cs.AI

    Multiplicative Controller Fusion: Leveraging Algorithmic Priors for Sample-efficient Reinforcement Learning and Safe Sim-To-Real Transfer

    Authors: Krishan Rana, Vibhavari Dasagi, Ben Talbot, Michael Milford, Niko Sünderhauf

    Abstract: Learning-based approaches often outperform hand-coded algorithmic solutions for many problems in robotics. However, learning long-horizon tasks on real robot hardware can be intractable, and transferring a learned policy from simulation to reality is still extremely challenging. We present a novel approach to model-free reinforcement learning that can leverage existing sub-optimal solutions as an… ▽ More

    Submitted 27 July, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

    Comments: Accepted for presentation at IROS2020. Project site available at https://sites.google.com/view/mcf-nav/home

  10. arXiv:1911.08666  [pdf, other

    cs.LG cs.RO stat.ML

    Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks

    Authors: Vibhavari Dasagi, Robert Lee, Jake Bruce, Jürgen Leitner

    Abstract: Deep reinforcement learning has been shown to solve challenging tasks where large amounts of training experience is available, usually obtained online while learning the task. Robotics is a significant potential application domain for many of these algorithms, but generating robot experience in the real world is expensive, especially when each task requires a lengthy online training procedure. Off… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

  11. arXiv:1910.03732  [pdf, other

    cs.LG cs.RO stat.ML

    Ctrl-Z: Recovering from Instability in Reinforcement Learning

    Authors: Vibhavari Dasagi, Jake Bruce, Thierry Peynot, Jürgen Leitner

    Abstract: When learning behavior, training data is often generated by the learner itself; this can result in unstable training dynamics, and this problem has particularly important applications in safety-sensitive real-world control tasks such as robotics. In this work, we propose a principled and model-agnostic approach to mitigate the issue of unstable learning dynamics by maintaining a history of a reinf… ▽ More

    Submitted 8 October, 2019; originally announced October 2019.

    Comments: Submitted to ICRA2020, under review

  12. arXiv:1909.10972  [pdf, other

    cs.RO cs.LG

    Residual Reactive Navigation: Combining Classical and Learned Navigation Strategies For Deployment in Unknown Environments

    Authors: Krishan Rana, Ben Talbot, Vibhavari Dasagi, Michael Milford, Niko Sünderhauf

    Abstract: In this work we focus on improving the efficiency and generalisation of learned navigation strategies when transferred from its training environment to previously unseen ones. We present an extension of the residual reinforcement learning framework from the robotic manipulation literature and adapt it to the vast and unstructured environments that mobile robots can operate in. The concept is based… ▽ More

    Submitted 11 March, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

    Comments: Accepted as a conference paper at ICRA2020. Project site available at https://sites.google.com/view/srrn/home

  13. arXiv:1809.07480  [pdf, other

    cs.LG stat.ML

    Sim-to-Real Transfer of Robot Learning with Variable Length Inputs

    Authors: Vibhavari Dasagi, Robert Lee, Serena Mou, Jake Bruce, Niko Sünderhauf, Jürgen Leitner

    Abstract: Current end-to-end deep Reinforcement Learning (RL) approaches require jointly learning perception, decision-making and low-level control from very sparse reward signals and high-dimensional inputs, with little capability of incorporating prior knowledge. This results in prohibitively long training times for use on real-world robotic tasks. Existing algorithms capable of extracting task-level repr… ▽ More

    Submitted 8 October, 2019; v1 submitted 20 September, 2018; originally announced September 2018.