Skip to main content

Showing 1–17 of 17 results for author: Bruce, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.17139  [pdf, other

    cs.CV cs.AI

    Video as the New Language for Real-World Decision Making

    Authors: Sherry Yang, Jacob Walker, Jack Parker-Holder, Yilun Du, Jake Bruce, Andre Barreto, Pieter Abbeel, Dale Schuurmans

    Abstract: Both text and video data are abundant on the internet and support large-scale self-supervised learning through next token or frame prediction. However, they have not been equally leveraged: language models have had significant real-world impact, whereas video generation has remained largely limited to media entertainment. Yet video data captures important information about the physical world that… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  2. arXiv:2402.15391  [pdf, other

    cs.LG cs.AI cs.CV

    Genie: Generative Interactive Environments

    Authors: Jake Bruce, Michael Dennis, Ashley Edwards, Jack Parker-Holder, Yuge Shi, Edward Hughes, Matthew Lai, Aditi Mavalankar, Richie Steigerwald, Chris Apps, Yusuf Aytar, Sarah Bechtle, Feryal Behbahani, Stephanie Chan, Nicolas Heess, Lucy Gonzalez, Simon Osindero, Sherjil Ozair, Scott Reed, Jingwei Zhang, Konrad Zolna, Jeff Clune, Nando de Freitas, Satinder Singh, Tim Rocktäschel

    Abstract: We introduce Genie, the first generative interactive environment trained in an unsupervised manner from unlabelled Internet videos. The model can be prompted to generate an endless variety of action-controllable virtual worlds described through text, synthetic images, photographs, and even sketches. At 11B parameters, Genie can be considered a foundation world model. It is comprised of a spatiotem… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: https://sites.google.com/corp/view/genie-2024/

  3. arXiv:2305.10912  [pdf, other

    cs.AI cs.RO

    A Generalist Dynamics Model for Control

    Authors: Ingmar Schubert, Jingwei Zhang, Jake Bruce, Sarah Bechtle, Emilio Parisotto, Martin Riedmiller, Jost Tobias Springenberg, Arunkumar Byravan, Leonard Hasenclever, Nicolas Heess

    Abstract: We investigate the use of transformer sequence models as dynamics models (TDMs) for control. We find that TDMs exhibit strong generalization capabilities to unseen environments, both in a few-shot setting, where a generalist TDM is fine-tuned with small amounts of data from the target environment, and in a zero-shot setting, where a generalist TDM is applied to an unseen environment without any fu… ▽ More

    Submitted 23 September, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  4. arXiv:2305.00982  [pdf, other

    cs.LG cs.AI eess.SY

    Two-phase Dual COPOD Method for Anomaly Detection in Industrial Control System

    Authors: Emmanuel Aboah Boateng, Jerry Bruce

    Abstract: Critical infrastructures like water treatment facilities and power plants depend on industrial control systems (ICS) for monitoring and control, making them vulnerable to cyber attacks and system malfunctions. Traditional ICS anomaly detection methods lack transparency and interpretability, which make it difficult for practitioners to understand and trust the results. This paper proposes a two-pha… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: 11 pages, 9 figures, journal article

  5. arXiv:2304.00046  [pdf, other

    cs.LG cs.AI

    Accelerating exploration and representation learning with offline pre-training

    Authors: Bogdan Mazoure, Jake Bruce, Doina Precup, Rob Fergus, Ankit Anand

    Abstract: Sequential decision-making agents struggle with long horizon tasks, since solving them requires multi-step reasoning. Most reinforcement learning (RL) algorithms address this challenge by improved credit assignment, introducing memory capability, altering the agent's intrinsic motivation (i.e. exploration) or its worldview (i.e. knowledge representation). Many of these components could be learned… ▽ More

    Submitted 31 March, 2023; originally announced April 2023.

  6. arXiv:2205.06175  [pdf, other

    cs.AI cs.CL cs.LG cs.RO

    A Generalist Agent

    Authors: Scott Reed, Konrad Zolna, Emilio Parisotto, Sergio Gomez Colmenarejo, Alexander Novikov, Gabriel Barth-Maron, Mai Gimenez, Yury Sulsky, Jackie Kay, Jost Tobias Springenberg, Tom Eccles, Jake Bruce, Ali Razavi, Ashley Edwards, Nicolas Heess, Yutian Chen, Raia Hadsell, Oriol Vinyals, Mahyar Bordbar, Nando de Freitas

    Abstract: Inspired by progress in large-scale language modeling, we apply a similar approach towards building a single generalist agent beyond the realm of text outputs. The agent, which we refer to as Gato, works as a multi-modal, multi-task, multi-embodiment generalist policy. The same network with the same weights can play Atari, caption images, chat, stack blocks with a real robot arm and much more, dec… ▽ More

    Submitted 11 November, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: Published at TMLR, 42 pages

    Journal ref: Transactions on Machine Learning Research, 11/2022, https://openreview.net/forum?id=1ikK0kHjvj

  7. arXiv:2107.03851  [pdf, other

    cs.LG cs.AI

    Imitation by Predicting Observations

    Authors: Andrew Jaegle, Yury Sulsky, Arun Ahuja, Jake Bruce, Rob Fergus, Greg Wayne

    Abstract: Imitation learning enables agents to reuse and adapt the hard-won expertise of others, offering a solution to several key challenges in learning behavior. Although it is easy to observe behavior in the real-world, the underlying actions may not be accessible. We present a new method for imitation solely from observations that achieves comparable performance to experts on challenging continuous con… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: ICML 2021

  8. arXiv:1911.08666  [pdf, other

    cs.LG cs.RO stat.ML

    Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks

    Authors: Vibhavari Dasagi, Robert Lee, Jake Bruce, Jürgen Leitner

    Abstract: Deep reinforcement learning has been shown to solve challenging tasks where large amounts of training experience is available, usually obtained online while learning the task. Robotics is a significant potential application domain for many of these algorithms, but generating robot experience in the real world is expensive, especially when each task requires a lengthy online training procedure. Off… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

  9. arXiv:1910.03732  [pdf, other

    cs.LG cs.RO stat.ML

    Ctrl-Z: Recovering from Instability in Reinforcement Learning

    Authors: Vibhavari Dasagi, Jake Bruce, Thierry Peynot, Jürgen Leitner

    Abstract: When learning behavior, training data is often generated by the learner itself; this can result in unstable training dynamics, and this problem has particularly important applications in safety-sensitive real-world control tasks such as robotics. In this work, we propose a principled and model-agnostic approach to mitigate the issue of unstable learning dynamics by maintaining a history of a reinf… ▽ More

    Submitted 8 October, 2019; originally announced October 2019.

    Comments: Submitted to ICRA2020, under review

  10. arXiv:1809.07480  [pdf, other

    cs.LG stat.ML

    Sim-to-Real Transfer of Robot Learning with Variable Length Inputs

    Authors: Vibhavari Dasagi, Robert Lee, Serena Mou, Jake Bruce, Niko Sünderhauf, Jürgen Leitner

    Abstract: Current end-to-end deep Reinforcement Learning (RL) approaches require jointly learning perception, decision-making and low-level control from very sparse reward signals and high-dimensional inputs, with little capability of incorporating prior knowledge. This results in prohibitively long training times for use on real-world robotic tasks. Existing algorithms capable of extracting task-level repr… ▽ More

    Submitted 8 October, 2019; v1 submitted 20 September, 2018; originally announced September 2018.

  11. arXiv:1807.05211  [pdf, other

    cs.RO

    Learning Deployable Navigation Policies at Kilometer Scale from a Single Traversal

    Authors: Jake Bruce, Niko Sünderhauf, Piotr Mirowski, Raia Hadsell, Michael Milford

    Abstract: Model-free reinforcement learning has recently been shown to be effective at learning navigation policies from complex image input. However, these algorithms tend to require large amounts of interaction with the environment, which can be prohibitively costly to obtain on robots in the real world. We present an approach for efficiently learning goal-directed navigation policies on a mobile robot, f… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

  12. arXiv:1711.10137  [pdf, other

    cs.AI cs.LG cs.RO

    One-Shot Reinforcement Learning for Robot Navigation with Interactive Replay

    Authors: Jake Bruce, Niko Suenderhauf, Piotr Mirowski, Raia Hadsell, Michael Milford

    Abstract: Recently, model-free reinforcement learning algorithms have been shown to solve challenging problems by learning from extensive interaction with the environment. A significant issue with transferring this success to the robotics domain is that interaction with the real world is costly, but training on limited experience is prone to overfitting. We present a method for learning to navigate, to a fi… ▽ More

    Submitted 28 November, 2017; v1 submitted 28 November, 2017; originally announced November 2017.

    Comments: NIPS Workshop on Acting and Interacting in the Real World: Challenges in Robot Learning

    Journal ref: Bruce, Jake, et al. "One-Shot Reinforcement Learning for Robot Navigation with Interactive Replay." Proceedings of the NIPS Workshop on Acting and Interacting in the Real World: Challenges in Robot Learning. 2017

  13. arXiv:1711.07280  [pdf, other

    cs.CV cs.AI cs.CL cs.RO

    Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments

    Authors: Peter Anderson, Qi Wu, Damien Teney, Jake Bruce, Mark Johnson, Niko Sünderhauf, Ian Reid, Stephen Gould, Anton van den Hengel

    Abstract: A robot that can carry out a natural-language instruction has been a dream since before the Jetsons cartoon series imagined a life of leisure mediated by a fleet of attentive robot helpers. It is a dream that remains stubbornly distant. However, recent advances in vision and language methods have made incredible progress in closely related areas. This is significant because a robot interpreting a… ▽ More

    Submitted 5 April, 2018; v1 submitted 20 November, 2017; originally announced November 2017.

    Comments: CVPR 2018 Spotlight presentation

  14. Look No Further: Adapting the Localization Sensory Window to the Temporal Characteristics of the Environment

    Authors: Jake Bruce, Adam Jacobson, Michael Milford

    Abstract: Many localization algorithms use a spatiotemporal window of sensory information in order to recognize spatial locations, and the length of this window is often a sensitive parameter that must be tuned to the specifics of the application. This letter presents a general method for environment-driven variation of the length of the spatiotemporal window based on searching for the most significant loca… ▽ More

    Submitted 23 July, 2017; v1 submitted 18 June, 2017; originally announced June 2017.

    Comments: Pre-print of article appearing in 2017 IEEE Robotics and Automation Letters. v2: incorporated reviewer feedback

    MSC Class: 68T40

    Journal ref: IEEE Robotics and Automation Letters, vol. 2, no. 4, pp. 2209-2216, October 2017

  15. arXiv:1609.09049  [pdf, other

    cs.RO cs.LG

    Deep Reinforcement Learning for Tensegrity Robot Locomotion

    Authors: Marvin Zhang, Xinyang Geng, Jonathan Bruce, Ken Caluwaerts, Massimo Vespignani, Vytas SunSpiral, Pieter Abbeel, Sergey Levine

    Abstract: Tensegrity robots, composed of rigid rods connected by elastic cables, have a number of unique properties that make them appealing for use as planetary exploration rovers. However, control of tensegrity robots remains a difficult problem due to their unusual structures and complex dynamics. In this work, we show how locomotion gaits can be learned automatically using a novel extension of mirror de… ▽ More

    Submitted 7 March, 2017; v1 submitted 28 September, 2016; originally announced September 2016.

    Comments: International Conference on Robotics and Automation (ICRA), 2017. Project website link is http://rll.berkeley.edu/drl_tensegrity

  16. arXiv:1510.07595  [pdf, other

    cs.RO

    A light-weight, multi-axis compliant tensegrity joint

    Authors: Steven Lessard, Jonathan Bruce, Erik Jung, Mircea Teodorescu, Vytas SunSpiral, Adrian Agogino

    Abstract: In this paper, we present a light-weight, multi- axis compliant tenegrity joint that is biologically inspired by the human elbow. This tensegrity elbow actuates by shortening and lengthening cable in a method inspired by muscular actuation in a person. Unlike many series elastic actuators, this joint is structurally compliant not just along each axis of rotation, but along other axes as well. Comp… ▽ More

    Submitted 26 October, 2015; originally announced October 2015.

    Comments: Initial submission

  17. State Estimation for Tensegrity Robots

    Authors: Ken Caluwaerts, Jonathan Bruce, Jeffrey M. Friesen, Vytas SunSpiral

    Abstract: Tensegrity robots are a class of compliant robots that have many desirable traits when designing mass efficient systems that must interact with uncertain environments. Various promising control approaches have been proposed for tensegrity systems in simulation. Unfortunately, state estimation methods for tensegrity robots have not yet been thoroughly studied. In this paper, we present the design… ▽ More

    Submitted 19 February, 2016; v1 submitted 5 October, 2015; originally announced October 2015.

    Comments: accepted for publication at the IEEE International Conference on Robotics and Automation (ICRA) 2016