Skip to main content

Showing 1–36 of 36 results for author: Berseth, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.20971  [pdf, other

    cs.LG cs.CV

    Amortizing intractable inference in diffusion models for vision, language, and control

    Authors: Siddarth Venkatraman, Moksh Jain, Luca Scimeca, Minsu Kim, Marcin Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yoshua Bengio, Glen Berseth, Nikolay Malkin

    Abstract: Diffusion models have emerged as effective distribution estimators in vision, language, and reinforcement learning, but their use as priors in downstream tasks poses an intractable posterior inference problem. This paper studies amortized sampling of the posterior over data, $\mathbf{x}\sim p^{\rm post}(\mathbf{x})\propto p(\mathbf{x})r(\mathbf{x})$, in a model that consists of a diffusion generat… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Code: https://github.com/GFNOrg/diffusion-finetuning

  2. arXiv:2405.19548  [pdf, other

    cs.LG

    RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning

    Authors: Mingqi Yuan, Roger Creus Castanyer, Bo Li, Xin Jin, Glen Berseth, Wenjun Zeng

    Abstract: Extrinsic rewards can effectively guide reinforcement learning (RL) agents in specific tasks. However, extrinsic rewards frequently fall short in complex environments due to the significant human effort needed for their design and annotation. This limitation underscores the necessity for intrinsic rewards, which offer auxiliary and dense signals and can enable agents to learn in an unsupervised ma… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 25 pages, 19 figures

  3. arXiv:2405.17243  [pdf, other

    cs.LG cs.AI

    Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning

    Authors: Adriana Hugessen, Roger Creus Castanyer, Faisal Mohamed, Glen Berseth

    Abstract: Both entropy-minimizing and entropy-maximizing (curiosity) objectives for unsupervised reinforcement learning (RL) have been shown to be effective in different environments, depending on the environment's level of natural entropy. However, neither method alone results in an agent that will consistently learn intelligent behavior across environments. In an effort to find a single entropy-based meth… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Published at the Reinforcement Learning Conference 2024

  4. arXiv:2405.01684  [pdf, other

    cs.LG cs.AI

    Intelligent Switching for Reset-Free RL

    Authors: Darshan Patil, Janarthanan Rajendran, Glen Berseth, Sarath Chandar

    Abstract: In the real world, the strong episode resetting mechanisms that are needed to train agents in simulation are unavailable. The \textit{resetting} assumption limits the potential of reinforcement learning in the real world, as providing resets to an agent usually requires the creation of additional handcrafted mechanisms or human interventions. Recent work aims to train agents (\textit{forward}) wit… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Published at ICLR 2024

  5. arXiv:2403.12945  [pdf, other

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park , et al. (74 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  6. arXiv:2401.16889  [pdf, other

    cs.RO cs.AI eess.SY

    Reinforcement Learning for Versatile, Dynamic, and Robust Bipedal Locomotion Control

    Authors: Zhongyu Li, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath

    Abstract: This paper presents a comprehensive study on using deep reinforcement learning (RL) to create dynamic locomotion controllers for bipedal robots. Going beyond focusing on a single locomotion skill, we develop a general control solution that can be used for a range of dynamic bipedal skills, from periodic walking and running to aperiodic jumping and standing. Our RL-based controller incorporates a n… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  7. arXiv:2401.11237  [pdf, other

    cs.LG

    Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of View

    Authors: Raj Ghugare, Matthieu Geist, Glen Berseth, Benjamin Eysenbach

    Abstract: Some reinforcement learning (RL) algorithms can stitch pieces of experience to solve a task never seen before during training. This oft-sought property is one of the few ways in which RL methods based on dynamic-programming differ from RL methods based on supervised-learning (SL). Yet, certain RL methods based on off-the-shelf SL algorithms achieve excellent results without an explicit mechanism f… ▽ More

    Submitted 11 March, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

    Comments: ICLR 2024, Project code: https://github.com/RajGhugare19/stitching-is-combinatorial-generalisation

  8. arXiv:2310.18144  [pdf, other

    cs.LG cs.AI

    Improving Intrinsic Exploration by Creating Stationary Objectives

    Authors: Roger Creus Castanyer, Joshua Romoff, Glen Berseth

    Abstract: Exploration bonuses in reinforcement learning guide long-horizon exploration by defining custom intrinsic objectives. Several exploration objectives like count-based bonuses, pseudo-counts, and state-entropy maximization are non-stationary and hence are difficult to optimize for the agent. While this issue is generally known, it is usually omitted and solutions remain under-explored. The key contr… ▽ More

    Submitted 22 April, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted at ICLR 2024

  9. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  10. arXiv:2310.02902  [pdf, other

    cs.LG cond-mat.mtrl-sci cs.AI

    Searching for High-Value Molecules Using Reinforcement Learning and Transformers

    Authors: Raj Ghugare, Santiago Miret, Adriana Hugessen, Mariano Phielipp, Glen Berseth

    Abstract: Reinforcement learning (RL) over text representations can be effective for finding high-value policies that can search over graphs. However, RL requires careful structuring of the search space and algorithm design to be effective in this challenge. Through extensive experiments, we explore how different design choices for text grammar and algorithmic choices for training can affect an RL policy's… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  11. arXiv:2309.06599  [pdf, other

    cs.LG

    Reasoning with Latent Diffusion in Offline Reinforcement Learning

    Authors: Siddarth Venkatraman, Shivesh Khaitan, Ravi Tej Akella, John Dolan, Jeff Schneider, Glen Berseth

    Abstract: Offline reinforcement learning (RL) holds promise as a means to learn high-reward policies from a static dataset, without the need for further environment interactions. However, a key challenge in offline RL lies in effectively stitching portions of suboptimal trajectories from the static dataset while avoiding extrapolation errors arising due to a lack of support in the dataset. Existing approach… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  12. arXiv:2309.03839  [pdf, other

    cs.RO cs.HC cs.LG

    Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning

    Authors: Jensen Gao, Siddharth Reddy, Glen Berseth, Anca D. Dragan, Sergey Levine

    Abstract: Adaptive interfaces can help users perform sequential decision-making tasks like robotic teleoperation given noisy, high-dimensional command signals (e.g., from a brain-computer interface). Recent advances in human-in-the-loop machine learning enable such systems to improve by interacting with users, but tend to be limited by the amount of data that they can collect from individual users in practi… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2023

  13. arXiv:2306.14808  [pdf, other

    cs.LG

    Maximum State Entropy Exploration using Predecessor and Successor Representations

    Authors: Arnav Kumar Jain, Lucas Lehnert, Irina Rish, Glen Berseth

    Abstract: Animals have a developed ability to explore that aids them in important tasks such as locating food, exploring for shelter, and finding misplaced items. These exploration skills necessarily track where they have been so that they can plan for finding items with relative efficiency. Contemporary exploration algorithms often learn a less efficient exploration strategy because they either condition o… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  14. Torque-based Deep Reinforcement Learning for Task-and-Robot Agnostic Learning on Bipedal Robots Using Sim-to-Real Transfer

    Authors: Donghyeon Kim, Glen Berseth, Mathew Schwartz, Jaeheung Park

    Abstract: In this paper, we review the question of which action space is best suited for controlling a real biped robot in combination with Sim2Real training. Position control has been popular as it has been shown to be more sample efficient and intuitive to combine with other planning algorithms. However, for position control gain tuning is required to achieve the best possible policy performance. We show… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Journal ref: IEEE Robotics and Automation Letters, vol. 8, no. 10, pp. 6251-6258, Oct. 2023

  15. arXiv:2302.09450  [pdf, other

    cs.RO cs.AI eess.SY

    Robust and Versatile Bipedal Jumping Control through Reinforcement Learning

    Authors: Zhongyu Li, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath

    Abstract: This work aims to push the limits of agility for bipedal robots by enabling a torque-controlled bipedal robot to perform robust and versatile dynamic jumps in the real world. We present a reinforcement learning framework for training a robot to accomplish a large variety of jumping tasks, such as jumping to different locations and directions. To improve performance on these challenging tasks, we d… ▽ More

    Submitted 31 May, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

    Comments: Accepted in Robotics: Science and Systems 2023 (RSS 2023). The accompanying video is at https://youtu.be/aAPSZ2QFB-E

  16. arXiv:2208.01160  [pdf, other

    cs.RO cs.AI eess.SY

    Hierarchical Reinforcement Learning for Precise Soccer Shooting Skills using a Quadrupedal Robot

    Authors: Yandong Ji, Zhongyu Li, Yinan Sun, Xue Bin Peng, Sergey Levine, Glen Berseth, Koushil Sreenath

    Abstract: We address the problem of enabling quadrupedal robots to perform precise shooting skills in the real world using reinforcement learning. Developing algorithms to enable a legged robot to shoot a soccer ball to a given target is a challenging problem that combines robot motion control and planning into one task. To solve this problem, we need to consider the dynamics limitation and motion stability… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: Accepted to 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022)

  17. arXiv:2206.12279  [pdf, other

    cs.LG cs.AI cs.RO

    AnyMorph: Learning Transferable Polices By Inferring Agent Morphology

    Authors: Brandon Trabucco, Mariano Phielipp, Glen Berseth

    Abstract: The prototypical approach to reinforcement learning involves training policies tailored to a particular agent from scratch for every new morphology. Recent work aims to eliminate the re-training of policies by investigating whether a morphology-agnostic policy, trained on a diverse set of agents with similar task objectives, can be transferred to new agents with unseen morphologies without re-trai… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: published at ICML 2022

  18. arXiv:2203.02072  [pdf, other

    cs.HC cs.LG

    X2T: Training an X-to-Text Typing Interface with Online Learning from User Feedback

    Authors: Jensen Gao, Siddharth Reddy, Glen Berseth, Nicholas Hardy, Nikhilesh Natraj, Karunesh Ganguly, Anca D. Dragan, Sergey Levine

    Abstract: We aim to help users communicate their intent to machines using flexible, adaptive interfaces that translate arbitrary user input into desired actions. In this work, we focus on assistive typing applications in which a user cannot operate a keyboard, but can instead supply other inputs, such as webcam images that capture eye gaze or neural activity measured by a brain implant. Standard methods tra… ▽ More

    Submitted 6 March, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Accepted to International Conference on Learning Representations (ICLR) 2021

  19. arXiv:2202.02465  [pdf, other

    cs.RO cs.HC cs.LG

    ASHA: Assistive Teleoperation via Human-in-the-Loop Reinforcement Learning

    Authors: Sean Chen, Jensen Gao, Siddharth Reddy, Glen Berseth, Anca D. Dragan, Sergey Levine

    Abstract: Building assistive interfaces for controlling robots through arbitrary, high-dimensional, noisy inputs (e.g., webcam images of eye gaze) can be challenging, especially when it involves inferring the user's desired action in the absence of a natural 'default' interface. Reinforcement learning from online user feedback on the system's performance presents a natural solution to this problem, and enab… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

    Comments: Accepted to IEEE Conference on Robotics and Automation (ICRA) 2022

  20. arXiv:2112.04467  [pdf, other

    cs.LG cs.AI cs.RO

    CoMPS: Continual Meta Policy Search

    Authors: Glen Berseth, Zhiwei Zhang, Grace Zhang, Chelsea Finn, Sergey Levine

    Abstract: We develop a new continual meta-learning method to address challenges in sequential multi-task learning. In this setting, the agent's goal is to achieve high reward over any sequence of tasks quickly. Prior meta-reinforcement learning algorithms have demonstrated promising results in accelerating the acquisition of new tasks. However, they require access to all tasks during training. Beyond simply… ▽ More

    Submitted 8 December, 2021; originally announced December 2021.

    Comments: 23 pages, under review

  21. arXiv:2112.03899  [pdf, other

    cs.LG cs.AI

    Information is Power: Intrinsic Control via Information Capture

    Authors: Nicholas Rhinehart, Jenny Wang, Glen Berseth, John D. Co-Reyes, Danijar Hafner, Chelsea Finn, Sergey Levine

    Abstract: Humans and animals explore their environment and acquire useful skills even in the absence of clear goals, exhibiting intrinsic motivation. The study of intrinsic motivation in artificial agents is concerned with the following question: what is a good general-purpose objective for an agent? We study this question in dynamic partially-observed environments, and argue that a compact and general lear… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: NeurIPS 2021

  22. arXiv:2107.13545  [pdf, other

    cs.LG cs.RO

    Fully Autonomous Real-World Reinforcement Learning with Applications to Mobile Manipulation

    Authors: Charles Sun, Jędrzej Orbik, Coline Devin, Brian Yang, Abhishek Gupta, Glen Berseth, Sergey Levine

    Abstract: We study how robots can autonomously learn skills that require a combination of navigation and grasping. While reinforcement learning in principle provides for automated robotic skill learning, in practice reinforcement learning in the real world is challenging and often requires extensive instrumentation and supervision. Our aim is to devise a robotic reinforcement learning system for learning na… ▽ More

    Submitted 6 December, 2021; v1 submitted 28 July, 2021; originally announced July 2021.

    Comments: 16 pages, Published at CoRL 2021

  23. arXiv:2107.07394  [pdf, other

    cs.LG cs.AI

    Explore and Control with Adversarial Surprise

    Authors: Arnaud Fickinger, Natasha Jaques, Samyak Parajuli, Michael Chang, Nicholas Rhinehart, Glen Berseth, Stuart Russell, Sergey Levine

    Abstract: Unsupervised reinforcement learning (RL) studies how to leverage environment statistics to learn useful behaviors without the cost of reward engineering. However, a central challenge in unsupervised RL is to extract behaviors that meaningfully affect the world and cover the range of possible outcomes, without getting distracted by inherently unpredictable, uncontrollable, and stochastic elements i… ▽ More

    Submitted 28 December, 2021; v1 submitted 12 July, 2021; originally announced July 2021.

  24. arXiv:2104.11707  [pdf, other

    cs.LG cs.AI cs.RO

    DisCo RL: Distribution-Conditioned Reinforcement Learning for General-Purpose Policies

    Authors: Soroush Nasiriany, Vitchyr H. Pong, Ashvin Nair, Alexander Khazatsky, Glen Berseth, Sergey Levine

    Abstract: Can we use reinforcement learning to learn general-purpose policies that can perform a wide range of different tasks, resulting in flexible and reusable skills? Contextual policies provide this capability in principle, but the representation of the context determines the degree of generalization and expressivity. Categorical contexts preclude generalization to entirely new tasks. Goal-conditioned… ▽ More

    Submitted 23 April, 2021; originally announced April 2021.

    Comments: ICRA 2021

  25. arXiv:2103.14295  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Reinforcement Learning for Robust Parameterized Locomotion Control of Bipedal Robots

    Authors: Zhongyu Li, Xuxin Cheng, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath

    Abstract: Developing robust walking controllers for bipedal robots is a challenging endeavor. Traditional model-based locomotion controllers require simplifying assumptions and careful modelling; any small errors can result in unstable control. To address these challenges for bipedal locomotion, we present a model-free reinforcement learning framework for training robust locomotion policies in simulation, w… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: To appear on 2021 International Conference on Robotics and Automation (ICRA 2021)

  26. arXiv:2006.12478  [pdf, other

    cs.LG cs.AI stat.ML

    Ecological Reinforcement Learning

    Authors: John D. Co-Reyes, Suvansh Sanjeev, Glen Berseth, Abhishek Gupta, Sergey Levine

    Abstract: Much of the current work on reinforcement learning studies episodic settings, where the agent is reset between trials to an initial state distribution, often with well-shaped reward functions. Non-episodic settings, where the agent must learn through continuous interaction with the world without resets, and where the agent receives only delayed and sparse reward signals, is substantially more diff… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: Preprint. Website at: https://sites.google.com/view/ecological-rl/home

  27. arXiv:1912.13360  [pdf, other

    cs.RO cs.CV

    Morphology-Agnostic Visual Robotic Control

    Authors: Brian Yang, Dinesh Jayaraman, Glen Berseth, Alexei Efros, Sergey Levine

    Abstract: Existing approaches for visuomotor robotic control typically require characterizing the robot in advance by calibrating the camera or performing system identification. We propose MAVRIC, an approach that works with minimal prior knowledge of the robot's morphology, and requires only a camera view containing the robot and its environment and an unknown control interface. MAVRIC revolves around a mu… ▽ More

    Submitted 31 December, 2019; originally announced December 2019.

  28. arXiv:1912.05510  [pdf, other

    cs.LG cs.AI stat.ML

    SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments

    Authors: Glen Berseth, Daniel Geng, Coline Devin, Nicholas Rhinehart, Chelsea Finn, Dinesh Jayaraman, Sergey Levine

    Abstract: Every living organism struggles against disruptive environmental forces to carve out and maintain an orderly niche. We propose that such a struggle to achieve and preserve order might offer a principle for the emergence of useful behaviors in artificial agents. We formalize this idea into an unsupervised reinforcement learning method called surprise minimizing reinforcement learning (SMiRL). SMiRL… ▽ More

    Submitted 7 February, 2021; v1 submitted 11 December, 2019; originally announced December 2019.

    Comments: ICLR 2021

    ACM Class: G.3

  29. arXiv:1912.02368  [pdf, other

    cs.LG cs.AI stat.ML

    Inter-Level Cooperation in Hierarchical Reinforcement Learning

    Authors: Abdul Rahman Kreidieh, Glen Berseth, Brandon Trabucco, Samyak Parajuli, Sergey Levine, Alexandre M. Bayen

    Abstract: Hierarchies of temporally decoupled policies present a promising approach for enabling structured exploration in complex long-term planning problems. To fully achieve this approach an end-to-end training paradigm is needed. However, training these multi-level policies has had limited success due to challenges arising from interactions between the goal-assigning and goal-achieving levels within a h… ▽ More

    Submitted 17 November, 2021; v1 submitted 4 December, 2019; originally announced December 2019.

  30. arXiv:1910.11670  [pdf, other

    cs.RO cs.CV cs.LG

    Contextual Imagined Goals for Self-Supervised Robotic Learning

    Authors: Ashvin Nair, Shikhar Bahl, Alexander Khazatsky, Vitchyr Pong, Glen Berseth, Sergey Levine

    Abstract: While reinforcement learning provides an appealing formalism for learning individual skills, a general-purpose robotic system must be able to master an extensive repertoire of behaviors. Instead of learning a large collection of skills individually, can we instead enable a robot to propose and practice its own behaviors automatically, learning about the affordances and behaviors that it can perfor… ▽ More

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: 12 pages, to be presented at Conference on Robot Learning (CoRL) 2019. Project website: https://ccrig.github.io/

  31. arXiv:1901.07186  [pdf, other

    cs.LG cs.RO stat.ML

    Towards Learning to Imitate from a Single Video Demonstration

    Authors: Glen Berseth, Florian Golemo, Christopher Pal

    Abstract: Agents that can learn to imitate given video observation -- \emph{without direct access to state or action information} are more applicable to learning in the natural world. However, formulating a reinforcement learning (RL) agent that facilitates this goal remains a significant challenge. We approach this challenge using contrastive training to learn a reward function comparing an agent's behavio… ▽ More

    Submitted 12 July, 2023; v1 submitted 22 January, 2019; originally announced January 2019.

    Comments: Published in JMLR. https://jmlr.org/papers/v24/21-1174.html

  32. arXiv:1804.06424  [pdf, other

    cs.AI cs.RO

    Terrain RL Simulator

    Authors: Glen Berseth, Xue Bin Peng, Michiel van de Panne

    Abstract: We provide $89$ challenging simulation environments that range in difficulty. The difficulty of solving a task is linked not only to the number of dimensions in the action space but also to the size and shape of the distribution of configurations the agent experiences. Therefore, we are releasing a number of simulation environments that include randomly generated terrain. The library also provides… ▽ More

    Submitted 17 April, 2018; originally announced April 2018.

    Comments: 10 pages

  33. arXiv:1803.05580  [pdf, other

    cs.RO

    Feedback Control For Cassie With Deep Reinforcement Learning

    Authors: Zhaoming Xie, Glen Berseth, Patrick Clary, Jonathan Hurst, Michiel van de Panne

    Abstract: Bipedal locomotion skills are challenging to develop. Control strategies often use local linearization of the dynamics in conjunction with reduced-order abstractions to yield tractable solutions. In these model-based control strategies, the controller is often not fully aware of many details, including torque limits, joint limits, and other non-linearities that are necessarily excluded from the co… ▽ More

    Submitted 27 July, 2018; v1 submitted 14 March, 2018; originally announced March 2018.

    Comments: 6 pages, 4 figures, accepted for IROS2018

  34. arXiv:1802.04765  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Progressive Reinforcement Learning with Distillation for Multi-Skilled Motion Control

    Authors: Glen Berseth, Cheng Xie, Paul Cernek, Michiel Van de Panne

    Abstract: Deep reinforcement learning has demonstrated increasing capabilities for continuous control problems, including agents that can move with skill and agility through their environment. An open problem in this setting is that of developing good strategies for integrating or merging policies for multiple skills, where each individual skill is a specialist in a specific skill and its associated state d… ▽ More

    Submitted 13 February, 2018; originally announced February 2018.

    Comments: 15 pages, Conference paper

  35. arXiv:1801.08607  [pdf, other

    cs.HC cs.CE

    Interactive Diversity Optimization of Environments

    Authors: Glen Berseth, Mahyar Khayatkhoei, Brandon Haworth, Muhammad Usman, Mubbasir Kapadia, Petros Faloutsos

    Abstract: The design of a building requires an architect to balance a wide range of constraints: aesthetic, geometric, usability, lighting, safety, etc. At the same time, there are often a multiplicity of diverse designs that can meet these constraints equally well. Architects must use their skills and artistic vision to explore these rich but highly constrained design spaces. A number of computer-aided des… ▽ More

    Submitted 22 January, 2018; originally announced January 2018.

    Comments: 20 pages

  36. arXiv:1801.03954  [pdf, other

    cs.AI

    Model-Based Action Exploration for Learning Dynamic Motion Skills

    Authors: Glen Berseth, Michiel van de Panne

    Abstract: Deep reinforcement learning has achieved great strides in solving challenging motion control tasks. Recently, there has been significant work on methods for exploiting the data gathered during training, but there has been less work on how to best generate the data to learn from. For continuous action domains, the most common method for generating exploratory actions involves sampling from a Gaussi… ▽ More

    Submitted 11 April, 2018; v1 submitted 11 January, 2018; originally announced January 2018.

    Comments: 7 pages, 7 figures, conference paper