Zum Hauptinhalt springen

Showing 1–15 of 15 results for author: Rybkin, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.14853  [pdf, other

    cs.LG cs.AI cs.RO

    Privileged Sensing Scaffolds Reinforcement Learning

    Authors: Edward S. Hu, James Springer, Oleh Rybkin, Dinesh Jayaraman

    Abstract: We need to look at our shoelaces as we first learn to tie them but having mastered this skill, can do it from touch alone. We call this phenomenon "sensory scaffolding": observation streams that are not needed by a master might yet aid a novice learner. We consider such sensory scaffolding setups for training artificial agents. For example, a robot arm may need to be deployed with just a low-cost,… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: ICLR 2024 Spotlight version

  2. arXiv:2310.08887  [pdf, other

    cs.LG cs.AI cs.RO

    METRA: Scalable Unsupervised RL with Metric-Aware Abstraction

    Authors: Seohong Park, Oleh Rybkin, Sergey Levine

    Abstract: Unsupervised pre-training strategies have proven to be highly effective in natural language processing and computer vision. Likewise, unsupervised reinforcement learning (RL) holds the promise of discovering a variety of potentially useful behaviors that can accelerate the learning of a wide array of downstream tasks. Previous unsupervised RL approaches have mainly focused on pure exploration and… ▽ More

    Submitted 9 March, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  3. arXiv:2303.13002  [pdf, other

    cs.LG cs.AI cs.RO

    Planning Goals for Exploration

    Authors: Edward S. Hu, Richard Chang, Oleh Rybkin, Dinesh Jayaraman

    Abstract: Dropped into an unknown environment, what should an agent do to quickly learn about the environment and how to accomplish diverse tasks within it? We address this question within the goal-conditioned reinforcement learning paradigm, by identifying how the agent should set its goals at training time to maximize exploration. We propose "Planning Exploratory Goals" (PEG), a method that sets goals for… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: Camera Ready version for ICLR2023 Spotlight

  4. arXiv:2210.12719  [pdf, other

    cs.LG cs.AI

    Learning General World Models in a Handful of Reward-Free Deployments

    Authors: Yingchen Xu, Jack Parker-Holder, Aldo Pacchiano, Philip J. Ball, Oleh Rybkin, Stephen J. Roberts, Tim Rocktäschel, Edward Grefenstette

    Abstract: Building generally capable agents is a grand challenge for deep reinforcement learning (RL). To approach this challenge practically, we outline two key desiderata: 1) to facilitate generalization, exploration should be task agnostic; 2) to facilitate scalability, exploration policies should collect large quantities of data without costly centralized retraining. Combining these two properties, we i… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: To be published at NeurIPS 2022. Code and videos available at https://ycxuyingchen.github.io/cascade/

  5. arXiv:2110.09514  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Discovering and Achieving Goals via World Models

    Authors: Russell Mendonca, Oleh Rybkin, Kostas Daniilidis, Danijar Hafner, Deepak Pathak

    Abstract: How can artificial agents learn to solve many diverse tasks in complex visual environments in the absence of any supervision? We decompose this question into two problems: discovering new goals and learning to reliably achieve them. We introduce Latent Explorer Achiever (LEXA), a unified solution to these that learns a world model from image inputs and uses it to train an explorer and an achiever… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021. First two authors contributed equally. Website at https://orybkin.github.io/lexa/

  6. arXiv:2107.09047  [pdf, other

    cs.LG cs.CV cs.RO

    Know Thyself: Transferable Visual Control Policies Through Robot-Awareness

    Authors: Edward S. Hu, Kun Huang, Oleh Rybkin, Dinesh Jayaraman

    Abstract: Training visual control policies from scratch on a new robot typically requires generating large amounts of robot-specific data. How might we leverage data previously collected on another robot to reduce or even completely remove this need for robot-specific data? We propose a "robot-aware control" paradigm that achieves this by exploiting readily available knowledge about the robot. We then insta… ▽ More

    Submitted 17 October, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: Updated to ICLR22 version

  7. arXiv:2106.13229  [pdf, other

    cs.LG cs.AI cs.RO

    Model-Based Reinforcement Learning via Latent-Space Collocation

    Authors: Oleh Rybkin, Chuning Zhu, Anusha Nagabandi, Kostas Daniilidis, Igor Mordatch, Sergey Levine

    Abstract: The ability to plan into the future while utilizing only raw high-dimensional observations, such as images, can provide autonomous agents with broad capabilities. Visual model-based reinforcement learning (RL) methods that plan future actions directly have shown impressive results on tasks that require only short-horizon reasoning, however, these methods struggle on temporally extended tasks. We a… ▽ More

    Submitted 7 August, 2021; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: International Conference on Machine Learning (ICML), 2021. Videos and code at https://orybkin.github.io/latco/

  8. arXiv:2011.06507  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    Reinforcement Learning with Videos: Combining Offline Observations with Interaction

    Authors: Karl Schmeckpeper, Oleh Rybkin, Kostas Daniilidis, Sergey Levine, Chelsea Finn

    Abstract: Reinforcement learning is a powerful framework for robots to acquire skills from experience, but often requires a substantial amount of online data collection. As a result, it is difficult to collect sufficiently diverse experiences that are needed for robots to generalize broadly. Videos of humans, on the other hand, are a readily available source of broad and interesting experiences. In this pap… ▽ More

    Submitted 4 November, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

    Journal ref: Conference on Robot Learning (2020)

  9. arXiv:2006.13205  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors

    Authors: Karl Pertsch, Oleh Rybkin, Frederik Ebert, Chelsea Finn, Dinesh Jayaraman, Sergey Levine

    Abstract: The ability to predict and plan into the future is fundamental for agents acting in the world. To reach a faraway goal, we predict trajectories at multiple timescales, first devising a coarse plan towards the goal and then gradually filling in details. In contrast, current learning approaches for visual prediction and planning fail on long-horizon tasks as they generate predictions (1) without con… ▽ More

    Submitted 27 November, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: Project page: orybkin.github.io/video-gcp. KP and OR contributed equally

  10. arXiv:2006.13202  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Simple and Effective VAE Training with Calibrated Decoders

    Authors: Oleh Rybkin, Kostas Daniilidis, Sergey Levine

    Abstract: Variational autoencoders (VAEs) provide an effective and simple method for modeling complex distributions. However, training VAEs often requires considerable hyperparameter tuning to determine the optimal amount of information retained by the latent variable. We study the impact of calibrated decoders, which learn the uncertainty of the decoding distribution and can determine this amount of inform… ▽ More

    Submitted 12 July, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: International Conference on Machine Learning (ICML), 2021. Project website is at https://orybkin.github.io/sigma-vae/

  11. arXiv:2005.05960  [pdf, other

    cs.LG cs.AI cs.CV cs.NE cs.RO stat.ML

    Planning to Explore via Self-Supervised World Models

    Authors: Ramanan Sekar, Oleh Rybkin, Kostas Daniilidis, Pieter Abbeel, Danijar Hafner, Deepak Pathak

    Abstract: Reinforcement learning allows solving complex tasks, however, the learning tends to be task-specific and the sample efficiency remains a challenge. We present Plan2Explore, a self-supervised reinforcement learning agent that tackles both these challenges through a new approach to self-supervised exploration and fast adaptation to new tasks, which need not be known during exploration. During explor… ▽ More

    Submitted 30 June, 2020; v1 submitted 12 May, 2020; originally announced May 2020.

    Comments: Accepted at ICML 2020. Videos and code at https://ramanans1.github.io/plan2explore/

  12. arXiv:1912.12773  [pdf, other

    cs.LG cs.RO stat.ML

    Learning Predictive Models From Observation and Interaction

    Authors: Karl Schmeckpeper, Annie Xie, Oleh Rybkin, Stephen Tian, Kostas Daniilidis, Sergey Levine, Chelsea Finn

    Abstract: Learning predictive models from interaction with the world allows an agent, such as a robot, to learn about how the world works, and then use this learned model to plan coordinated sequences of actions to bring about desired outcomes. However, learning a model that captures the dynamics of complex skills represents a major challenge: if the agent needs a good model to perform these skills, it migh… ▽ More

    Submitted 29 December, 2019; originally announced December 2019.

  13. arXiv:1904.05869  [pdf, other

    cs.LG cs.CV stat.ML

    Keyframing the Future: Keyframe Discovery for Visual Prediction and Planning

    Authors: Karl Pertsch, Oleh Rybkin, Jingyun Yang, Shenghao Zhou, Konstantinos G. Derpanis, Kostas Daniilidis, Joseph Lim, Andrew Jaegle

    Abstract: Temporal observations such as videos contain essential information about the dynamics of the underlying scene, but they are often interleaved with inessential, predictable details. One way of dealing with this problem is by focusing on the most informative moments in a sequence. We propose a model that learns to discover these important events and the times when they occur and uses them to represe… ▽ More

    Submitted 7 May, 2020; v1 submitted 11 April, 2019; originally announced April 2019.

    Comments: Conference on Learning for Dynamics and Control, 2020. Website: https://sites.google.com/view/keyin/home

  14. arXiv:1806.09655  [pdf, other

    cs.LG cs.CV stat.ML

    Learning what you can do before doing anything

    Authors: Oleh Rybkin, Karl Pertsch, Konstantinos G. Derpanis, Kostas Daniilidis, Andrew Jaegle

    Abstract: Intelligent agents can learn to represent the action spaces of other agents simply by observing them act. Such representations help agents quickly learn to predict the effects of their own actions on the environment and to plan complex action sequences. In this work, we address the problem of learning an agent's action space purely from visual observation. We use stochastic video prediction to lea… ▽ More

    Submitted 12 February, 2019; v1 submitted 25 June, 2018; originally announced June 2018.

    Comments: Published at ICLR 2019. 10 pages + 15 pages of references and appendices

    Journal ref: International Conference on Learning Representations, 2019

  15. arXiv:1803.09760  [pdf, other

    cs.CV cs.AI cs.LG cs.NE

    Predicting the Future with Transformational States

    Authors: Andrew Jaegle, Oleh Rybkin, Konstantinos G. Derpanis, Kostas Daniilidis

    Abstract: An intelligent observer looks at the world and sees not only what is, but what is moving and what can be moved. In other words, the observer sees how the present state of the world can transform in the future. We propose a model that predicts future images by learning to represent the present state and its transformation given only a sequence of images. To do so, we introduce an architecture with… ▽ More

    Submitted 26 March, 2018; originally announced March 2018.

    Comments: 24 pages, including supplement