
Showing 1–11 of 11 results for author: Vezzani, G

Searching in archive cs.
  1. arXiv:2306.11706

    cs.RO cs.LG

    RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation

    Authors: Konstantinos Bousmalis, Giulia Vezzani, Dushyant Rao, Coline Devin, Alex X. Lee, Maria Bauza, Todor Davchev, Yuxiang Zhou, Agrim Gupta, Akhil Raju, Antoine Laurens, Claudio Fantacci, Valentin Dalibard, Martina Zambelli, Murilo Martins, Rugile Pevceviciute, Michiel Blokzijl, Misha Denil, Nathan Batchelor, Thomas Lampe, Emilio Parisotto, Konrad Żołna, Scott Reed, Sergio Gómez Colmenarejo, Jon Scholz, et al. (14 additional authors not shown)

    Abstract: The ability to leverage heterogeneous robotic experience from different robots and tasks to quickly master novel skills and embodiments has the potential to transform robot learning. Inspired by recent advances in foundation models for vision and language, we propose a multi-embodiment, multi-task generalist agent for robotic manipulation. This agent, named RoboCat, is a visual goal-conditioned de…

    Submitted 22 December, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Transactions on Machine Learning Research (12/2023)

  2. arXiv:2211.13743

    cs.LG cs.AI cs.RO

    SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration

    Authors: Giulia Vezzani, Dhruva Tirumala, Markus Wulfmeier, Dushyant Rao, Abbas Abdolmaleki, Ben Moran, Tuomas Haarnoja, Jan Humplik, Roland Hafner, Michael Neunert, Claudio Fantacci, Tim Hertweck, Thomas Lampe, Fereshteh Sadeghi, Nicolas Heess, Martin Riedmiller

    Abstract: The ability to effectively reuse prior knowledge is a key requirement when building general and flexible Reinforcement Learning (RL) agents. Skill reuse is one of the most common approaches, but current methods have considerable limitations. For example, fine-tuning an existing policy frequently fails, as the policy can degrade rapidly early in training. In a similar vein, distillation of expert be…

    Submitted 11 January, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

  3. arXiv:2112.05062

    cs.LG cs.AI cs.RO

    Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies

    Authors: Dushyant Rao, Fereshteh Sadeghi, Leonard Hasenclever, Markus Wulfmeier, Martina Zambelli, Giulia Vezzani, Dhruva Tirumala, Yusuf Aytar, Josh Merel, Nicolas Heess, Raia Hadsell

    Abstract: For robots operating in the real world, it is desirable to learn reusable behaviours that can effectively be transferred and adapted to numerous tasks and scenarios. We propose an approach to learn abstract motor skills from data using a hierarchical mixture latent variable model. In contrast to existing work, our method exploits a three-level hierarchy of both discrete and continuous latent varia…

    Submitted 14 March, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

  4. arXiv:2109.08603

    cs.LG cs.NE cs.RO

    Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration

    Authors: Oliver Groth, Markus Wulfmeier, Giulia Vezzani, Vibhavari Dasagi, Tim Hertweck, Roland Hafner, Nicolas Heess, Martin Riedmiller

    Abstract: Curiosity-based reward schemes can present powerful exploration mechanisms which facilitate the discovery of solutions for complex, sparse or long-horizon tasks. However, as the agent learns to reach previously unexplored spaces and the objective adapts to reward new areas, many behaviours emerge only to disappear due to being overwritten by the constantly shifting objective. We argue that merely…

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: 14 pages, 7 figures, 2 tables

    ACM Class: I.2.6; I.2.9

  5. arXiv:2106.08199

    cs.LG cs.RO

    On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning

    Authors: Abbas Abdolmaleki, Sandy H. Huang, Giulia Vezzani, Bobak Shahriari, Jost Tobias Springenberg, Shruti Mishra, Dhruva TB, Arunkumar Byravan, Konstantinos Bousmalis, Andras Gyorgy, Csaba Szepesvari, Raia Hadsell, Nicolas Heess, Martin Riedmiller

    Abstract: Many advances that have improved the robustness and efficiency of deep reinforcement learning (RL) algorithms can, in one way or another, be understood as introducing additional objectives or constraints in the policy optimization step. This includes ideas as far ranging as exploration bonuses, entropy regularization, and regularization toward teachers or data priors. Often, the task reward and au…

    Submitted 1 August, 2023; v1 submitted 15 June, 2021; originally announced June 2021.

  6. arXiv:2010.15492

    cs.RO

    "What, not how": Solving an under-actuated insertion task from scratch

    Authors: Giulia Vezzani, Michael Neunert, Markus Wulfmeier, Rae Jeong, Thomas Lampe, Noah Siegel, Roland Hafner, Abbas Abdolmaleki, Martin Riedmiller, Francesco Nori

    Abstract: Robot manipulation requires a complex set of skills that need to be carefully combined and coordinated to solve a task. Yet, most Reinforcement Learning (RL) approaches in robotics study tasks which actually consist only of a single manipulation skill, such as grasping an object or inserting a pre-grasped object. As a result the skill ('how' to solve the task) but not the actual goal of a complete…

    Submitted 30 October, 2020; v1 submitted 29 October, 2020; originally announced October 2020.

  7. GRASPA 1.0: GRASPA is a Robot Arm graSping Performance benchmArk

    Authors: Fabrizio Bottarel, Giulia Vezzani, Ugo Pattacini, Lorenzo Natale

    Abstract: The use of benchmarks is a widespread and scientifically meaningful practice to validate performance of different approaches to the same task. In the context of robot grasping the use of common object sets has emerged in recent years, however no dominant protocols and metrics to test grasping pipelines have taken root yet. In this paper, we present version 1.0 of GRASPA, a benchmark to test effect…

    Submitted 12 February, 2020; originally announced February 2020.

    Comments: To cite this work, please refer to the journal reference entry. For more information, code, pictures and video please visit https://github.com/robotology/GRASPA-benchmark

    Journal ref: in IEEE Robotics and Automation Letters, vol. 5, no. 2, pp. 836-843, April 2020

  8. arXiv:1905.12621

    cs.LG stat.ML

    Learning latent state representation for speeding up exploration

    Authors: Giulia Vezzani, Abhishek Gupta, Lorenzo Natale, Pieter Abbeel

    Abstract: Exploration is an extremely challenging problem in reinforcement learning, especially in high dimensional state and action spaces and when only sparse rewards are available. Effective representations can indicate which components of the state are task relevant and thus reduce the dimensionality of the space to explore. In this work, we take a representation learning viewpoint on exploration, utili…

    Submitted 27 May, 2019; originally announced May 2019.

    Comments: 7 pages, 8 figures, workshop

    Journal ref: 2nd Exploration in Reinforcement Learning Workshop at the 36th International Conference on Machine Learning, 2019

  9. arXiv:1710.04465

    cs.RO eess.SY stat.CO

    Markerless visual servoing on unknown objects for humanoid robot platforms

    Authors: Claudio Fantacci, Giulia Vezzani, Ugo Pattacini, Vadim Tikhanoff, Lorenzo Natale

    Abstract: To precisely reach for an object with a humanoid robot, it is of central importance to have good knowledge of both end-effector, object pose and shape. In this work we propose a framework for markerless visual servoing on unknown objects, which is divided in four main parts: I) a least-squares minimization problem is formulated to find the volume of the object graspable by the robot's hand using i…

    Submitted 12 October, 2017; originally announced October 2017.

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2018

  10. arXiv:1709.10087

    cs.LG cs.AI cs.RO

    Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations

    Authors: Aravind Rajeswaran, Vikash Kumar, Abhishek Gupta, Giulia Vezzani, John Schulman, Emanuel Todorov, Sergey Levine

    Abstract: Dexterous multi-fingered hands are extremely versatile and provide a generic way to perform a multitude of tasks in human-centric environments. However, effectively controlling them remains challenging due to their high dimensionality and large number of potential contacts. Deep reinforcement learning (DRL) provides a model-agnostic approach to control complex dynamical systems, but has not been s…

    Submitted 26 June, 2018; v1 submitted 28 September, 2017; originally announced September 2017.

    Comments: Accepted for presentation at Robotics: Science and Systems (RSS) 2018. Project page: https://sites.google.com/view/deeprl-dexterous-manipulation

  11. Memory Unscented Particle Filter for 6-DOF Tactile Localization

    Authors: Giulia Vezzani, Ugo Pattacini, Giorgio Battistelli, Luigi Chisci, Lorenzo Natale

    Abstract: This paper addresses 6-DOF (degree-of-freedom) tactile localization, i.e. the pose estimation of tridimensional objects given tactile measurements. This estimation problem is fundamental for the operation of autonomous robots that are often required to manipulate and grasp objects whose pose is a-priori unknown. The nature of tactile measurements, the strict time requirements for real-time operati…

    Submitted 10 November, 2016; v1 submitted 10 July, 2016; originally announced July 2016.

    Journal ref: IEEE Transactions on Robotics, Volume 33, Issue 5, October 2017