Zum Hauptinhalt springen

Showing 1–28 of 28 results for author: Kormushev, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.01334  [pdf, other

    cs.RO cs.AI cs.CV cs.HC

    A Backbone for Long-Horizon Robot Task Understanding

    Authors: Xiaoshuai Chen, Wei Chen, Dongmyoung Lee, Yukun Ge, Nicolas Rojas, Petar Kormushev

    Abstract: End-to-end robot learning, particularly for long-horizon tasks, often results in unpredictable outcomes and poor generalization. To address these challenges, we propose a novel Therblig-based Backbone Framework (TBBF) to enhance robot task understanding and transferability. This framework uses therbligs (basic action elements) as the backbone to decompose high-level robot tasks into elemental robo… ▽ More

    Submitted 7 August, 2024; v1 submitted 2 August, 2024; originally announced August 2024.

    Comments: 8 pages, 8 figures. This work is intended to be submitted to IEEE Robotics and Automation Letters (RA-L) for possible publication

  2. arXiv:2309.14266  [pdf, other

    cs.RO

    The Hydra Hand: A Mode-Switching Underactuated Gripper with Precision and Power Grasping Modes

    Authors: Digby Chappell, Fernando Bello, Petar Kormushev, Nicolas Rojas

    Abstract: Human hands are able to grasp a wide range of object sizes, shapes, and weights, achieved via reshaping and altering their apparent grasping stiffness between compliant power and rigid precision. Achieving similar versatility in robotic hands remains a challenge, which has often been addressed by adding extra controllable degrees of freedom, tactile sensors, or specialised extra grasping hardware,… ▽ More

    Submitted 26 September, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: This paper has been accepted for publication in IEEE Robotics and Automation Letters. For the purpose of open access, the author(s) has applied a Creative Commons Attribution (CC BY) license to any Accepted Manuscript version arising. 8 pages, 11 figures

  3. arXiv:2302.07345  [pdf, other

    cs.RO math.NA

    When and Where to Step: Terrain-Aware Real-Time Footstep Location and Timing Optimization for Bipedal Robots

    Authors: Ke Wang, Zhaoyang Jacopo Hu, Peter Tisnikar, Oskar Helander, Digby Chappell, Petar Kormushev

    Abstract: Online footstep planning is essential for bipedal walking robots, allowing them to walk in the presence of disturbances and sensory noise. Most of the literature on the topic has focused on optimizing the footstep placement while keeping the step timing constant. In this work, we introduce a footstep planner capable of optimizing footstep placement and step time online. The proposed planner, consi… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: 32 pages, 15 figures. Submitted to Robotics and Autonomous Systems

  4. arXiv:2112.06061  [pdf, other

    cs.RO cs.LG

    OstrichRL: A Musculoskeletal Ostrich Simulation to Study Bio-mechanical Locomotion

    Authors: Vittorio La Barbera, Fabio Pardo, Yuval Tassa, Monica Daley, Christopher Richards, Petar Kormushev, John Hutchinson

    Abstract: Muscle-actuated control is a research topic that spans multiple domains, including biomechanics, neuroscience, reinforcement learning, robotics, and graphics. This type of control is particularly challenging as bodies are often overactuated and dynamics are delayed and non-linear. It is however a very well tested and tuned actuation mechanism that has undergone millions of years of evolution with… ▽ More

    Submitted 24 May, 2022; v1 submitted 11 December, 2021; originally announced December 2021.

    Comments: https://github.com/vittorione94/ostrichrl

  5. arXiv:2109.09373  [pdf, other

    cs.RO

    Fast Online Optimization for Terrain-Blind Bipedal Robot Walking with a Decoupled Actuated SLIP Model

    Authors: Ke Wang, Hengyi Fei, Petar Kormushev

    Abstract: We present a highly reactive controller which enables bipedal robots to blindly walk over various kinds of uneven terrains while resisting pushes. The high level motion planner does fast online optimization for footstep locations and Center of Mass (CoM) height using the decoupled actuated Spring Loaded Inverted Pendulum (aSLIP) model. The decoupled aSLIP model simplifies the original aSLIP with L… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: 8 pages, 8 figures, submitted to ICRA 2022

  6. arXiv:2109.04581  [pdf, other

    cs.RO eess.SY

    A Unified Model with Inertia Shaping for Highly Dynamic Jumps of Legged Robots

    Authors: Ke Wang, Guiyang Xin, Songyan Xin, Michael Mistry, Sethu Vijayakumar, Petar Kormushev

    Abstract: To achieve highly dynamic jumps of legged robots, it is essential to control the rotational dynamics of the robot. In this paper, we aim to improve the jumping performance by proposing a unified model for planning highly dynamic jumps that can approximately model the centroidal inertia. This model abstracts the robot as a single rigid body for the base and point masses for the legs. The model is c… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: 8 pages

  7. Policy Manifold Search: Exploring the Manifold Hypothesis for Diversity-based Neuroevolution

    Authors: Nemanja Rakicevic, Antoine Cully, Petar Kormushev

    Abstract: Neuroevolution is an alternative to gradient-based optimisation that has the potential to avoid local minima and allows parallelisation. The main limiting factor is that usually it does not scale well with parameter space dimensionality. Inspired by recent work examining neural network intrinsic dimension and loss landscapes, we hypothesise that there exists a low-dimensional manifold, embedded in… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: Accepted as a full paper at Genetic and Evolutionary Computation Conference, GECCO 2021. arXiv admin note: substantial text overlap with arXiv:2012.08676

  8. arXiv:2012.08676  [pdf, other

    cs.LG cs.NE

    Policy Manifold Search for Improving Diversity-based Neuroevolution

    Authors: Nemanja Rakicevic, Antoine Cully, Petar Kormushev

    Abstract: Diversity-based approaches have recently gained popularity as an alternative paradigm to performance-based policy search. A popular approach from this family, Quality-Diversity (QD), maintains a collection of high-performing policies separated in the diversity-metric space, defined based on policies' rollout behaviours. When policies are parameterised as neural networks, i.e. Neuroevolution, QD te… ▽ More

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: Paper accepted as oral (8% acceptance rate) at Beyond Backpropagation: Novel Ideas for Training Neural Architectures Workshop at NeurIPS 2020

  9. arXiv:2010.14680  [pdf, other

    cs.LG stat.ML

    Learning to Represent Action Values as a Hypergraph on the Action Vertices

    Authors: Arash Tavakoli, Mehdi Fatemi, Petar Kormushev

    Abstract: Action-value estimation is a critical component of many reinforcement learning (RL) methods whereby sample complexity relies heavily on how fast a good estimator for action value can be learned. By viewing this problem through the lens of representation learning, good representations of both state and action can facilitate action-value estimation. While advances in deep learning have seamlessly dr… ▽ More

    Submitted 20 June, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: ICLR 2021, code: https://github.com/atavakol/action-hypergraph-networks

  10. arXiv:2007.00385  [pdf, other

    cs.RO eess.SY

    Asynchronous Real-Time Optimization of Footstep Placement and Timing in Bipedal Walking Robots

    Authors: Digby Chappell, Ke Wang, Petar Kormushev

    Abstract: Online footstep planning is essential for bipedal walking robots to be able to walk in the presence of disturbances. Until recently this has been achieved by only optimizing the placement of the footstep, keeping the duration of the step constant. In this paper we introduce a footstep planner capable of optimizing footstep placement and timing in real-time by asynchronously combining two optimizer… ▽ More

    Submitted 2 July, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: 8 pages, 8 figures, journal paper

  11. Sim-to-Real Learning for Casualty Detection from Ground Projected Point Cloud Data

    Authors: Roni Permana Saputra, Nemanja Rakicevic, Petar Kormushev

    Abstract: This paper addresses the problem of human body detection---particularly a human body lying on the ground (a.k.a. casualty)---using point cloud data. This ability to detect a casualty is one of the most important features of mobile rescue robots, in order for them to be able to operate autonomously. We propose a deep-learning-based casualty detection method using a deep convolutional neural network… ▽ More

    Submitted 9 August, 2019; v1 submitted 8 August, 2019; originally announced August 2019.

    Comments: 10 pages, 10 figures, accepted to the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2019

    Journal ref: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Macau, China, 2019, pp. 3918-3925

  12. ResQbot: A Mobile Rescue Robot with Immersive Teleperception for Casualty Extraction

    Authors: Roni Permana Saputra, Petar Kormushev

    Abstract: In this work, we propose a novel mobile rescue robot equipped with an immersive stereoscopic teleperception and a teleoperation control. This robot is designed with the capability to perform safely a casualty-extraction procedure. We have built a proof-of-concept mobile rescue robot called ResQbot for the experimental platform. An approach called "loco-manipulation" is used to perform the casualty… ▽ More

    Submitted 21 December, 2018; originally announced December 2018.

    Comments: Published in TAROS 2018

    Journal ref: Saputra R.P., Kormushev P. (2018) ResQbot: A Mobile Rescue Robot with Immersive Teleperception for Casualty Extraction. In: Towards Autonomous Robotic Systems. TAROS 2018. Lecture Notes in Computer Science, vol 10965. Springer, Cham

  13. Casualty Detection from 3D Point Cloud Data for Autonomous Ground Mobile Rescue Robots

    Authors: Roni Permana Saputra, Petar Kormushev

    Abstract: One of the most important features of mobile rescue robots is the ability to autonomously detect casualties, i.e. human bodies, which are usually lying on the ground. This paper proposes a novel method for autonomously detecting casualties lying on the ground using obtained 3D point-cloud data from an on-board sensor, such as an RGB-D camera or a 3D LIDAR, on a mobile rescue robot. In this method,… ▽ More

    Submitted 21 December, 2018; originally announced December 2018.

    Comments: Published in SSRR 2018 Conference

    Journal ref: R. P. Saputra and P. Kormushev, "Casualty Detection from 3D Point Cloud Data for Autonomous Ground Mobile Rescue Robots," 2018 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR), Philadelphia, PA, 2018, pp. 1-7

  14. arXiv:1811.11298  [pdf, other

    cs.LG cs.AI stat.ML

    Exploring Restart Distributions

    Authors: Arash Tavakoli, Vitaly Levdik, Riashat Islam, Christopher M. Smith, Petar Kormushev

    Abstract: We consider the generic approach of using an experience memory to help exploration by adapting a restart distribution. That is, given the capacity to reset the state with those corresponding to the agent's past observations, we help exploration by promoting faster state-space coverage via restarting the agent from a more diverse set of initial states, as well as allowing it to restart in states as… ▽ More

    Submitted 17 August, 2020; v1 submitted 27 November, 2018; originally announced November 2018.

    Comments: RLDM 2019

  15. arXiv:1810.09786  [pdf, other

    cs.RO

    Human-centered manipulation and navigation with Robot DE NIRO

    Authors: Fabian Falck, Sagar Doshi, Nico Smuts, John Lingi, Kim Rants, Petar Kormushev

    Abstract: Social assistance robots in health and elderly care have the potential to support and ease human lives. Given the macrosocial trends of aging and long-lived populations, robotics-based care research mainly focused on helping the elderly live independently. In this paper, we introduce Robot DE NIRO, a research platform that aims to support the supporter (the caregiver) and also offers direct human-… ▽ More

    Submitted 23 October, 2018; originally announced October 2018.

    Comments: In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2018) Workshop "Towards Robots that Exhibit Manipulation Intelligence", Madrid, Spain, Oct. 1, 2018

  16. arXiv:1810.02927  [pdf, other

    cs.LG stat.ML

    Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks

    Authors: Fabio Pardo, Vitaly Levdik, Petar Kormushev

    Abstract: Being able to reach any desired location in the environment can be a valuable asset for an agent. Learning a policy to navigate between all pairs of states individually is often not feasible. An all-goals updating algorithm uses each transition to learn Q-values towards all goals simultaneously and off-policy. However the expensive numerous updates in parallel limited the approach to small tabular… ▽ More

    Submitted 4 February, 2020; v1 submitted 5 October, 2018; originally announced October 2018.

    Comments: AAAI 2020, https://sites.google.com/view/q-map-rl

  17. arXiv:1807.02078  [pdf, other

    cs.LG stat.ML

    Goal-oriented Trajectories for Efficient Exploration

    Authors: Fabio Pardo, Vitaly Levdik, Petar Kormushev

    Abstract: Exploration is a difficult challenge in reinforcement learning and even recent state-of-the art curiosity-based methods rely on the simple epsilon-greedy strategy to generate novelty. We argue that pure random walks do not succeed to properly expand the exploration area in most environments and propose to replace single random action choices by random goals selection followed by several steps in t… ▽ More

    Submitted 5 July, 2018; originally announced July 2018.

    Comments: ICML 2018 Exploration in RL Workshop, videos: https://sites.google.com/view/got-exploration

  18. arXiv:1712.00378  [pdf, other

    cs.LG

    Time Limits in Reinforcement Learning

    Authors: Fabio Pardo, Arash Tavakoli, Vitaly Levdik, Petar Kormushev

    Abstract: In reinforcement learning, it is common to let an agent interact for a fixed amount of time with its environment before resetting it and repeating the process in a series of episodes. The task that the agent has to learn can either be to maximize its performance over (i) that fixed period, or (ii) an indefinite period where time limits are only used during training to diversify experience. In this… ▽ More

    Submitted 27 January, 2022; v1 submitted 1 December, 2017; originally announced December 2017.

    Comments: ICML 2018, NIPS 2017 Deep RL Symposium, code and videos: https://sites.google.com/view/time-limits-in-rl

    Journal ref: PMLR 80: 4042-4051 (2018)

  19. arXiv:1711.08946  [pdf, other

    cs.LG cs.AI

    Action Branching Architectures for Deep Reinforcement Learning

    Authors: Arash Tavakoli, Fabio Pardo, Petar Kormushev

    Abstract: Discrete-action algorithms have been central to numerous recent successes of deep reinforcement learning. However, applying these algorithms to high-dimensional action tasks requires tackling the combinatorial increase of the number of possible actions with the number of action dimensions. This problem is further exacerbated for continuous-action tasks that require fine control of actions via disc… ▽ More

    Submitted 24 January, 2019; v1 submitted 24 November, 2017; originally announced November 2017.

    Comments: AAAI 2018, NIPS 2017 Deep RL Symposium, code: https://github.com/atavakol/action-branching-agents

    Journal ref: AAAI 32: 4131-4138 (2018)

  20. arXiv:1706.00989  [pdf, other

    cs.RO cs.AI

    Visuospatial Skill Learning for Robots

    Authors: S. Reza Ahmadzadeh, Fulvio Mastrogiovanni, Petar Kormushev

    Abstract: A novel skill learning approach is proposed that allows a robot to acquire human-like visuospatial skills for object manipulation tasks. Visuospatial skills are attained by observing spatial relationships among objects through demonstrations. The proposed Visuospatial Skill Learning (VSL) is a goal-based approach that focuses on achieving a desired goal configuration of objects relative to one ano… ▽ More

    Submitted 3 June, 2017; originally announced June 2017.

    Comments: 24 pages, 36 figures

    MSC Class: 68T40

  21. arXiv:0904.1631  [pdf

    cs.RO cs.AI cs.HC

    Intent expression using eye robot for mascot robot system

    Authors: Yoichi Yamazaki, Fangyan Dong, Yuta Masuda, Yukiko Uehara, Petar Kormushev, Hai An Vu, Phuc Quang Le, Kaoru Hirota

    Abstract: An intent expression system using eye robots is proposed for a mascot robot system from a viewpoint of humatronics. The eye robot aims at providing a basic interface method for an information terminal robot system. To achieve better understanding of the displayed information, the importance and the degree of certainty of the information should be communicated along with the main content. The pro… ▽ More

    Submitted 9 April, 2009; originally announced April 2009.

    Comments: 5 pages

    Journal ref: 8th International Symposium on Advanced Intelligent Systems (ISIS2007), pp. 576-580, 2007

  22. arXiv:0904.1629  [pdf

    cs.RO cs.AI cs.HC

    Fuzzy inference based mentality estimation for eye robot agent

    Authors: Yoichi Yamazaki, Fangyan Dong, Yuta Masuda, Yukiko Uehara, Petar Kormushev, Hai An Vu, Phuc Quang Le, Kaoru Hirota

    Abstract: Household robots need to communicate with human beings in a friendly fashion. To achieve better understanding of displayed information, an importance and a certainty of the information should be communicated together with the main information. The proposed intent expression system aims to convey this additional information using an eye robot. The eye motions are represented as states in a pleasu… ▽ More

    Submitted 9 April, 2009; originally announced April 2009.

    Comments: 2 pages, in Japanese

    Journal ref: Proceedings of 23rd Fuzzy System Symposium (FSS 2007), pp. 387-388, 2007

  23. arXiv:0904.0546  [pdf

    cs.AI cs.LG cs.RO

    Eligibility Propagation to Speed up Time Hopping for Reinforcement Learning

    Authors: Petar Kormushev, Kohei Nomoto, Fangyan Dong, Kaoru Hirota

    Abstract: A mechanism called Eligibility Propagation is proposed to speed up the Time Hopping technique used for faster Reinforcement Learning in simulations. Eligibility Propagation provides for Time Hopping similar abilities to what eligibility traces provide for conventional Reinforcement Learning. It propagates values from one state to all of its temporal predecessors using a state transitions graph.… ▽ More

    Submitted 3 April, 2009; originally announced April 2009.

    Comments: 7 pages

  24. arXiv:0904.0545   

    cs.AI cs.LG cs.RO

    Time Hopping technique for faster reinforcement learning in simulations

    Authors: Petar Kormushev, Kohei Nomoto, Fangyan Dong, Kaoru Hirota

    Abstract: This preprint has been withdrawn by the author for revision

    Submitted 6 September, 2011; v1 submitted 3 April, 2009; originally announced April 2009.

    Comments: This preprint has been withdrawn by the author for revision

  25. arXiv:0904.0313  [pdf

    cs.IR cs.DB

    Visual approach for data mining on medical information databases using Fastmap algorithm

    Authors: Petar Kormushev

    Abstract: The rapid development of tools for acquisition and storage of information has lead to the formation of enormous medical databases. The large quantity of data definitely surpasses the abilities of humans for efficient usage without specialized tools for analysis. The situation is described as rich in data, but poor in information. In order to fill this growing gap, different approaches from the f… ▽ More

    Submitted 2 April, 2009; originally announced April 2009.

    Comments: Master's Thesis in Bio- and Medical Informatics, 76 pages, in Bulgarian. Submitted to Faculty of Mathematics and Informatics, Sofia University, 2006

  26. arXiv:0904.0300  [pdf

    cs.AI cs.LO

    Design, development and implementation of a tool for construction of declarative functional descriptions of semantic web services based on WSMO methodology

    Authors: Petar Kormushev

    Abstract: Semantic web services (SWS) are self-contained, self-describing, semantically marked-up software resources that can be published, discovered, composed and executed across the Web in a semi-automatic way. They are a key component of the future Semantic Web, in which networked computer programs become providers and users of information at the same time. This work focuses on developing a full-life-… ▽ More

    Submitted 2 April, 2009; originally announced April 2009.

    Comments: Master's Thesis in Artificial Intelligence, 105 pages, in Bulgarian. Submitted to Faculty of Mathematics and Informatics, Sofia University, 2005

  27. arXiv:0904.0293   

    cs.SE

    INFRAWEBS axiom editor - a graphical ontology-driven tool for creating complex logical expressions

    Authors: Gennady Agre, Petar Kormushev, Ivan Dilov

    Abstract: The current INFRAWEBS European research project aims at developing ICT framework enabling software and service providers to generate and establish open and extensible development platforms for Web Service applications. One of the concrete project objectives is developing a full-life-cycle software toolset for creating and maintaining Semantic Web Services (SWSs) supporting specific applications ba… ▽ More

    Submitted 7 January, 2012; v1 submitted 1 April, 2009; originally announced April 2009.

    Comments: This preprint has been withdrawn by the author for revision

    ACM Class: H.5.2

    Journal ref: International Journal of Information Theories and Applications, vol. 13, no. 2, ISSN 1310-0513, pp. 169-178, 2006

  28. arXiv:0903.4930  [pdf

    cs.AI cs.LG cs.RO

    Time manipulation technique for speeding up reinforcement learning in simulations

    Authors: Petar Kormushev, Kohei Nomoto, Fangyan Dong, Kaoru Hirota

    Abstract: A technique for speeding up reinforcement learning algorithms by using time manipulation is proposed. It is applicable to failure-avoidance control problems running in a computer simulation. Turning the time of the simulation backwards on failure events is shown to speed up the learning by 260% and improve the state space exploration by 12% on the cart-pole balancing task, compared to the conven… ▽ More

    Submitted 27 March, 2009; originally announced March 2009.

    Comments: 12 pages

    Journal ref: International Journal of Cybernetics and Information Technologies, vol. 8, no. 1, pp. 12-24, 2008