Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Sima, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.10858  [pdf, other

    cs.LG cs.AI

    Knowledge Sharing and Transfer via Centralized Reward Agent for Multi-Task Reinforcement Learning

    Authors: Haozhe Ma, Zhengding Luo, Thanh Vinh Vo, Kuankuan Sima, Tze-Yun Leong

    Abstract: Reward shaping is effective in addressing the sparse-reward challenge in reinforcement learning by providing immediate feedback through auxiliary informative rewards. Based on the reward shaping strategy, we propose a novel multi-task reinforcement learning framework, that integrates a centralized reward agent (CRA) and multiple distributed policy agents. The CRA functions as a knowledge pool, whi… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  2. arXiv:2408.03029  [pdf, other

    cs.LG cs.AI

    Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning

    Authors: Haozhe Ma, Zhengding Luo, Thanh Vinh Vo, Kuankuan Sima, Tze-Yun Leong

    Abstract: Reward shaping addresses the challenge of sparse rewards in reinforcement learning by constructing denser and more informative reward signals. To achieve self-adaptive and highly efficient reward shaping, we propose a novel method that incorporates success rates derived from historical experiences into shaped rewards. Our approach utilizes success rates sampled from Beta distributions, which dynam… ▽ More

    Submitted 7 August, 2024; v1 submitted 6 August, 2024; originally announced August 2024.

  3. arXiv:2406.04858  [pdf, other

    cs.RO eess.SY

    Auto-Multilift: Distributed Learning and Control for Cooperative Load Transportation With Quadrotors

    Authors: Bingheng Wang, Rui Huang, Kuankuan Sima, Lin Zhao

    Abstract: Designing motion control and planning algorithms for multilift systems remains challenging due to the complexities of dynamics, collision avoidance, actuator limits, and scalability. Existing methods that use optimization and distributed techniques effectively address these constraints and scalability issues. However, they often require substantial manual tuning, leading to suboptimal performance.… ▽ More

    Submitted 15 July, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  4. arXiv:2308.03624  [pdf, other

    cs.RO cs.CV

    MOMA-Force: Visual-Force Imitation for Real-World Mobile Manipulation

    Authors: Taozheng Yang, Ya Jing, Hongtao Wu, Jiafeng Xu, Kuankuan Sima, Guangzeng Chen, Qie Sima, Tao Kong

    Abstract: In this paper, we present a novel method for mobile manipulators to perform multiple contact-rich manipulation tasks. While learning-based methods have the potential to generate actions in an end-to-end manner, they often suffer from insufficient action accuracy and robustness against noise. On the other hand, classical control-based methods can enhance system robustness, but at the cost of extens… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023