Zum Hauptinhalt springen

Showing 1–14 of 14 results for author: Ze, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.20328  [pdf, other

    cs.RO cs.LG

    Learning Visual Quadrupedal Loco-Manipulation from Demonstrations

    Authors: Zhengmao He, Kun Lei, Yanjie Ze, Koushil Sreenath, Zhongyu Li, Huazhe Xu

    Abstract: Quadruped robots are progressively being integrated into human environments. Despite the growing locomotion capabilities of quadrupedal robots, their interaction with objects in realistic scenes is still limited. While additional robotic arms on quadrupedal robots enable manipulating objects, they are sometimes redundant given that a quadruped robot is essentially a mobile unit equipped with four… ▽ More

    Submitted 2 August, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

    Comments: Published at IROS 2024. Project website: https://zhengmaohe.github.io/leg-manip

  2. arXiv:2403.03954  [pdf, other

    cs.RO cs.CV cs.LG

    3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

    Authors: Yanjie Ze, Gu Zhang, Kangning Zhang, Chenyuan Hu, Muhan Wang, Huazhe Xu

    Abstract: Imitation learning provides an efficient way to teach robots dexterous skills; however, learning complex skills robustly and generalizablely usually consumes large amounts of human demonstrations. To tackle this challenging problem, we present 3D Diffusion Policy (DP3), a novel visual imitation learning approach that incorporates the power of 3D visual representations into diffusion policies, a cl… ▽ More

    Submitted 8 June, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Published at Robotics: Science and Systems (RSS) 2024. Videos, code, and data: https://3d-diffusion-policy.github.io

  3. arXiv:2312.17116  [pdf, other

    cs.LG cs.CV cs.RO

    Generalizable Visual Reinforcement Learning with Segment Anything Model

    Authors: Ziyu Wang, Yanjie Ze, Yifei Sun, Zhecheng Yuan, Huazhe Xu

    Abstract: Learning policies that can generalize to unseen environments is a fundamental challenge in visual reinforcement learning (RL). While most current methods focus on acquiring robust visual representations through auxiliary supervision, pre-training, or data augmentation, the potential of modern vision foundation models remains underleveraged. In this work, we introduce Segment Anything Model for Gen… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: Project page and code: https://yanjieze.com/SAM-G/

  4. arXiv:2312.14134  [pdf, other

    cs.LG cs.CV cs.RO

    Diffusion Reward: Learning Rewards via Conditional Video Diffusion

    Authors: Tao Huang, Guangqi Jiang, Yanjie Ze, Huazhe Xu

    Abstract: Learning rewards from expert videos offers an affordable and effective solution to specify the intended behaviors for reinforcement learning (RL) tasks. In this work, we propose Diffusion Reward, a novel framework that learns rewards from expert videos via conditional video diffusion models for solving complex visual RL problems. Our key insight is that lower generative diversity is exhibited when… ▽ More

    Submitted 8 August, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted to ECCV 2024. Project page and code: https://diffusion-reward.github.io/

  5. arXiv:2310.20587  [pdf, other

    cs.LG

    Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning

    Authors: Ruizhe Shi, Yuyao Liu, Yanjie Ze, Simon S. Du, Huazhe Xu

    Abstract: Offline reinforcement learning (RL) aims to find a near-optimal policy using pre-collected datasets. In real-world scenarios, data collection could be costly and risky; therefore, offline RL becomes particularly challenging when the in-domain data is limited. Given recent advances in Large Language Models (LLMs) and their few-shot learning prowess, this paper introduces $\textbf{La}$nguage Models… ▽ More

    Submitted 27 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: 24 pages, 16 tables

  6. arXiv:2310.19668  [pdf, other

    cs.LG cs.CV

    DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization

    Authors: Guowei Xu, Ruijie Zheng, Yongyuan Liang, Xiyao Wang, Zhecheng Yuan, Tianying Ji, Yu Luo, Xiaoyu Liu, Jiaxin Yuan, Pu Hua, Shuzhen Li, Yanjie Ze, Hal Daumé III, Furong Huang, Huazhe Xu

    Abstract: Visual reinforcement learning (RL) has shown promise in continuous control tasks. Despite its progress, current algorithms are still unsatisfactory in virtually every aspect of the performance such as sample efficiency, asymptotic performance, and their robustness to the choice of random seeds. In this paper, we identify a major shortcoming in existing visual RL methods that is the agents often ex… ▽ More

    Submitted 13 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted at The Twelfth International Conference on Learning Representations (ICLR 2024)

  7. arXiv:2310.01404  [pdf, other

    cs.LG cs.CV cs.RO

    H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation

    Authors: Yanjie Ze, Yuyao Liu, Ruizhe Shi, Jiaxin Qin, Zhecheng Yuan, Jiashun Wang, Huazhe Xu

    Abstract: Human hands possess remarkable dexterity and have long served as a source of inspiration for robotic manipulation. In this work, we propose a human $\textbf{H}$and$\textbf{-In}$formed visual representation learning framework to solve difficult $\textbf{Dex}$terous manipulation tasks ($\textbf{H-InDex}$) with reinforcement learning. Our framework consists of three stages: (i) pre-training represent… ▽ More

    Submitted 12 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023. Code and videos: https://yanjieze.com/H-InDex

  8. arXiv:2308.16891  [pdf, other

    cs.RO cs.CV cs.LG

    GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields

    Authors: Yanjie Ze, Ge Yan, Yueh-Hua Wu, Annabella Macaluso, Yuying Ge, Jianglong Ye, Nicklas Hansen, Li Erran Li, Xiaolong Wang

    Abstract: It is a long-standing problem in robotics to develop agents capable of executing diverse manipulation tasks from visual observations in unstructured real-world environments. To achieve this goal, the robot needs to have a comprehensive understanding of the 3D structure and semantics of the scene. In this work, we present $\textbf{GNFactor}$, a visual behavior cloning agent for multi-task robotic m… ▽ More

    Submitted 27 July, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: CoRL 2023 Oral. Website: https://yanjieze.com/GNFactor/

  9. arXiv:2308.09902  [pdf, other

    cs.LG

    DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning

    Authors: Canzhe Zhao, Yanjie Ze, Jing Dong, Baoxiang Wang, Shuai Li

    Abstract: Communication lays the foundation for cooperation in human society and in multi-agent reinforcement learning (MARL). Humans also desire to maintain their privacy when communicating with others, yet such privacy concern has not been considered in existing works in MARL. To this end, we propose the \textit{differentially private multi-agent communication} (DPMAC) algorithm, which protects the sensit… ▽ More

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: Full version; Accepted in IJCAI 2023

  10. arXiv:2307.00972  [pdf, other

    cs.LG cs.CV cs.RO

    MoVie: Visual Model-Based Policy Adaptation for View Generalization

    Authors: Sizhe Yang, Yanjie Ze, Huazhe Xu

    Abstract: Visual Reinforcement Learning (RL) agents trained on limited views face significant challenges in generalizing their learned abilities to unseen views. This inherent difficulty is known as the problem of $\textit{view generalization}$. In this work, we systematically categorize this fundamental problem into four distinct and highly challenging scenarios that closely resemble real-world situations.… ▽ More

    Submitted 27 September, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: Accepted in NeurIPS 2023. The first two authors contribute equally

  11. arXiv:2212.05749  [pdf, other

    cs.LG cs.CV cs.RO

    On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline

    Authors: Nicklas Hansen, Zhecheng Yuan, Yanjie Ze, Tongzhou Mu, Aravind Rajeswaran, Hao Su, Huazhe Xu, Xiaolong Wang

    Abstract: In this paper, we examine the effectiveness of pre-training for visuo-motor control tasks. We revisit a simple Learning-from-Scratch (LfS) baseline that incorporates data augmentation and a shallow ConvNet, and find that this baseline is surprisingly competitive with recent approaches (PVR, MVP, R3M) that leverage frozen visual representations trained on large-scale vision datasets -- across a var… ▽ More

    Submitted 15 June, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: Code: https://github.com/gemcollector/learning-from-scratch

  12. arXiv:2210.07241  [pdf, other

    cs.LG cs.RO

    Visual Reinforcement Learning with Self-Supervised 3D Representations

    Authors: Yanjie Ze, Nicklas Hansen, Yinbo Chen, Mohit Jain, Xiaolong Wang

    Abstract: A prominent approach to visual Reinforcement Learning (RL) is to learn an internal state representation using self-supervised methods, which has the potential benefit of improved sample-efficiency and generalization through additional learning signal and inductive biases. However, while the real world is inherently 3D, prior efforts have largely been focused on leveraging 2D computer vision techni… ▽ More

    Submitted 15 March, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted in RA-L 2023 and IROS 2023. Project page: https://yanjieze.com/3d4rl/

  13. arXiv:2201.10447  [pdf, other

    cs.LG

    Differentially Private Temporal Difference Learning with Stochastic Nonconvex-Strongly-Concave Optimization

    Authors: Canzhe Zhao, Yanjie Ze, Jing Dong, Baoxiang Wang, Shuai Li

    Abstract: Temporal difference (TD) learning is a widely used method to evaluate policies in reinforcement learning. While many TD learning methods have been developed in recent years, little attention has been paid to preserving privacy and most of the existing approaches might face the concerns of data privacy from users. To enable complex representative abilities of policies, in this paper, we consider pr… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

  14. arXiv:2011.11974  [pdf, other

    cs.CV

    UKPGAN: A General Self-Supervised Keypoint Detector

    Authors: Yang You, Wenhai Liu, Yanjie Ze, Yong-Lu Li, Weiming Wang, Cewu Lu

    Abstract: Keypoint detection is an essential component for the object registration and alignment. In this work, we reckon keypoint detection as information compression, and force the model to distill out irrelevant points of an object. Based on this, we propose UKPGAN, a general self-supervised 3D keypoint detector where keypoints are detected so that they could reconstruct the original object shape. Two mo… ▽ More

    Submitted 9 March, 2022; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: Accepted to CVPR2022