Zum Hauptinhalt springen

Showing 1–6 of 6 results for author: Shentu, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.04798  [pdf, other

    cs.RO cs.AI

    From LLMs to Actions: Latent Codes as Bridges in Hierarchical Robot Control

    Authors: Yide Shentu, Philipp Wu, Aravind Rajeswaran, Pieter Abbeel

    Abstract: Hierarchical control for robotics has long been plagued by the need to have a well defined interface layer to communicate between high-level task planners and low-level policies. With the advent of LLMs, language has been emerging as a prospective interface layer. However, this has several limitations. Not all tasks can be decomposed into steps that are easily expressible in natural language (e.g.… ▽ More

    Submitted 8 July, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  2. arXiv:2309.13037  [pdf, other

    cs.RO

    GELLO: A General, Low-Cost, and Intuitive Teleoperation Framework for Robot Manipulators

    Authors: Philipp Wu, Yide Shentu, Zhongke Yi, Xingyu Lin, Pieter Abbeel

    Abstract: Humans can teleoperate robots to accomplish complex manipulation tasks. Imitation learning has emerged as a powerful framework that leverages human teleoperated demonstrations to teach robots new skills. However, the performance of the learned policies is bottlenecked by the quality, scale, and variety of the demonstration data. In this paper, we aim to lower the barrier to collecting large and hi… ▽ More

    Submitted 18 July, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

  3. arXiv:2307.12909  [pdf, other

    cs.CV

    Dyn-E: Local Appearance Editing of Dynamic Neural Radiance Fields

    Authors: Shangzhan Zhang, Sida Peng, Yinji ShenTu, Qing Shuai, Tianrun Chen, Kaicheng Yu, Hujun Bao, Xiaowei Zhou

    Abstract: Recently, the editing of neural radiance fields (NeRFs) has gained considerable attention, but most prior works focus on static scenes while research on the appearance editing of dynamic scenes is relatively lacking. In this paper, we propose a novel framework to edit the local appearance of dynamic NeRFs by manipulating pixels in a single frame of training video. Specifically, to locally edit the… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: project page: https://dyn-e.github.io/

  4. arXiv:2210.07424  [pdf, other

    cs.CV

    Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction

    Authors: YuXuan Liu, Nikhil Mishra, Maximilian Sieb, Yide Shentu, Pieter Abbeel, Xi Chen

    Abstract: 3D bounding boxes are a widespread intermediate representation in many computer vision applications. However, predicting them is a challenging task, largely due to partial observability, which motivates the need for a strong sense of uncertainty. While many recent methods have explored better architectures for consuming sparse and unstructured point cloud data, we hypothesize that there is room fo… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: In ECCV 2022. Code and dataset are available at https://bbox.yuxuanliu.com

  5. arXiv:1806.08354  [pdf, other

    cs.CV cs.AI cs.LG cs.RO stat.ML

    Learning Instance Segmentation by Interaction

    Authors: Deepak Pathak, Yide Shentu, Dian Chen, Pulkit Agrawal, Trevor Darrell, Sergey Levine, Jitendra Malik

    Abstract: We present an approach for building an active agent that learns to segment its visual observations into individual objects by interacting with its environment in a completely self-supervised manner. The agent uses its current segmentation model to infer pixels that constitute objects and refines the segmentation model by interacting with these pixels. The model learned from over 50K interactions g… ▽ More

    Submitted 21 June, 2018; originally announced June 2018.

    Comments: Website at https://pathak22.github.io/seg-by-interaction/

  6. arXiv:1804.08606  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Zero-Shot Visual Imitation

    Authors: Deepak Pathak, Parsa Mahmoudieh, Guanghao Luo, Pulkit Agrawal, Dian Chen, Yide Shentu, Evan Shelhamer, Jitendra Malik, Alexei A. Efros, Trevor Darrell

    Abstract: The current dominant paradigm for imitation learning relies on strong supervision of expert actions to learn both 'what' and 'how' to imitate. We pursue an alternative paradigm wherein an agent first explores the world without any expert supervision and then distills its experience into a goal-conditioned skill policy with a novel forward consistency loss. In our framework, the role of the expert… ▽ More

    Submitted 23 April, 2018; originally announced April 2018.

    Comments: Oral presentation at ICLR 2018. Website at https://pathak22.github.io/zeroshot-imitation/