Zum Hauptinhalt springen

Showing 1–22 of 22 results for author: Wong, L L S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.19589  [pdf, other

    cs.LG

    Modeling Dynamics over Meshes with Gauge Equivariant Nonlinear Message Passing

    Authors: Jung Yeon Park, Lawson L. S. Wong, Robin Walters

    Abstract: Data over non-Euclidean manifolds, often discretized as surface meshes, naturally arise in computer graphics and biological and physical systems. In particular, solutions to partial differential equations (PDEs) over manifolds depend critically on the underlying geometry. While graph neural networks have been successfully applied to PDEs, they do not incorporate surface geometry and do not conside… ▽ More

    Submitted 2 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  2. arXiv:2310.10822  [pdf, other

    cs.RO cs.CV eess.SY

    Vision and Language Navigation in the Real World via Online Visual Language Mapping

    Authors: Chengguang Xu, Hieu T. Nguyen, Christopher Amato, Lawson L. S. Wong

    Abstract: Navigating in unseen environments is crucial for mobile robots. Enhancing them with the ability to follow instructions in natural language will further improve navigation efficiency in unseen cases. However, state-of-the-art (SOTA) vision-and-language navigation (VLN) methods are mainly evaluated in simulation, neglecting the complex and noisy real world. Directly transferring SOTA navigation poli… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  3. E(2)-Equivariant Graph Planning for Navigation

    Authors: Linfeng Zhao, Hongyu Li, Taskin Padir, Huaizu Jiang, Lawson L. S. Wong

    Abstract: Learning for robot navigation presents a critical and challenging task. The scarcity and costliness of real-world datasets necessitate efficient learning approaches. In this letter, we exploit Euclidean symmetry in planning for 2D navigation, which originates from Euclidean transformations between reference frames and enables parameter sharing. To address the challenges of unstructured environment… ▽ More

    Submitted 27 January, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: Accepted by RA-L

  4. arXiv:2307.08226  [pdf, other

    cs.LG cs.RO

    Can Euclidean Symmetry be Leveraged in Reinforcement Learning and Planning?

    Authors: Linfeng Zhao, Owen Howell, Jung Yeon Park, Xupeng Zhu, Robin Walters, Lawson L. S. Wong

    Abstract: In robotic tasks, changes in reference frames typically do not influence the underlying physical properties of the system, which has been known as invariance of physical laws.These changes, which preserve distance, encompass isometric transformations such as translations, rotations, and reflections, collectively known as the Euclidean group. In this work, we delve into the design of improved learn… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: Preprint. Website: http://lfzhao.com/SymCtrl

  5. arXiv:2306.12392  [pdf, other

    cs.RO cs.LG

    One-shot Imitation Learning via Interaction Warping

    Authors: Ondrej Biza, Skye Thompson, Kishore Reddy Pagidi, Abhinav Kumar, Elise van der Pol, Robin Walters, Thomas Kipf, Jan-Willem van de Meent, Lawson L. S. Wong, Robert Platt

    Abstract: Imitation learning of robot policies from few demonstrations is crucial in open-ended applications. We propose a new method, Interaction Warping, for learning SE(3) robotic manipulation policies from a single demonstration. We infer the 3D mesh of each object in the environment using shape warping, a technique for aligning point clouds across object instances. Then, we represent manipulation actio… ▽ More

    Submitted 4 November, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: CoRL 2023

  6. arXiv:2211.09231  [pdf, other

    cs.LG cs.RO

    The Surprising Effectiveness of Equivariant Models in Domains with Latent Symmetry

    Authors: Dian Wang, Jung Yeon Park, Neel Sortur, Lawson L. S. Wong, Robin Walters, Robert Platt

    Abstract: Extensive work has demonstrated that equivariant neural networks can significantly improve sample efficiency and generalization by enforcing an inductive bias in the network architecture. These applications typically assume that the domain symmetry is fully described by explicit transformations of the model inputs and outputs. However, many real-life applications contain only latent or partial sym… ▽ More

    Submitted 10 February, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: Published at ICLR 2023, notable top 25% (Spotlight)

  7. arXiv:2210.13542  [pdf, other

    cs.LG cs.AI cs.RO

    Scaling up and Stabilizing Differentiable Planning with Implicit Differentiation

    Authors: Linfeng Zhao, Huazhe Xu, Lawson L. S. Wong

    Abstract: Differentiable planning promises end-to-end differentiability and adaptivity. However, an issue prevents it from scaling up to larger-scale problems: they need to differentiate through forward iteration layers to compute gradients, which couples forward computation and backpropagation, and needs to balance forward planner performance and computational cost of the backward pass. To alleviate this i… ▽ More

    Submitted 1 May, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: ICLR 2023 camera-ready version. Website: http://lfzhao.com/IDPlan

  8. arXiv:2210.09337  [pdf, other

    cs.LG cs.AI

    Robust Imitation of a Few Demonstrations with a Backwards Model

    Authors: Jung Yeon Park, Lawson L. S. Wong

    Abstract: Behavior cloning of expert demonstrations can speed up learning optimal policies in a more sample-efficient way over reinforcement learning. However, the policy cannot extrapolate well to unseen states outside of the demonstration data, creating covariate shift (agent drifting away from demonstrations) and compounding errors. In this work, we tackle this issue by extending the region of attraction… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Conference on Neural Information Processing Systems (NeurIPS) 2022

  9. arXiv:2206.03674  [pdf, other

    cs.LG cs.AI cs.RO

    Integrating Symmetry into Differentiable Planning with Steerable Convolutions

    Authors: Linfeng Zhao, Xupeng Zhu, Lingzhi Kong, Robin Walters, Lawson L. S. Wong

    Abstract: We study how group symmetry helps improve data efficiency and generalization for end-to-end differentiable planning algorithms when symmetry appears in decision-making tasks. Motivated by equivariant convolution networks, we treat the path planning problem as \textit{signals} over grids. We show that value iteration in this case is a linear equivariant operator, which is a (steerable) convolution.… ▽ More

    Submitted 1 May, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: ICLR 2023 camera-ready version. Original name = "Integrating Symmetry into Differentiable Planning". Website: http://lfzhao.com/SymPlan

  10. arXiv:2204.13661  [pdf, other

    cs.LG cs.AI cs.RO

    Toward Compositional Generalization in Object-Oriented World Modeling

    Authors: Linfeng Zhao, Lingzhi Kong, Robin Walters, Lawson L. S. Wong

    Abstract: Compositional generalization is a critical ability in learning and decision-making. We focus on the setting of reinforcement learning in object-oriented environments to study compositional generalization in world modeling. We (1) formalize the compositional generalization problem with an algebraic approach and (2) study how a world model can achieve that. We introduce a conceptual environment, Obj… ▽ More

    Submitted 17 June, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

    Comments: ICML 2022 Long Presentation. Website: http://lfzhao.com/oowm/

  11. arXiv:2204.13022  [pdf, other

    cs.LG

    Binding Actions to Objects in World Models

    Authors: Ondrej Biza, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong, Thomas Kipf

    Abstract: We study the problem of binding actions to objects in object-factored world models using action-attention mechanisms. We propose two attention mechanisms for binding actions to objects, soft attention and hard attention, which we evaluate in the context of structured world models for five environments. Our experiments show that hard attention helps contrastively-trained structured world models to… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: Published at the ICLR 2022 workshop on Objects, Structure and Causality

  12. arXiv:2202.05333  [pdf, other

    cs.RO cs.LG

    Factored World Models for Zero-Shot Generalization in Robotic Manipulation

    Authors: Ondrej Biza, Thomas Kipf, David Klee, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong

    Abstract: World models for environments with many objects face a combinatorial explosion of states: as the number of objects increases, the number of possible arrangements grows exponentially. In this paper, we learn to generalize over robotic pick-and-place tasks using object-factored world models, which combat the combinatorial explosion by ensuring that predictions are equivariant to permutations of obje… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

  13. arXiv:2110.04441  [pdf, other

    cs.AI cs.CL cs.RO

    Natural Language for Human-Robot Collaboration: Problems Beyond Language Grounding

    Authors: Seth Pate, Wei Xu, Ziyi Yang, Maxwell Love, Siddarth Ganguri, Lawson L. S. Wong

    Abstract: To enable robots to instruct humans in collaborations, we identify several aspects of language processing that are not commonly studied in this context. These include location, planning, and generation. We suggest evaluations for each task, offer baselines for simple methods, and close by discussing challenges and opportunities in studying language for collaboration.

    Submitted 8 October, 2021; originally announced October 2021.

    Comments: 5 pages, 2 figures, Presented at AI-HRI symposium as part of AAAI-FSS 2021 (arXiv:2109.10836)

    Report number: AIHRI/2021/38

  14. arXiv:2110.03424  [pdf, other

    cs.LG cs.AI

    Bad-Policy Density: A Measure of Reinforcement Learning Hardness

    Authors: David Abel, Cameron Allen, Dilip Arumugam, D. Ellis Hershkowitz, Michael L. Littman, Lawson L. S. Wong

    Abstract: Reinforcement learning is hard in general. Yet, in many specific environments, learning is easy. What makes learning easy in one environment, but difficult in another? We address this question by proposing a simple measure of reinforcement-learning hardness called the bad-policy density. This quantity measures the fraction of the deterministic stationary policy space that is below a desired thresh… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: Presented at the 2021 ICML Workshop on Reinforcement Learning Theory

  15. arXiv:2106.03665  [pdf, other

    cs.RO cs.LG

    Hierarchical Robot Navigation in Novel Environments using Rough 2-D Maps

    Authors: Chengguang Xu, Christopher Amato, Lawson L. S. Wong

    Abstract: In robot navigation, generalizing quickly to unseen environments is essential. Hierarchical methods inspired by human navigation have been proposed, typically consisting of a high-level landmark proposer and a low-level controller. However, these methods either require precise high-level information to be given in advance or need to construct such guidance from extensive interaction with the envir… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: 21 pages, Conference on Robot Learning 2020, Boston, MA

  16. arXiv:2101.04178  [pdf, other

    cs.RO cs.LG

    Action Priors for Large Action Spaces in Robotics

    Authors: Ondrej Biza, Dian Wang, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong

    Abstract: In robotics, it is often not possible to learn useful policies using pure model-free reinforcement learning without significant reward shaping or curriculum learning. As a consequence, many researchers rely on expert demonstrations to guide learning. However, acquiring expert demonstrations can be expensive. This paper proposes an alternative approach where the solutions of previously solved tasks… ▽ More

    Submitted 15 February, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

    Comments: 13 pages, 9 figures

    Journal ref: Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS '21). 2021. 205 - 213

  17. arXiv:2010.05134  [pdf, other

    cs.RO cs.AI cs.LG

    Deep Imitation Learning for Bimanual Robotic Manipulation

    Authors: Fan Xie, Alexander Chowdhury, M. Clara De Paolis Kaluza, Linfeng Zhao, Lawson L. S. Wong, Rose Yu

    Abstract: We present a deep imitation learning framework for robotic bimanual manipulation in a continuous state-action space. A core challenge is to generalize the manipulation skills to objects in different locations. We hypothesize that modeling the relational information in the environment can significantly improve generalization. To achieve this, we propose to (i) decompose the multi-modal dynamics int… ▽ More

    Submitted 30 November, 2020; v1 submitted 10 October, 2020; originally announced October 2020.

  18. arXiv:2003.04300  [pdf, other

    cs.LG stat.ML

    Learning Discrete State Abstractions With Deep Variational Inference

    Authors: Ondrej Biza, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong

    Abstract: Abstraction is crucial for effective sequential decision making in domains with large state spaces. In this work, we propose an information bottleneck method for learning approximate bisimulations, a type of state abstraction. We use a deep neural encoder to map states onto continuous embeddings. We map these embeddings onto a discrete representation using an action-conditioned hidden Markov model… ▽ More

    Submitted 11 January, 2021; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: 15 pages, 7 figures

  19. arXiv:1707.08668  [pdf, other

    cs.AI cs.CL

    A Tale of Two DRAGGNs: A Hybrid Approach for Interpreting Action-Oriented and Goal-Oriented Instructions

    Authors: Siddharth Karamcheti, Edward C. Williams, Dilip Arumugam, Mina Rhee, Nakul Gopalan, Lawson L. S. Wong, Stefanie Tellex

    Abstract: Robots operating alongside humans in diverse, stochastic environments must be able to accurately interpret natural language commands. These instructions often fall into one of two categories: those that specify a goal condition or target state, and those that specify explicit actions, or how to perform a given task. Recent approaches have used reward functions as a semantic representation of goal-… ▽ More

    Submitted 26 July, 2017; originally announced July 2017.

    Comments: Accepted at the 1st Workshop on Language Grounding for Robotics at ACL 2017

  20. arXiv:1706.00536  [pdf, other

    cs.AI

    Modeling Latent Attention Within Neural Networks

    Authors: Christopher Grimm, Dilip Arumugam, Siddharth Karamcheti, David Abel, Lawson L. S. Wong, Michael L. Littman

    Abstract: Deep neural networks are able to solve tasks across a variety of domains and modalities of data. Despite many empirical successes, we lack the ability to clearly understand and interpret the learned internal mechanisms that contribute to such effective behaviors or, more critically, failure modes. In this work, we present a general method for visualizing an arbitrary neural network's inner mechani… ▽ More

    Submitted 30 December, 2017; v1 submitted 1 June, 2017; originally announced June 2017.

  21. Accurately and Efficiently Interpreting Human-Robot Instructions of Varying Granularities

    Authors: Dilip Arumugam, Siddharth Karamcheti, Nakul Gopalan, Lawson L. S. Wong, Stefanie Tellex

    Abstract: Humans can ground natural language commands to tasks at both abstract and fine-grained levels of specificity. For instance, a human forklift operator can be instructed to perform a high-level action, like "grab a pallet" or a low-level action like "tilt back a little bit." While robots are also capable of grounding language commands to tasks, previous methods implicitly assume that all commands an… ▽ More

    Submitted 19 June, 2018; v1 submitted 21 April, 2017; originally announced April 2017.

    Comments: Updated with final version - Published as Conference Paper in Robotics: Science and Systems 2017

  22. arXiv:1512.00573  [pdf, other

    cs.AI cs.LG cs.RO

    Object-based World Modeling in Semi-Static Environments with Dependent Dirichlet-Process Mixtures

    Authors: Lawson L. S. Wong, Thanard Kurutach, Leslie Pack Kaelbling, Tomás Lozano-Pérez

    Abstract: To accomplish tasks in human-centric indoor environments, robots need to represent and understand the world in terms of objects and their attributes. We refer to this attribute-based representation as a world model, and consider how to acquire it via noisy perception and maintain it over time, as objects are added, changed, and removed in the world. Previous work has framed this as multiple-target… ▽ More

    Submitted 1 December, 2015; originally announced December 2015.