Zum Hauptinhalt springen

Showing 1–15 of 15 results for author: Engelcke, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.10179  [pdf, other

    cs.RO cs.AI cs.HC cs.LG

    Scaling Instructable Agents Across Many Simulated Worlds

    Authors: SIMA Team, Maria Abi Raad, Arun Ahuja, Catarina Barros, Frederic Besse, Andrew Bolt, Adrian Bolton, Bethanie Brownfield, Gavin Buttimore, Max Cant, Sarah Chakera, Stephanie C. Y. Chan, Jeff Clune, Adrian Collister, Vikki Copeman, Alex Cullum, Ishita Dasgupta, Dario de Cesare, Julia Di Trapani, Yani Donchev, Emma Dunleavy, Martin Engelcke, Ryan Faulkner, Frankie Garcia, Charles Gbadamosi , et al. (68 additional authors not shown)

    Abstract: Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructio… ▽ More

    Submitted 17 April, 2024; v1 submitted 13 March, 2024; originally announced April 2024.

  2. Reaching Through Latent Space: From Joint Statistics to Path Planning in Manipulation

    Authors: Chia-Man Hung, Shaohong Zhong, Walter Goodwin, Oiwi Parker Jones, Martin Engelcke, Ioannis Havoutis, Ingmar Posner

    Abstract: We present a novel approach to path planning for robotic manipulators, in which paths are produced via iterative optimisation in the latent space of a generative model of robot poses. Constraints are incorporated through the use of constraint satisfaction classifiers operating on the same space. Optimisation leverages gradients through our learned models that provide a simple way to combine goal r… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 10 pages, 6 figures, 4 tables

    ACM Class: I.2.6; I.2.9; I.2.10

    Journal ref: IEEE Robotics and Automation Letters 7.2 (2022): 5334-5341

  3. arXiv:2205.01179  [pdf, other

    cs.RO cs.LG

    VAE-Loco: Versatile Quadruped Locomotion by Learning a Disentangled Gait Representation

    Authors: Alexander L. Mitchell, Wolfgang Merkt, Mathieu Geisert, Siddhant Gangapurwala, Martin Engelcke, Oiwi Parker Jones, Ioannis Havoutis, Ingmar Posner

    Abstract: Quadruped locomotion is rapidly maturing to a degree where robots are able to realise highly dynamic manoeuvres. However, current planners are unable to vary key gait parameters of the in-swing feet midair. In this work we address this limitation and show that it is pivotal in increasing controller robustness by learning a latent space capturing the key stance phases constituting a particular gait… ▽ More

    Submitted 12 July, 2023; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: 16 pages, 13 figures, 1 table, accepted by IEEE Transactions on Robotics (T-RO) as an extended paper. arXiv admin note: substantial text overlap with arXiv:2112.04809

  4. arXiv:2112.04809  [pdf, other

    cs.RO cs.LG

    Next Steps: Learning a Disentangled Gait Representation for Versatile Quadruped Locomotion

    Authors: Alexander L. Mitchell, Wolfgang Merkt, Mathieu Geisert, Siddhant Gangapurwala, Martin Engelcke, Oiwi Parker Jones, Ioannis Havoutis, Ingmar Posner

    Abstract: Quadruped locomotion is rapidly maturing to a degree where robots now routinely traverse a variety of unstructured terrains. However, while gaits can be varied typically by selecting from a range of pre-computed styles, current planners are unable to vary key gait parameters continuously while the robot is in motion. The synthesis, on-the-fly, of gaits with unexpected operational characteristics o… ▽ More

    Submitted 29 March, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: 7 pages, 4 figures, accepted at IEEE International Conference on Robotics and Automation (ICRA), 2022

  5. arXiv:2107.01959  [pdf, other

    cs.LG stat.ML

    Universal Approximation of Functions on Sets

    Authors: Edward Wagstaff, Fabian B. Fuchs, Martin Engelcke, Michael A. Osborne, Ingmar Posner

    Abstract: Modelling functions of sets, or equivalently, permutation-invariant functions, is a long-standing challenge in machine learning. Deep Sets is a popular method which is known to be a universal approximator for continuous set functions. We provide a theoretical analysis of Deep Sets which shows that this universal approximation property is only guaranteed if the model's latent space is sufficiently… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: 54 pages, 13 figures

  6. arXiv:2105.14895  [pdf, other

    cs.RO

    APEX: Unsupervised, Object-Centric Scene Segmentation and Tracking for Robot Manipulation

    Authors: Yizhe Wu, Oiwi Parker Jones, Martin Engelcke, Ingmar Posner

    Abstract: Recent advances in unsupervised learning for object detection, segmentation, and tracking hold significant promise for applications in robotics. A common approach is to frame these tasks as inference in probabilistic latent-variable models. In this paper, however, we show that the current state-of-the-art struggles with visually complex scenes such as typically encountered in robot manipulation ta… ▽ More

    Submitted 12 September, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: 8 pages, 5 figures

    MSC Class: I.2.9

  7. arXiv:2104.09958  [pdf, other

    cs.CV cs.LG stat.ML

    GENESIS-V2: Inferring Unordered Object Representations without Iterative Refinement

    Authors: Martin Engelcke, Oiwi Parker Jones, Ingmar Posner

    Abstract: Advances in unsupervised learning of object-representations have culminated in the development of a broad range of methods for unsupervised object segmentation and interpretable object-centric scene generation. These methods, however, are limited to simulated and real-world datasets with limited visual complexity. Moreover, object representations are often inferred using RNNs which do not scale we… ▽ More

    Submitted 25 January, 2022; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: NeurIPS 2021 camera-ready version; 26 pages, 19 figures

  8. arXiv:2007.06245  [pdf, other

    cs.LG stat.ML

    Reconstruction Bottlenecks in Object-Centric Generative Models

    Authors: Martin Engelcke, Oiwi Parker Jones, Ingmar Posner

    Abstract: A range of methods with suitable inductive biases exist to learn interpretable object-centric representations of images without supervision. However, these are largely restricted to visually simple images; robust object discovery in real-world sensory datasets remains elusive. To increase the understanding of such inductive biases, we empirically investigate the role of "reconstruction bottlenecks… ▽ More

    Submitted 24 November, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: 10 pages, 7 Figures, Workshop on Object-Oriented Learning at ICML 2020

  9. arXiv:2007.01520  [pdf, other

    cs.RO cs.LG

    First Steps: Latent-Space Control with Semantic Constraints for Quadruped Locomotion

    Authors: Alexander L. Mitchell, Martin Engelcke, Oiwi Parker Jones, David Surovik, Siddhant Gangapurwala, Oliwier Melon, Ioannis Havoutis, Ingmar Posner

    Abstract: Traditional approaches to quadruped control frequently employ simplified, hand-derived models. This significantly reduces the capability of the robot since its effective kinematic range is curtailed. In addition, kinodynamic constraints are often non-differentiable and difficult to implement in an optimisation approach. In this work, these challenges are addressed by framing quadruped control as o… ▽ More

    Submitted 20 November, 2020; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: 8 pages, 7 figures, accepted at IROS 2020

  10. arXiv:2007.01272  [pdf, other

    cs.CV

    RELATE: Physically Plausible Multi-Object Scene Synthesis Using Structured Latent Spaces

    Authors: Sebastien Ehrhardt, Oliver Groth, Aron Monszpart, Martin Engelcke, Ingmar Posner, Niloy Mitra, Andrea Vedaldi

    Abstract: We present RELATE, a model that learns to generate physically plausible scenes and videos of multiple interacting objects. Similar to other generative approaches, RELATE is trained end-to-end on raw, unlabeled data. RELATE combines an object-centric GAN formulation with a model that explicitly accounts for correlations between individual objects. This allows the model to generate realistic scenes… ▽ More

    Submitted 9 November, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

  11. arXiv:1907.13052  [pdf, other

    cs.LG cs.CV cs.NE cs.RO stat.ML

    GENESIS: Generative Scene Inference and Sampling with Object-Centric Latent Representations

    Authors: Martin Engelcke, Adam R. Kosiorek, Oiwi Parker Jones, Ingmar Posner

    Abstract: Generative latent-variable models are emerging as promising tools in robotics and reinforcement learning. Yet, even though tasks in these domains typically involve distinct objects, most state-of-the-art generative models do not explicitly capture the compositional nature of visual scenes. Two recent exceptions, MONet and IODINE, decompose scenes into objects in an unsupervised fashion. Their unde… ▽ More

    Submitted 23 November, 2020; v1 submitted 30 July, 2019; originally announced July 2019.

    Comments: Published at the International Conference on Learning Representations (ICLR) 2020

  12. arXiv:1901.09006  [pdf, other

    cs.LG cs.AI cs.NE cs.RO stat.ML

    On the Limitations of Representing Functions on Sets

    Authors: Edward Wagstaff, Fabian B. Fuchs, Martin Engelcke, Ingmar Posner, Michael Osborne

    Abstract: Recent work on the representation of functions on sets has considered the use of summation in a latent space to enforce permutation invariance. In particular, it has been conjectured that the dimension of this latent space may remain fixed as the cardinality of the sets under consideration increases. However, we demonstrate that the analysis leading to this conjecture requires mappings which are h… ▽ More

    Submitted 7 October, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: Published at the International Conference on Machine Learning (2019)

  13. arXiv:1711.10275  [pdf, other

    cs.CV

    3D Semantic Segmentation with Submanifold Sparse Convolutional Networks

    Authors: Benjamin Graham, Martin Engelcke, Laurens van der Maaten

    Abstract: Convolutional networks are the de-facto standard for analyzing spatio-temporal data such as images, videos, and 3D shapes. Whilst some of this data is naturally dense (e.g., photos), many other data sources are inherently sparse. Examples include 3D point clouds that were obtained using a LiDAR scanner or RGB-D camera. Standard "dense" implementations of convolutional networks are very inefficient… ▽ More

    Submitted 28 November, 2017; originally announced November 2017.

    Comments: arXiv admin note: text overlap with arXiv:1706.01307

  14. arXiv:1710.06104  [pdf, other

    cs.CV

    Large-Scale 3D Shape Reconstruction and Segmentation from ShapeNet Core55

    Authors: Li Yi, Lin Shao, Manolis Savva, Haibin Huang, Yang Zhou, Qirui Wang, Benjamin Graham, Martin Engelcke, Roman Klokov, Victor Lempitsky, Yuan Gan, Pengyu Wang, Kun Liu, Fenggen Yu, Panpan Shui, Bingyang Hu, Yan Zhang, Yangyan Li, Rui Bu, Mingchao Sun, Wei Wu, Minki Jeong, Jaehoon Choi, Changick Kim, Angom Geetchandra , et al. (25 additional authors not shown)

    Abstract: We introduce a large-scale 3D shape understanding benchmark using data and annotation from ShapeNet 3D object database. The benchmark consists of two tasks: part-level segmentation of 3D shapes and 3D reconstruction from single view images. Ten teams have participated in the challenge and the best performing teams have outperformed state-of-the-art approaches on both tasks. A few novel deep learni… ▽ More

    Submitted 27 October, 2017; v1 submitted 17 October, 2017; originally announced October 2017.

  15. arXiv:1609.06666  [pdf, other

    cs.RO cs.AI cs.CV cs.LG cs.NE

    Vote3Deep: Fast Object Detection in 3D Point Clouds Using Efficient Convolutional Neural Networks

    Authors: Martin Engelcke, Dushyant Rao, Dominic Zeng Wang, Chi Hay Tong, Ingmar Posner

    Abstract: This paper proposes a computationally efficient approach to detecting objects natively in 3D point clouds using convolutional neural networks (CNNs). In particular, this is achieved by leveraging a feature-centric voting scheme to implement novel convolutional layers which explicitly exploit the sparsity encountered in the input. To this end, we examine the trade-off between accuracy and speed for… ▽ More

    Submitted 5 March, 2017; v1 submitted 21 September, 2016; originally announced September 2016.

    Comments: To be published at the IEEE International Conference on Robotics and Automation 2017