Zum Hauptinhalt springen

Showing 1–15 of 15 results for author: Kosiorek, A R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2301.05747  [pdf, other

    cs.CV cs.AI

    Laser: Latent Set Representations for 3D Generative Modeling

    Authors: Pol Moreno, Adam R. Kosiorek, Heiko Strathmann, Daniel Zoran, Rosalia G. Schneider, Björn Winckler, Larisa Markeeva, Théophane Weber, Danilo J. Rezende

    Abstract: NeRF provides unparalleled fidelity of novel view synthesis: rendering a 3D scene from an arbitrary viewpoint. NeRF requires training on a large number of views that fully cover a scene, which limits its applicability. While these issues can be addressed by learning a prior over scenes in various forms, previous approaches have been either applied to overly simple scenes or struggling to render un… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: See https://laser-nv-paper.github.io/ for video results

  2. arXiv:2201.13100  [pdf, other

    cs.CV cs.LG

    Adversarial Masking for Self-Supervised Learning

    Authors: Yuge Shi, N. Siddharth, Philip H. S. Torr, Adam R. Kosiorek

    Abstract: We propose ADIOS, a masked image model (MIM) framework for self-supervised learning, which simultaneously learns a masking function and an image encoder using an adversarial objective. The image encoder is trained to minimise the distance between representations of the original and that of a masked image. The masking function, conversely, aims at maximising this distance. ADIOS consistently improv… ▽ More

    Submitted 6 July, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

  3. arXiv:2104.01148  [pdf, other

    cs.CV cs.LG stat.ML

    Decomposing 3D Scenes into Objects via Unsupervised Volume Segmentation

    Authors: Karl Stelzner, Kristian Kersting, Adam R. Kosiorek

    Abstract: We present ObSuRF, a method which turns a single image of a scene into a 3D model represented as a set of Neural Radiance Fields (NeRFs), with each NeRF corresponding to a different object. A single forward pass of an encoder network outputs a set of latent vectors describing the objects in the scene. These vectors are used independently to condition a NeRF decoder, defining the geometry and appea… ▽ More

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: 15 pages, 3 figures. For project page with videos, see http://stelzner.github.io/obsurf/

  4. arXiv:2104.00587  [pdf, other

    stat.ML cs.LG

    NeRF-VAE: A Geometry Aware 3D Scene Generative Model

    Authors: Adam R. Kosiorek, Heiko Strathmann, Daniel Zoran, Pol Moreno, Rosalia Schneider, Soňa Mokrá, Danilo J. Rezende

    Abstract: We propose NeRF-VAE, a 3D scene generative model that incorporates geometric structure via NeRF and differentiable volume rendering. In contrast to NeRF, our model takes into account shared structure across scenes, and is able to infer the structure of a novel scene -- without the need to re-train -- using amortized inference. NeRF-VAE's explicit 3D rendering process further contrasts previous gen… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

    Comments: 17 pages, 15 figures, under review

  5. arXiv:2006.16841  [pdf, other

    cs.CV cs.LG

    Conditional Set Generation with Transformers

    Authors: Adam R Kosiorek, Hyunjik Kim, Danilo J Rezende

    Abstract: A set is an unordered collection of unique elements--and yet many machine learning models that generate sets impose an implicit or explicit ordering. Since model performance can depend on the choice of order, any particular ordering can lead to sub-optimal results. An alternative solution is to use a permutation-equivariant set generator, which does not specify an order-ing. An example of such a g… ▽ More

    Submitted 1 July, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: 6 pages, 6 figures, ICML 2020 Workshop on Object-Oriented Learning

  6. arXiv:1912.02738  [pdf, other

    stat.ML cs.LG

    MetaFun: Meta-Learning with Iterative Functional Updates

    Authors: Jin Xu, Jean-Francois Ton, Hyunjik Kim, Adam R. Kosiorek, Yee Whye Teh

    Abstract: We develop a functional encoder-decoder approach to supervised meta-learning, where labeled data is encoded into an infinite-dimensional functional representation rather than a finite-dimensional one. Furthermore, rather than directly producing the representation, we learn a neural update rule resembling functional gradient descent which iteratively improves the representation. The final represent… ▽ More

    Submitted 16 August, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

  7. arXiv:1907.13052  [pdf, other

    cs.LG cs.CV cs.NE cs.RO stat.ML

    GENESIS: Generative Scene Inference and Sampling with Object-Centric Latent Representations

    Authors: Martin Engelcke, Adam R. Kosiorek, Oiwi Parker Jones, Ingmar Posner

    Abstract: Generative latent-variable models are emerging as promising tools in robotics and reinforcement learning. Yet, even though tasks in these domains typically involve distinct objects, most state-of-the-art generative models do not explicitly capture the compositional nature of visual scenes. Two recent exceptions, MONet and IODINE, decompose scenes into objects in an unsupervised fashion. Their unde… ▽ More

    Submitted 23 November, 2020; v1 submitted 30 July, 2019; originally announced July 2019.

    Comments: Published at the International Conference on Learning Representations (ICLR) 2020

  8. arXiv:1907.12887  [pdf, other

    cs.CV cs.AI cs.LG cs.RO stat.ML

    End-to-end Recurrent Multi-Object Tracking and Trajectory Prediction with Relational Reasoning

    Authors: Fabian B. Fuchs, Adam R. Kosiorek, Li Sun, Oiwi Parker Jones, Ingmar Posner

    Abstract: The majority of contemporary object-tracking approaches do not model interactions between objects. This contrasts with the fact that objects' paths are not independent: a cyclist might abruptly deviate from a previously planned trajectory in order to avoid colliding with a car. Building upon HART, a neural class-agnostic single-object tracker, we introduce a multi-object tracking method MOHART cap… ▽ More

    Submitted 28 September, 2020; v1 submitted 12 July, 2019; originally announced July 2019.

  9. arXiv:1906.06818  [pdf, other

    stat.ML cs.CV cs.LG cs.NE

    Stacked Capsule Autoencoders

    Authors: Adam R. Kosiorek, Sara Sabour, Yee Whye Teh, Geoffrey E. Hinton

    Abstract: Objects are composed of a set of geometrically organized parts. We introduce an unsupervised capsule autoencoder (SCAE), which explicitly uses geometric relationships between parts to reason about objects. Since these relationships do not depend on the viewpoint, our model is robust to viewpoint changes. SCAE consists of two stages. In the first stage, the model predicts presences and poses of par… ▽ More

    Submitted 2 December, 2019; v1 submitted 16 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2019; 14 pages, 7 figures, 4 tables, code is available at https://github.com/google-research/google-research/tree/master/stacked_capsule_autoencoders

  10. arXiv:1810.00825  [pdf, other

    cs.LG stat.ML

    Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks

    Authors: Juho Lee, Yoonho Lee, Jungtaek Kim, Adam R. Kosiorek, Seungjin Choi, Yee Whye Teh

    Abstract: Many machine learning tasks such as multiple instance learning, 3D shape recognition, and few-shot image classification are defined on sets of instances. Since solutions to such problems do not depend on the order of elements of the set, models used to address them should be permutation invariant. We present an attention-based neural network module, the Set Transformer, specifically designed to mo… ▽ More

    Submitted 26 May, 2019; v1 submitted 1 October, 2018; originally announced October 2018.

    Comments: ICML 2019

  11. arXiv:1806.05502  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    Scrutinizing and De-Biasing Intuitive Physics with Neural Stethoscopes

    Authors: Fabian B. Fuchs, Oliver Groth, Adam R. Kosiorek, Alex Bewley, Markus Wulfmeier, Andrea Vedaldi, Ingmar Posner

    Abstract: Visually predicting the stability of block towers is a popular task in the domain of intuitive physics. While previous work focusses on prediction accuracy, a one-dimensional performance measure, we provide a broader analysis of the learned physical understanding of the final model and how the learning process can be guided. To this end, we introduce neural stethoscopes as a general purpose framew… ▽ More

    Submitted 6 September, 2019; v1 submitted 14 June, 2018; originally announced June 2018.

  12. arXiv:1806.01794  [pdf, other

    cs.LG cs.CV stat.ML

    Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects

    Authors: Adam R. Kosiorek, Hyunjik Kim, Ingmar Posner, Yee Whye Teh

    Abstract: We present Sequential Attend, Infer, Repeat (SQAIR), an interpretable deep generative model for videos of moving objects. It can reliably discover and track objects throughout the sequence of frames, and can also generate future frames conditioning on the current frame, thereby simulating expected motion of objects. This is achieved by explicitly encoding object presence, locations and appearances… ▽ More

    Submitted 21 November, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

    Comments: 25 pages, 19 figures, NeurIPS 2018, code: https://github.com/akosiorek/sqair, video: https://youtu.be/-IUNQgSLE0c

  13. arXiv:1805.10469  [pdf, other

    stat.ML cs.LG

    Revisiting Reweighted Wake-Sleep for Models with Stochastic Control Flow

    Authors: Tuan Anh Le, Adam R. Kosiorek, N. Siddharth, Yee Whye Teh, Frank Wood

    Abstract: Stochastic control-flow models (SCFMs) are a class of generative models that involve branching on choices from discrete random variables. Amortized gradient-based learning of SCFMs is challenging as most approaches targeting discrete variables rely on their continuous relaxations---which can be intractable in SCFMs, as branching on relaxations requires evaluating all (exponentially many) branching… ▽ More

    Submitted 16 September, 2019; v1 submitted 26 May, 2018; originally announced May 2018.

    Comments: Tuan Anh Le and Adam R. Kosiorek contributed equally; accepted to Uncertainty in Artificial Intelligence 2019

  14. arXiv:1802.04537  [pdf, other

    stat.ML cs.LG

    Tighter Variational Bounds are Not Necessarily Better

    Authors: Tom Rainforth, Adam R. Kosiorek, Tuan Anh Le, Chris J. Maddison, Maximilian Igl, Frank Wood, Yee Whye Teh

    Abstract: We provide theoretical and empirical evidence that using tighter evidence lower bounds (ELBOs) can be detrimental to the process of learning an inference network by reducing the signal-to-noise ratio of the gradient estimator. Our results call into question common implicit assumptions that tighter ELBOs are better variational objectives for simultaneous model learning and inference amortization sc… ▽ More

    Submitted 5 March, 2019; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: To appear at ICML 2018

  15. arXiv:1706.09262  [pdf, other

    cs.CV cs.AI cs.NE

    Hierarchical Attentive Recurrent Tracking

    Authors: Adam R. Kosiorek, Alex Bewley, Ingmar Posner

    Abstract: Class-agnostic object tracking is particularly difficult in cluttered environments as target specific discriminative models cannot be learned a priori. Inspired by how the human visual cortex employs spatial attention and separate "where" and "what" processing pathways to actively suppress irrelevant visual features, this work develops a hierarchical attentive recurrent model for single object tra… ▽ More

    Submitted 5 September, 2017; v1 submitted 28 June, 2017; originally announced June 2017.

    Comments: Published as a conference paper at NIPS 2017. Code is available at https://github.com/akosiorek/hart and qualitative results are available at https://youtu.be/Vvkjm0FRGSs