Zum Hauptinhalt springen

Showing 1–25 of 25 results for author: Gregor, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.10179  [pdf, other

    cs.RO cs.AI cs.HC cs.LG

    Scaling Instructable Agents Across Many Simulated Worlds

    Authors: SIMA Team, Maria Abi Raad, Arun Ahuja, Catarina Barros, Frederic Besse, Andrew Bolt, Adrian Bolton, Bethanie Brownfield, Gavin Buttimore, Max Cant, Sarah Chakera, Stephanie C. Y. Chan, Jeff Clune, Adrian Collister, Vikki Copeman, Alex Cullum, Ishita Dasgupta, Dario de Cesare, Julia Di Trapani, Yani Donchev, Emma Dunleavy, Martin Engelcke, Ryan Faulkner, Frankie Garcia, Charles Gbadamosi , et al. (68 additional authors not shown)

    Abstract: Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructio… ▽ More

    Submitted 17 April, 2024; v1 submitted 13 March, 2024; originally announced April 2024.

  2. arXiv:2301.07608  [pdf, other

    cs.LG cs.AI cs.NE

    Human-Timescale Adaptation in an Open-Ended Task Space

    Authors: Adaptive Agent Team, Jakob Bauer, Kate Baumli, Satinder Baveja, Feryal Behbahani, Avishkar Bhoopchand, Nathalie Bradley-Schmieg, Michael Chang, Natalie Clay, Adrian Collister, Vibhavari Dasagi, Lucy Gonzalez, Karol Gregor, Edward Hughes, Sheleem Kashem, Maria Loks-Thompson, Hannah Openshaw, Jack Parker-Holder, Shreya Pathak, Nicolas Perez-Nieves, Nemanja Rakicevic, Tim Rocktäschel, Yannick Schroecker, Jakub Sygnowski, Karl Tuyls , et al. (3 additional authors not shown)

    Abstract: Foundation models have shown impressive adaptation and scalability in supervised and self-supervised learning problems, but so far these successes have not fully translated to reinforcement learning (RL). In this work, we demonstrate that training an RL agent at scale leads to a general in-context learning algorithm that can adapt to open-ended novel embodied 3D problems as quickly as humans. In a… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

  3. arXiv:2101.07627  [pdf, other

    cs.NE

    Self-Organizing Intelligent Matter: A blueprint for an AI generating algorithm

    Authors: Karol Gregor, Frederic Besse

    Abstract: We propose an artificial life framework aimed at facilitating the emergence of intelligent organisms. In this framework there is no explicit notion of an agent: instead there is an environment made of atomic elements. These elements contain neural operations and interact through exchanges of information and through physics-like rules contained in the environment. We discuss how an evolutionary pro… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

    Comments: 13 pages, 2 figures

  4. arXiv:2003.03124  [pdf, other

    cs.LG cs.NE stat.ML

    Finding online neural update rules by learning to remember

    Authors: Karol Gregor

    Abstract: We investigate learning of the online local update rules for neural activations (bodies) and weights (synapses) from scratch. We represent the states of each weight and activation by small vectors, and parameterize their updates using (meta-) neural networks. Different neuron types are represented by different embedding vectors which allows the same two functions to be used for all neurons. Instea… ▽ More

    Submitted 6 March, 2020; originally announced March 2020.

    Comments: 11 Pages, 1 figure

  5. arXiv:2002.02836  [pdf, other

    cs.LG cs.AI stat.ML

    Causally Correct Partial Models for Reinforcement Learning

    Authors: Danilo J. Rezende, Ivo Danihelka, George Papamakarios, Nan Rosemary Ke, Ray Jiang, Theophane Weber, Karol Gregor, Hamza Merzic, Fabio Viola, Jane Wang, Jovana Mitrovic, Frederic Besse, Ioannis Antonoglou, Lars Buesing

    Abstract: In reinforcement learning, we can learn a model of future observations and rewards, and use it to plan the agent's next actions. However, jointly modeling future observations can be computationally expensive or even intractable if the observations are high-dimensional (e.g. images). For this reason, previous works have considered partial models, which model only part of the observation. In this pa… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

  6. arXiv:1906.09237  [pdf, other

    cs.LG cs.AI stat.ML

    Shaping Belief States with Generative Environment Models for RL

    Authors: Karol Gregor, Danilo Jimenez Rezende, Frederic Besse, Yan Wu, Hamza Merzic, Aaron van den Oord

    Abstract: When agents interact with a complex environment, they must form and maintain beliefs about the relevant aspects of that environment. We propose a way to efficiently train expressive generative models in complex environments. We show that a predictive algorithm with an expressive generative model can form stable belief-states in visually rich and dynamic 3D environments. More precisely, we show tha… ▽ More

    Submitted 24 June, 2019; v1 submitted 21 June, 2019; originally announced June 2019.

    Comments: pre-print

  7. arXiv:1901.03559  [pdf, other

    cs.LG cs.AI stat.ML

    An investigation of model-free planning

    Authors: Arthur Guez, Mehdi Mirza, Karol Gregor, Rishabh Kabra, Sébastien Racanière, Théophane Weber, David Raposo, Adam Santoro, Laurent Orseau, Tom Eccles, Greg Wayne, David Silver, Timothy Lillicrap

    Abstract: The field of reinforcement learning (RL) is facing increasingly challenging domains with combinatorial complexity. For an RL agent to address these challenges, it is essential that it can plan effectively. Prior work has typically utilized an explicit model of the environment, combined with a specific planning algorithm (such as tree search). More recently, a new family of methods have been propos… ▽ More

    Submitted 20 May, 2019; v1 submitted 11 January, 2019; originally announced January 2019.

  8. arXiv:1811.09556  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Attractor Dynamics for Generative Memory

    Authors: Yan Wu, Greg Wayne, Karol Gregor, Timothy Lillicrap

    Abstract: A central challenge faced by memory systems is the robust retrieval of a stored pattern in the presence of interference due to other stored patterns and noise. A theoretically well-founded solution to robust retrieval is given by attractor dynamics, which iteratively clean up patterns during recall. However, incorporating attractor dynamics into modern deep learning systems poses difficulties: att… ▽ More

    Submitted 23 November, 2018; originally announced November 2018.

  9. arXiv:1806.03107  [pdf, other

    cs.LG stat.ML

    Temporal Difference Variational Auto-Encoder

    Authors: Karol Gregor, George Papamakarios, Frederic Besse, Lars Buesing, Theophane Weber

    Abstract: To act and plan in complex environments, we posit that agents should have a mental simulator of the world with three characteristics: (a) it should build an abstract state representing the condition of the world; (b) it should form a belief which represents uncertainty on the world; (c) it should go beyond simple step-by-step simulation, and exhibit temporal abstraction. Motivated by the absence o… ▽ More

    Submitted 2 January, 2019; v1 submitted 8 June, 2018; originally announced June 2018.

  10. arXiv:1802.03006  [pdf, other

    cs.LG

    Learning and Querying Fast Generative Models for Reinforcement Learning

    Authors: Lars Buesing, Theophane Weber, Sebastien Racaniere, S. M. Ali Eslami, Danilo Rezende, David P. Reichert, Fabio Viola, Frederic Besse, Karol Gregor, Demis Hassabis, Daan Wierstra

    Abstract: A key challenge in model-based reinforcement learning (RL) is to synthesize computationally efficient and accurate environment models. We show that carefully designed generative models that learn and operate on compact state representations, so-called state-space models, substantially reduce the computational costs for predicting outcomes of sequences of actions. Extensive experiments establish th… ▽ More

    Submitted 8 February, 2018; originally announced February 2018.

  11. arXiv:1611.07507  [pdf, other

    cs.LG cs.AI

    Variational Intrinsic Control

    Authors: Karol Gregor, Danilo Jimenez Rezende, Daan Wierstra

    Abstract: In this paper we introduce a new unsupervised reinforcement learning method for discovering the set of intrinsic options available to an agent. This set is learned by maximizing the number of different states an agent can reliably reach, as measured by the mutual information between the set of options and option termination states. To this end, we instantiate two policy gradient based algorithms,… ▽ More

    Submitted 22 November, 2016; originally announced November 2016.

    Comments: 15 pages, 6 figures

  12. arXiv:1606.01535  [pdf, other

    cs.CV

    What is the Best Feature Learning Procedure in Hierarchical Recognition Architectures?

    Authors: Kevin Jarrett, Koray Kvukcuoglu, Karol Gregor, Yann LeCun

    Abstract: (This paper was written in November 2011 and never published. It is posted on arXiv.org in its original form in June 2016). Many recent object recognition systems have proposed using a two phase training procedure to learn sparse convolutional feature hierarchies: unsupervised pre-training followed by supervised fine-tuning. Recent results suggest that these methods provide little improvement over… ▽ More

    Submitted 5 June, 2016; originally announced June 2016.

    Comments: 17 pages, 3 figures

  13. arXiv:1605.02226  [pdf, other

    cs.LG

    Neural Autoregressive Distribution Estimation

    Authors: Benigno Uria, Marc-Alexandre Côté, Karol Gregor, Iain Murray, Hugo Larochelle

    Abstract: We present Neural Autoregressive Distribution Estimation (NADE) models, which are neural network architectures applied to the problem of unsupervised distribution and density estimation. They leverage the probability product rule and a weight sharing scheme inspired from restricted Boltzmann machines, to yield an estimator that is both tractable and has good generalization performance. We discuss… ▽ More

    Submitted 27 May, 2016; v1 submitted 7 May, 2016; originally announced May 2016.

  14. arXiv:1604.08772  [pdf, other

    stat.ML cs.CV cs.LG

    Towards Conceptual Compression

    Authors: Karol Gregor, Frederic Besse, Danilo Jimenez Rezende, Ivo Danihelka, Daan Wierstra

    Abstract: We introduce a simple recurrent variational auto-encoder architecture that significantly improves image modeling. The system represents the state-of-the-art in latent variable models for both the ImageNet and Omniglot datasets. We show that it naturally separates global conceptual information from lower level details, thus addressing one of the fundamentally desired properties of unsupervised lear… ▽ More

    Submitted 29 April, 2016; originally announced April 2016.

    Comments: 14 pages, 13 figures

  15. arXiv:1603.05106  [pdf, other

    stat.ML cs.AI cs.LG

    One-Shot Generalization in Deep Generative Models

    Authors: Danilo Jimenez Rezende, Shakir Mohamed, Ivo Danihelka, Karol Gregor, Daan Wierstra

    Abstract: Humans have an impressive ability to reason about new concepts and experiences from just a single example. In particular, humans have an ability for one-shot generalization: an ability to encounter a new concept, understand its structure, and then be able to generate compelling alternative variations of the concept. We develop machine learning systems with this important capacity by developing new… ▽ More

    Submitted 25 May, 2016; v1 submitted 16 March, 2016; originally announced March 2016.

    Comments: 8pgs, 1pg references, 1pg appendix, In Proceedings of the 33rd International Conference on Machine Learning, JMLR: W&CP volume 48, 2016

  16. arXiv:1511.06440  [pdf, other

    cs.LG

    Towards Principled Unsupervised Learning

    Authors: Ilya Sutskever, Rafal Jozefowicz, Karol Gregor, Danilo Rezende, Tim Lillicrap, Oriol Vinyals

    Abstract: General unsupervised learning is a long-standing conceptual problem in machine learning. Supervised learning is successful because it can be solved by the minimization of the training error cost function. Unsupervised learning is not as successful, because the unsupervised objective may be unrelated to the supervised task of interest. For an example, density modelling and reconstruction have often… ▽ More

    Submitted 3 December, 2015; v1 submitted 19 November, 2015; originally announced November 2015.

  17. arXiv:1502.04623  [pdf, other

    cs.CV cs.LG cs.NE

    DRAW: A Recurrent Neural Network For Image Generation

    Authors: Karol Gregor, Ivo Danihelka, Alex Graves, Danilo Jimenez Rezende, Daan Wierstra

    Abstract: This paper introduces the Deep Recurrent Attentive Writer (DRAW) neural network architecture for image generation. DRAW networks combine a novel spatial attention mechanism that mimics the foveation of the human eye, with a sequential variational auto-encoding framework that allows for the iterative construction of complex images. The system substantially improves on the state of the art for gener… ▽ More

    Submitted 20 May, 2015; v1 submitted 16 February, 2015; originally announced February 2015.

  18. arXiv:1502.03509  [pdf, other

    cs.LG cs.NE stat.ML

    MADE: Masked Autoencoder for Distribution Estimation

    Authors: Mathieu Germain, Karol Gregor, Iain Murray, Hugo Larochelle

    Abstract: There has been a lot of recent interest in designing neural network models to estimate a distribution from a set of examples. We introduce a simple modification for autoencoder neural networks that yields powerful generative models. Our method masks the autoencoder's parameters to respect autoregressive constraints: each input is reconstructed only from previous inputs in a given ordering. Constra… ▽ More

    Submitted 5 June, 2015; v1 submitted 11 February, 2015; originally announced February 2015.

    Comments: 9 pages and 1 page of supplementary material. Updated to match published version

    Journal ref: Proceedings of the 32nd International Conference on Machine Learning, JMLR W&CP 37:881-889, 2015

  19. arXiv:1402.0030  [pdf, ps, other

    cs.LG stat.ML

    Neural Variational Inference and Learning in Belief Networks

    Authors: Andriy Mnih, Karol Gregor

    Abstract: Highly expressive directed latent variable models, such as sigmoid belief networks, are difficult to train on large datasets because exact inference in them is intractable and none of the approximate inference methods that have been applied to them scale well. We propose a fast non-iterative approximate inference method that uses a feedforward network to implement efficient exact sampling from the… ▽ More

    Submitted 4 June, 2014; v1 submitted 31 January, 2014; originally announced February 2014.

    Journal ref: Proceedings of the 31st International Conference on Machine Learning (ICML), JMLR: W&CP volume 32, 2014 pgs 1791-1799

  20. arXiv:1310.8499  [pdf, other

    cs.LG stat.ML

    Deep AutoRegressive Networks

    Authors: Karol Gregor, Ivo Danihelka, Andriy Mnih, Charles Blundell, Daan Wierstra

    Abstract: We introduce a deep, generative autoencoder capable of learning hierarchies of distributed representations from data. Successive deep stochastic hidden layers are equipped with autoregressive connections, which enable the model to be sampled from quickly and exactly via ancestral sampling. We derive an efficient approximate parameter estimation method based on the minimum description length (MDL)… ▽ More

    Submitted 20 May, 2014; v1 submitted 31 October, 2013; originally announced October 2013.

    Comments: Appears in Proceedings of the 31st International Conference on Machine Learning (ICML), Beijing, China, 2014

    Journal ref: Karol Gregor, Ivo Danihelka, Andriy Mnih, Charles Blundell, Daan Wierstra. Deep AutoRegressive Networks. In Proceedings of the 31st International Conference on Machine Learning (ICML), JMLR: W&CP volume 32, 2014

  21. arXiv:1202.6384  [pdf, ps, other

    cs.CV

    Fast approximations to structured sparse coding and applications to object classification

    Authors: Arthur Szlam, Karol Gregor, Yann LeCun

    Abstract: We describe a method for fast approximation of sparse coding. The input space is subdivided by a binary decision tree, and we simultaneously learn a dictionary and assignment of allowed dictionary elements for each leaf of the tree. We store a lookup table with the assignments and the pseudoinverses for each node, allowing for very fast inference. We give an algorithm for learning the tree, the di… ▽ More

    Submitted 28 February, 2012; originally announced February 2012.

  22. arXiv:1108.1169  [pdf, ps, other

    cs.CV

    Learning Representations by Maximizing Compression

    Authors: Karol Gregor, Yann LeCun

    Abstract: We give an algorithm that learns a representation of data through compression. The algorithm 1) predicts bits sequentially from those previously seen and 2) has a structure and a number of computations similar to an autoencoder. The likelihood under the model can be calculated exactly, and arithmetic coding can be used directly for compression. When training on digits the algorithm learns filters… ▽ More

    Submitted 4 August, 2011; originally announced August 2011.

    Comments: 8 pages, 3 figures

  23. arXiv:1105.5307  [pdf, ps, other

    cs.CV cs.NE

    Efficient Learning of Sparse Invariant Representations

    Authors: Karol Gregor, Yann LeCun

    Abstract: We propose a simple and efficient algorithm for learning sparse invariant representations from unlabeled data with fast inference. When trained on short movies sequences, the learned features are selective to a range of orientations and spatial frequencies, but robust to a wide range of positions, similar to complex cells in the primary visual cortex. We give a hierarchical version of the algorith… ▽ More

    Submitted 26 May, 2011; originally announced May 2011.

    Comments: 9 pages + 6 supplement pages

  24. arXiv:1006.0448  [pdf, ps, other

    cs.NE

    Emergence of Complex-Like Cells in a Temporal Product Network with Local Receptive Fields

    Authors: Karo Gregor, Yann LeCun

    Abstract: We introduce a new neural architecture and an unsupervised algorithm for learning invariant representations from temporal sequence of images. The system uses two groups of complex cells whose outputs are combined multiplicatively: one that represents the content of the image, constrained to be constant over several consecutive frames, and one that represents the precise location of features, which… ▽ More

    Submitted 2 June, 2010; originally announced June 2010.

  25. arXiv:0912.0717  [pdf, ps, other

    cs.NE cs.CV

    Behavior and performance of the deep belief networks on image classification

    Authors: Karol Gregor, Gregory Griffin

    Abstract: We apply deep belief networks of restricted Boltzmann machines to bags of words of sift features obtained from databases of 13 Scenes, 15 Scenes and Caltech 256 and study experimentally their behavior and performance. We find that the final performance in the supervised phase is reached much faster if the system is pre-trained. Pre-training the system on a larger dataset keeping the supervised d… ▽ More

    Submitted 3 December, 2009; originally announced December 2009.

    Comments: 8 pages, 9 figures