Zum Hauptinhalt springen

Showing 1–41 of 41 results for author: Blundell, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.04229  [pdf, other

    cs.LG cs.AI cs.CL cs.DS stat.ML

    The CLRS-Text Algorithmic Reasoning Language Benchmark

    Authors: Larisa Markeeva, Sean McLeish, Borja Ibarz, Wilfried Bounsi, Olga Kozlova, Alex Vitvitskyi, Charles Blundell, Tom Goldstein, Avi Schwarzschild, Petar Veličković

    Abstract: Eliciting reasoning capabilities from language models (LMs) is a critical direction on the path towards building intelligent systems. Most recent studies dedicated to reasoning focus on out-of-distribution performance on procedurally-generated synthetic benchmarks, bespoke-built to evaluate specific skills only. This trend makes results hard to transfer across publications, slowing down progress.… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Preprint, under review. Comments welcome

  2. arXiv:2305.01521  [pdf, other

    cs.LG stat.ML

    Unlocking the Power of Representations in Long-term Novelty-based Exploration

    Authors: Alaa Saade, Steven Kapturowski, Daniele Calandriello, Charles Blundell, Pablo Sprechmann, Leopoldo Sarra, Oliver Groth, Michal Valko, Bilal Piot

    Abstract: We introduce Robust Exploration via Clustering-based Online Density Estimation (RECODE), a non-parametric method for novelty-based exploration that estimates visitation counts for clusters of states based on their similarity in a chosen embedding space. By adapting classical clustering to the nonstationary setting of Deep RL, RECODE can efficiently track state visitation counts over thousands of e… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  3. arXiv:2302.10258  [pdf, other

    cs.LG cs.AI stat.ME

    Neural Algorithmic Reasoning with Causal Regularisation

    Authors: Beatrice Bevilacqua, Kyriacos Nikiforou, Borja Ibarz, Ioana Bica, Michela Paganini, Charles Blundell, Jovana Mitrovic, Petar Veličković

    Abstract: Recent work on neural algorithmic reasoning has investigated the reasoning capabilities of neural networks, effectively demonstrating they can learn to execute classical algorithms on unseen data coming from the train distribution. However, the performance of existing neural reasoners significantly degrades on out-of-distribution (OOD) test data, where inputs have larger sizes. In this work, we ma… ▽ More

    Submitted 3 July, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: ICML 2023, Camera Ready; 17 pages, 7 figures

  4. arXiv:2209.11142  [pdf, other

    cs.LG cs.AI stat.ML

    A Generalist Neural Algorithmic Learner

    Authors: Borja Ibarz, Vitaly Kurin, George Papamakarios, Kyriacos Nikiforou, Mehdi Bennani, Róbert Csordás, Andrew Dudzik, Matko Bošnjak, Alex Vitvitskyi, Yulia Rubanova, Andreea Deac, Beatrice Bevilacqua, Yaroslav Ganin, Charles Blundell, Petar Veličković

    Abstract: The cornerstone of neural algorithmic reasoning is the ability to solve algorithmic tasks, especially in a way that generalises out of distribution. While recent years have seen a surge in methodological improvements in this area, they mostly focused on building specialist models. Specialist models are capable of learning to neurally execute either only one algorithm or a collection of algorithms… ▽ More

    Submitted 3 December, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: To appear at LoG 2022 (Spotlight talk). 23 pages, 11 figures

  5. arXiv:2205.15659  [pdf, other

    cs.LG cs.DS stat.ML

    The CLRS Algorithmic Reasoning Benchmark

    Authors: Petar Veličković, Adrià Puigdomènech Badia, David Budden, Razvan Pascanu, Andrea Banino, Misha Dashevskiy, Raia Hadsell, Charles Blundell

    Abstract: Learning representations of algorithms is an emerging area of machine learning, seeking to bridge concepts from neural networks with classical algorithms. Several important works have investigated whether neural networks can effectively reason like algorithms, typically by learning to execute them. The common trend in the area, however, is to generate targeted kinds of algorithmic data to evaluate… ▽ More

    Submitted 4 June, 2022; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: To appear in ICML 2022. 19 pages, 4 figures

  6. arXiv:2201.05119  [pdf, other

    cs.CV cs.LG stat.ML

    Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet?

    Authors: Nenad Tomasev, Ioana Bica, Brian McWilliams, Lars Buesing, Razvan Pascanu, Charles Blundell, Jovana Mitrovic

    Abstract: Despite recent progress made by self-supervised methods in representation learning with residual networks, they still underperform supervised learning on the ImageNet classification benchmark, limiting their applicability in performance-critical settings. Building on prior theoretical insights from ReLIC [Mitrovic et al., 2021], we include additional inductive biases into self-supervised learning.… ▽ More

    Submitted 3 November, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

  7. arXiv:2111.08696  [pdf, other

    physics.comp-ph cond-mat.stat-mech stat.ML

    Normalizing flows for atomic solids

    Authors: Peter Wirnsberger, George Papamakarios, Borja Ibarz, Sébastien Racanière, Andrew J. Ballard, Alexander Pritzel, Charles Blundell

    Abstract: We present a machine-learning approach, based on normalizing flows, for modelling atomic solids. Our model transforms an analytically tractable base distribution into the target solid without requiring ground-truth samples for training. We report Helmholtz free energy estimates for cubic and hexagonal ice modelled as monatomic water as well as for a truncated and shifted Lennard-Jones system, and… ▽ More

    Submitted 28 April, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: 20 pages, 7 figures

  8. arXiv:2107.08881  [pdf, other

    cs.LG cs.AI stat.ML

    Reasoning-Modulated Representations

    Authors: Petar Veličković, Matko Bošnjak, Thomas Kipf, Alexander Lerchner, Raia Hadsell, Razvan Pascanu, Charles Blundell

    Abstract: Neural networks leverage robust internal representations in order to generalise. Learning them is difficult, and often requires a large training set that covers the data distribution densely. We study a common setting where our task is not purely opaque. Indeed, very often we may have access to information about the underlying system (e.g. that observations must obey certain laws of physics) that… ▽ More

    Submitted 3 December, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: To appear at LoG 2022. 17 pages, 5 figures

  9. arXiv:2106.11779  [pdf, other

    cs.LG stat.ML

    Emphatic Algorithms for Deep Reinforcement Learning

    Authors: Ray Jiang, Tom Zahavy, Zhongwen Xu, Adam White, Matteo Hessel, Charles Blundell, Hado van Hasselt

    Abstract: Off-policy learning allows us to learn about possible policies of behavior from experience generated by a different behavior policy. Temporal difference (TD) learning algorithms can become unstable when combined with function approximation and off-policy sampling - this is known as the ''deadly triad''. Emphatic temporal difference (ETD($λ$)) algorithm ensures convergence in the linear case by app… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021

  10. arXiv:2105.02761  [pdf, other

    cs.LG cs.AI cs.DS math.OC stat.ML

    Neural Algorithmic Reasoning

    Authors: Petar Veličković, Charles Blundell

    Abstract: Algorithms have been fundamental to recent global technological advances and, in particular, they have been the cornerstone of technical advances in one field rapidly being applied to another. We argue that algorithms possess fundamentally different qualities to deep learning methods, and this strongly suggests that, were deep learning methods better able to mimic algorithms, generalisation of the… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: Accepted as an Opinion paper in Patterns. 7 pages, 1 figure

  11. arXiv:2103.01937  [pdf, other

    cs.AI cs.LG stat.ML

    Neural Production Systems: Learning Rule-Governed Visual Dynamics

    Authors: Anirudh Goyal, Aniket Didolkar, Nan Rosemary Ke, Charles Blundell, Philippe Beaudoin, Nicolas Heess, Michael Mozer, Yoshua Bengio

    Abstract: Visual environments are structured, consisting of distinct objects or entities. These entities have properties -- both visible and latent -- that determine the manner in which they interact with one another. To partition images into entities, deep-learning researchers have proposed structural inductive biases such as slot-based architectures. To model interactions among entities, equivariant graph… ▽ More

    Submitted 23 March, 2022; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: NeurIPS'21

  12. arXiv:2103.01197  [pdf, other

    cs.LG cs.AI stat.ML

    Coordination Among Neural Modules Through a Shared Global Workspace

    Authors: Anirudh Goyal, Aniket Didolkar, Alex Lamb, Kartikeya Badola, Nan Rosemary Ke, Nasim Rahaman, Jonathan Binas, Charles Blundell, Michael Mozer, Yoshua Bengio

    Abstract: Deep learning has seen a movement away from representing examples with a monolithic hidden state towards a richly structured state. For example, Transformers segment by position, and object-centric architectures decompose images into entities. In all these architectures, interactions between different elements are modeled via pairwise interactions: Transformers make use of self-attention to incorp… ▽ More

    Submitted 22 March, 2022; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: ICLR'22 accepted paper

  13. arXiv:2103.01043  [pdf, other

    cs.LG cs.AI cs.SI stat.ML

    Persistent Message Passing

    Authors: Heiko Strathmann, Mohammadamin Barekatain, Charles Blundell, Petar Veličković

    Abstract: Graph neural networks (GNNs) are a powerful inductive bias for modelling algorithmic reasoning procedures and data structures. Their prowess was mainly demonstrated on tasks featuring Markovian dynamics, where querying any associated data structure depends only on its latest state. For many tasks of interest, however, it may be highly beneficial to support efficient data structure queries dependen… ▽ More

    Submitted 27 April, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: 7 pages, 2 figures. Published as a workshop paper at ICLR 2021 SimDL Workshop. Accepted at the ICLR 2021 Workshop on Geometrical and Topological Representation Learning

  14. arXiv:2102.13515  [pdf, other

    cs.LG cs.AI stat.ML

    Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning

    Authors: Víctor Campos, Pablo Sprechmann, Steven Hansen, Andre Barreto, Steven Kapturowski, Alex Vitvitskyi, Adrià Puigdomènech Badia, Charles Blundell

    Abstract: Designing agents that acquire knowledge autonomously and use it to solve new tasks efficiently is an important challenge in reinforcement learning. Knowledge acquired during an unsupervised pre-training phase is often transferred by fine-tuning neural network weights once rewards are exposed, as is common practice in supervised domains. Given the nature of the reinforcement learning problem, we ar… ▽ More

    Submitted 8 June, 2021; v1 submitted 24 February, 2021; originally announced February 2021.

  15. arXiv:2010.07922  [pdf, other

    cs.LG cs.CV stat.ML

    Representation Learning via Invariant Causal Mechanisms

    Authors: Jovana Mitrovic, Brian McWilliams, Jacob Walker, Lars Buesing, Charles Blundell

    Abstract: Self-supervised learning has emerged as a strategy to reduce the reliance on costly supervised signal by pretraining representations only using unlabeled data. These methods combine heuristic proxy classification tasks with data augmentations and have achieved significant success, but our theoretical understanding of this success remains limited. In this paper we analyze self-supervised representa… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

  16. arXiv:2006.16225  [pdf, other

    cs.LG stat.ML

    Object Files and Schemata: Factorizing Declarative and Procedural Knowledge in Dynamical Systems

    Authors: Anirudh Goyal, Alex Lamb, Phanideep Gampa, Philippe Beaudoin, Sergey Levine, Charles Blundell, Yoshua Bengio, Michael Mozer

    Abstract: Modeling a structured, dynamic environment like a video game requires keeping track of the objects and their states declarative knowledge) as well as predicting how objects behave (procedural knowledge). Black-box models with a monolithic hidden state often fail to apply procedural knowledge consistently and uniformly, i.e., they lack systematicity. For example, in a video game, correct prediction… ▽ More

    Submitted 12 November, 2020; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: Type/Token Distinction in Deep learning Framework

  17. arXiv:2006.06380  [pdf, other

    stat.ML cs.DS cs.LG

    Pointer Graph Networks

    Authors: Petar Veličković, Lars Buesing, Matthew C. Overlan, Razvan Pascanu, Oriol Vinyals, Charles Blundell

    Abstract: Graph neural networks (GNNs) are typically applied to static graphs that are assumed to be known upfront. This static input structure is often informed purely by insight of the machine learning practitioner, and might not be optimal for the actual task the GNN is solving. In absence of reliable domain expertise, one might resort to inferring the latent graph structure, which is often difficult due… ▽ More

    Submitted 18 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: To appear at NeurIPS 2020 (Spotlight talk)

  18. arXiv:2003.13350  [pdf, other

    cs.LG stat.ML

    Agent57: Outperforming the Atari Human Benchmark

    Authors: Adrià Puigdomènech Badia, Bilal Piot, Steven Kapturowski, Pablo Sprechmann, Alex Vitvitskyi, Daniel Guo, Charles Blundell

    Abstract: Atari games have been a long-standing benchmark in the reinforcement learning (RL) community for the past decade. This benchmark was proposed to test general competency of RL algorithms. Previous work has achieved good average performance by doing outstandingly well on many games of the set, but very poorly in several of the most challenging games. We propose Agent57, the first deep RL agent that… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

  19. arXiv:2002.06038  [pdf, other

    cs.LG stat.ML

    Never Give Up: Learning Directed Exploration Strategies

    Authors: Adrià Puigdomènech Badia, Pablo Sprechmann, Alex Vitvitskyi, Daniel Guo, Bilal Piot, Steven Kapturowski, Olivier Tieleman, Martín Arjovsky, Alexander Pritzel, Andew Bolt, Charles Blundell

    Abstract: We propose a reinforcement learning agent to solve hard exploration games by learning a range of directed exploratory policies. We construct an episodic memory-based intrinsic reward using k-nearest neighbors over the agent's recent experience to train the directed exploratory policies, thereby encouraging the agent to repeatedly revisit all states in its environment. A self-supervised inverse dyn… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: Published as a conference paper in ICLR 2020

  20. arXiv:2002.04913  [pdf, other

    physics.comp-ph physics.chem-ph stat.ML

    Targeted free energy estimation via learned mappings

    Authors: Peter Wirnsberger, Andrew J. Ballard, George Papamakarios, Stuart Abercrombie, Sébastien Racanière, Alexander Pritzel, Danilo Jimenez Rezende, Charles Blundell

    Abstract: Free energy perturbation (FEP) was proposed by Zwanzig more than six decades ago as a method to estimate free energy differences, and has since inspired a huge body of related methods that use it as an integral building block. Being an importance sampling based estimator, however, FEP suffers from a severe limitation: the requirement of sufficient overlap between distributions. One strategy to mit… ▽ More

    Submitted 18 August, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Added figure 3, added data augmentation for octahedral symmetries, updated experimental results and revised text (11 pages, 6 figures)

  21. arXiv:1910.13406  [pdf, other

    cs.LG cs.AI stat.ML

    Generalization of Reinforcement Learners with Working and Episodic Memory

    Authors: Meire Fortunato, Melissa Tan, Ryan Faulkner, Steven Hansen, Adrià Puigdomènech Badia, Gavin Buttimore, Charlie Deck, Joel Z Leibo, Charles Blundell

    Abstract: Memory is an important aspect of intelligence and plays a role in many deep reinforcement learning models. However, little progress has been made in understanding when specific memory systems help more than others and how well they generalize. The field also has yet to see a prevalent consistent and rigorous approach for evaluating agent performance on holdout data. In this paper, we aim to develo… ▽ More

    Submitted 18 February, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019. Equal contribution of first 4 authors

    Journal ref: 33rd Conference on Neural Information Processing Systems (Neurips 2019)

  22. arXiv:1910.10593  [pdf, other

    stat.ML cs.AI cs.DS cs.LG

    Neural Execution of Graph Algorithms

    Authors: Petar Veličković, Rex Ying, Matilde Padovano, Raia Hadsell, Charles Blundell

    Abstract: Graph Neural Networks (GNNs) are a powerful representational tool for solving problems on graph-structured inputs. In almost all cases so far, however, they have been applied to directly recovering a final solution from raw inputs, without explicit guidance on how to structure their problem-solving. Here, instead, we focus on learning in the space of algorithms: we train several state-of-the-art G… ▽ More

    Submitted 15 January, 2020; v1 submitted 23 October, 2019; originally announced October 2019.

    Comments: To appear at ICLR 2020. 13 pages, 4 figures

  23. arXiv:1805.09692  [pdf, other

    stat.ML cs.AI cs.LG cs.NE

    Been There, Done That: Meta-Learning with Episodic Recall

    Authors: Samuel Ritter, Jane X. Wang, Zeb Kurth-Nelson, Siddhant M. Jayakumar, Charles Blundell, Razvan Pascanu, Matthew Botvinick

    Abstract: Meta-learning agents excel at rapidly learning new tasks from open-ended task distributions; yet, they forget what they learn about each task as soon as the next begins. When tasks reoccur - as they do in natural environments - metalearning agents must explore again instead of immediately exploiting previously discovered solutions. We propose a formalism for generating open-ended yet repetitious e… ▽ More

    Submitted 6 July, 2018; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: ICML 2018

  24. arXiv:1805.09208  [pdf, other

    stat.ML cs.CL cs.LG

    Pushing the bounds of dropout

    Authors: Gábor Melis, Charles Blundell, Tomáš Kočiský, Karl Moritz Hermann, Chris Dyer, Phil Blunsom

    Abstract: We show that dropout training is best understood as performing MAP estimation concurrently for a family of conditional models whose objectives are themselves lower bounded by the original dropout objective. This discovery allows us to pick any model from this family after training, which leads to a substantial improvement on regularisation-heavy language modelling. The family includes models that… ▽ More

    Submitted 27 September, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

  25. arXiv:1802.10542  [pdf, other

    stat.ML cs.LG

    Memory-based Parameter Adaptation

    Authors: Pablo Sprechmann, Siddhant M. Jayakumar, Jack W. Rae, Alexander Pritzel, Adrià Puigdomènech Badia, Benigno Uria, Oriol Vinyals, Demis Hassabis, Razvan Pascanu, Charles Blundell

    Abstract: Deep neural networks have excelled on a wide range of problems, from vision to language and game playing. Neural networks very gradually incorporate information into weights as they process data, requiring very low learning rates. If the training distribution shifts, the network is slow to adapt, and when it does adapt, it typically performs badly on the training distribution before the shift. Our… ▽ More

    Submitted 28 February, 2018; originally announced February 2018.

    Comments: Published as a conference paper at ICLR 2018

  26. arXiv:1707.08475  [pdf, other

    stat.ML cs.AI cs.LG

    DARLA: Improving Zero-Shot Transfer in Reinforcement Learning

    Authors: Irina Higgins, Arka Pal, Andrei A. Rusu, Loic Matthey, Christopher P Burgess, Alexander Pritzel, Matthew Botvinick, Charles Blundell, Alexander Lerchner

    Abstract: Domain adaptation is an important open problem in deep reinforcement learning (RL). In many scenarios of interest data is hard to obtain, so agents may learn a source policy in a setting where data is readily available, with the hope that it generalises well to the target domain. We propose a new multi-stage RL agent, DARLA (DisentAngled Representation Learning Agent), which learns to see before l… ▽ More

    Submitted 6 June, 2018; v1 submitted 26 July, 2017; originally announced July 2017.

    Comments: ICML 2017

  27. arXiv:1706.10295  [pdf, other

    cs.LG stat.ML

    Noisy Networks for Exploration

    Authors: Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex Graves, Vlad Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg

    Abstract: We introduce NoisyNet, a deep reinforcement learning agent with parametric noise added to its weights, and show that the induced stochasticity of the agent's policy can be used to aid efficient exploration. The parameters of the noise are learned with gradient descent along with the remaining network weights. NoisyNet is straightforward to implement and adds little computational overhead. We find… ▽ More

    Submitted 9 July, 2019; v1 submitted 30 June, 2017; originally announced June 2017.

    Comments: ICLR 2018

  28. arXiv:1704.02798  [pdf, other

    cs.LG stat.ML

    Bayesian Recurrent Neural Networks

    Authors: Meire Fortunato, Charles Blundell, Oriol Vinyals

    Abstract: In this work we explore a straightforward variational Bayes scheme for Recurrent Neural Networks. Firstly, we show that a simple adaptation of truncated backpropagation through time can yield good quality uncertainty estimates and superior regularisation at only a small extra computational cost during training, also reducing the amount of parameters by 80\%. Secondly, we demonstrate how a novel ki… ▽ More

    Submitted 9 May, 2019; v1 submitted 10 April, 2017; originally announced April 2017.

    Comments: 12th Women in Machine Learning Workshop (WiML 2017), co-located with the 31st Conference on Neural Information Processing Systems (NeurIPS 2017), Long Beach, CA, USA

  29. arXiv:1703.01988  [pdf, other

    cs.LG stat.ML

    Neural Episodic Control

    Authors: Alexander Pritzel, Benigno Uria, Sriram Srinivasan, Adrià Puigdomènech, Oriol Vinyals, Demis Hassabis, Daan Wierstra, Charles Blundell

    Abstract: Deep reinforcement learning methods attain super-human performance in a wide range of environments. Such methods are grossly inefficient, often taking orders of magnitudes more data than humans to achieve reasonable performance. We propose Neural Episodic Control: a deep reinforcement learning agent that is able to rapidly assimilate new experiences and act upon them. Our agent uses a semi-tabular… ▽ More

    Submitted 6 March, 2017; originally announced March 2017.

  30. arXiv:1612.01474  [pdf, other

    stat.ML cs.LG

    Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles

    Authors: Balaji Lakshminarayanan, Alexander Pritzel, Charles Blundell

    Abstract: Deep neural networks (NNs) are powerful black box predictors that have recently achieved impressive performance on a wide spectrum of tasks. Quantifying predictive uncertainty in NNs is a challenging and yet unsolved problem. Bayesian NNs, which learn a distribution over weights, are currently the state-of-the-art for estimating predictive uncertainty; however these require significant modificatio… ▽ More

    Submitted 3 November, 2017; v1 submitted 5 December, 2016; originally announced December 2016.

    Comments: NIPS 2017

  31. arXiv:1611.05763  [pdf, other

    cs.LG cs.AI stat.ML

    Learning to reinforcement learn

    Authors: Jane X Wang, Zeb Kurth-Nelson, Dhruva Tirumala, Hubert Soyer, Joel Z Leibo, Remi Munos, Charles Blundell, Dharshan Kumaran, Matt Botvinick

    Abstract: In recent years deep reinforcement learning (RL) systems have attained superhuman performance in a number of challenging task domains. However, a major limitation of such applications is their demand for massive amounts of training data. A critical present objective is thus to develop deep RL methods that can adapt rapidly to new tasks. In the present work we introduce a novel approach to this cha… ▽ More

    Submitted 23 January, 2017; v1 submitted 17 November, 2016; originally announced November 2016.

    Comments: 17 pages, 7 figures, 1 table

  32. arXiv:1606.05579  [pdf, other

    stat.ML cs.LG q-bio.NC

    Early Visual Concept Learning with Unsupervised Deep Learning

    Authors: Irina Higgins, Loic Matthey, Xavier Glorot, Arka Pal, Benigno Uria, Charles Blundell, Shakir Mohamed, Alexander Lerchner

    Abstract: Automated discovery of early visual concepts from raw image data is a major open challenge in AI research. Addressing this problem, we propose an unsupervised approach for learning disentangled representations of the underlying factors of variation. We draw inspiration from neuroscience, and show how this can be achieved in an unsupervised generative model by applying the same learning pressures a… ▽ More

    Submitted 20 September, 2016; v1 submitted 17 June, 2016; originally announced June 2016.

  33. arXiv:1606.04460  [pdf, other

    stat.ML cs.LG q-bio.NC

    Model-Free Episodic Control

    Authors: Charles Blundell, Benigno Uria, Alexander Pritzel, Yazhe Li, Avraham Ruderman, Joel Z Leibo, Jack Rae, Daan Wierstra, Demis Hassabis

    Abstract: State of the art deep reinforcement learning algorithms take many millions of interactions to attain human-level performance. Humans, on the other hand, can very quickly exploit highly rewarding nuances of an environment upon first discovery. In the brain, such rapid learning is thought to depend on the hippocampus and its capacity for episodic memory. Here we investigate whether a simple model of… ▽ More

    Submitted 14 June, 2016; originally announced June 2016.

  34. arXiv:1606.04080  [pdf, other

    cs.LG stat.ML

    Matching Networks for One Shot Learning

    Authors: Oriol Vinyals, Charles Blundell, Timothy Lillicrap, Koray Kavukcuoglu, Daan Wierstra

    Abstract: Learning from a few examples remains a key challenge in machine learning. Despite recent advances in important domains such as vision and language, the standard supervised deep learning paradigm does not offer a satisfactory solution for learning new concepts rapidly from little data. In this work, we employ ideas from metric learning based on deep neural features and from recent advances that aug… ▽ More

    Submitted 29 December, 2017; v1 submitted 13 June, 2016; originally announced June 2016.

  35. arXiv:1602.04621  [pdf, other

    cs.LG cs.AI eess.SY stat.ML

    Deep Exploration via Bootstrapped DQN

    Authors: Ian Osband, Charles Blundell, Alexander Pritzel, Benjamin Van Roy

    Abstract: Efficient exploration in complex environments remains a major challenge for reinforcement learning. We propose bootstrapped DQN, a simple algorithm that explores in a computationally and statistically efficient manner through use of randomized value functions. Unlike dithering strategies such as epsilon-greedy exploration, bootstrapped DQN carries out temporally-extended (or deep) exploration; thi… ▽ More

    Submitted 4 July, 2016; v1 submitted 15 February, 2016; originally announced February 2016.

  36. arXiv:1512.09327  [pdf, other

    cs.LG stat.ML

    Distributed Bayesian Learning with Stochastic Natural-gradient Expectation Propagation and the Posterior Server

    Authors: Leonard Hasenclever, Stefan Webb, Thibaut Lienart, Sebastian Vollmer, Balaji Lakshminarayanan, Charles Blundell, Yee Whye Teh

    Abstract: This paper makes two contributions to Bayesian machine learning algorithms. Firstly, we propose stochastic natural gradient expectation propagation (SNEP), a novel alternative to expectation propagation (EP), a popular variational inference algorithm. SNEP is a black box variational algorithm, in that it does not require any simplifying assumptions on the distribution of interest, beyond the exist… ▽ More

    Submitted 7 September, 2017; v1 submitted 31 December, 2015; originally announced December 2015.

    Comments: 37 pages, 7 figures

    Journal ref: Journal of Machine Learning Research 18 (2017) 1-37

  37. arXiv:1505.05424  [pdf, other

    stat.ML cs.LG

    Weight Uncertainty in Neural Networks

    Authors: Charles Blundell, Julien Cornebise, Koray Kavukcuoglu, Daan Wierstra

    Abstract: We introduce a new, efficient, principled and backpropagation-compatible algorithm for learning a probability distribution on the weights of a neural network, called Bayes by Backprop. It regularises the weights by minimising a compression cost, known as the variational free energy or the expected lower bound on the marginal likelihood. We show that this principled kind of regularisation yields co… ▽ More

    Submitted 21 May, 2015; v1 submitted 20 May, 2015; originally announced May 2015.

    Comments: In Proceedings of the 32nd International Conference on Machine Learning (ICML 2015)

  38. arXiv:1411.2674  [pdf, other

    stat.ML cs.CL cs.LG cs.SI

    The Bayesian Echo Chamber: Modeling Social Influence via Linguistic Accommodation

    Authors: Fangjian Guo, Charles Blundell, Hanna Wallach, Katherine Heller

    Abstract: We present the Bayesian Echo Chamber, a new Bayesian generative model for social interaction data. By modeling the evolution of people's language usage over time, this model discovers latent influence relationships between them. Unlike previous work on inferring influence, which has primarily focused on simple temporal dynamics evidenced via turn-taking behavior, our model captures more nuanced in… ▽ More

    Submitted 27 January, 2015; v1 submitted 10 November, 2014; originally announced November 2014.

    Comments: 14 pages, 7 figures, to appear in AISTATS 2015. Fixed minor formatting issues

  39. arXiv:1310.8499  [pdf, other

    cs.LG stat.ML

    Deep AutoRegressive Networks

    Authors: Karol Gregor, Ivo Danihelka, Andriy Mnih, Charles Blundell, Daan Wierstra

    Abstract: We introduce a deep, generative autoencoder capable of learning hierarchies of distributed representations from data. Successive deep stochastic hidden layers are equipped with autoregressive connections, which enable the model to be sampled from quickly and exactly via ancestral sampling. We derive an efficient approximate parameter estimation method based on the minimum description length (MDL)… ▽ More

    Submitted 20 May, 2014; v1 submitted 31 October, 2013; originally announced October 2013.

    Comments: Appears in Proceedings of the 31st International Conference on Machine Learning (ICML), Beijing, China, 2014

    Journal ref: Karol Gregor, Ivo Danihelka, Andriy Mnih, Charles Blundell, Daan Wierstra. Deep AutoRegressive Networks. In Proceedings of the 31st International Conference on Machine Learning (ICML), JMLR: W&CP volume 32, 2014

  40. arXiv:1203.3468  [pdf

    cs.LG stat.ML

    Bayesian Rose Trees

    Authors: Charles Blundell, Yee Whye Teh, Katherine A. Heller

    Abstract: Hierarchical structure is ubiquitous in data across many domains. There are many hierarchical clustering methods, frequently used by domain experts, which strive to discover this structure. However, most of these methods limit discoverable hierarchies to those with binary branching structure. This limitation, while computationally convenient, is often undesirable. In this paper we explore a Bayesi… ▽ More

    Submitted 15 March, 2012; originally announced March 2012.

    Comments: Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

    Report number: UAI-P-2010-PG-65-72

  41. arXiv:1008.5386  [pdf, ps, other

    stat.ML cs.LG

    Mixed Cumulative Distribution Networks

    Authors: Ricardo Silva, Charles Blundell, Yee Whye Teh

    Abstract: Directed acyclic graphs (DAGs) are a popular framework to express multivariate probability distributions. Acyclic directed mixed graphs (ADMGs) are generalizations of DAGs that can succinctly capture much richer sets of conditional independencies, and are especially useful in modeling the effects of latent variables implicitly. Unfortunately there are currently no good parameterizations of general… ▽ More

    Submitted 31 August, 2010; originally announced August 2010.

    Comments: 11 pages, 4 figures