Zum Hauptinhalt springen

Showing 1–50 of 70 results for author: Rezende, D

.
  1. arXiv:2405.12205  [pdf, other

    cs.AI cs.LG

    Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving

    Authors: Aniket Didolkar, Anirudh Goyal, Nan Rosemary Ke, Siyuan Guo, Michal Valko, Timothy Lillicrap, Danilo Rezende, Yoshua Bengio, Michael Mozer, Sanjeev Arora

    Abstract: Metacognitive knowledge refers to humans' intuitive knowledge of their own thinking and reasoning processes. Today's best LLMs clearly possess some reasoning processes. The paper gives evidence that they also have metacognitive knowledge, including ability to name skills and procedures to apply given a task. We explore this primarily in context of math reasoning, developing a prompt-guided interac… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Preprint. Under review

  2. arXiv:2404.10179  [pdf, other

    cs.RO cs.AI cs.HC cs.LG

    Scaling Instructable Agents Across Many Simulated Worlds

    Authors: SIMA Team, Maria Abi Raad, Arun Ahuja, Catarina Barros, Frederic Besse, Andrew Bolt, Adrian Bolton, Bethanie Brownfield, Gavin Buttimore, Max Cant, Sarah Chakera, Stephanie C. Y. Chan, Jeff Clune, Adrian Collister, Vikki Copeman, Alex Cullum, Ishita Dasgupta, Dario de Cesare, Julia Di Trapani, Yani Donchev, Emma Dunleavy, Martin Engelcke, Ryan Faulkner, Frankie Garcia, Charles Gbadamosi , et al. (68 additional authors not shown)

    Abstract: Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructio… ▽ More

    Submitted 17 April, 2024; v1 submitted 13 March, 2024; originally announced April 2024.

  3. arXiv:2401.10874  [pdf, other

    hep-lat cs.LG

    Applications of flow models to the generation of correlated lattice QCD ensembles

    Authors: Ryan Abbott, Aleksandar Botev, Denis Boyda, Daniel C. Hackett, Gurtej Kanwar, Sébastien Racanière, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

    Abstract: Machine-learned normalizing flows can be used in the context of lattice quantum field theory to generate statistically correlated ensembles of lattice gauge fields at different action parameters. This work demonstrates how these correlations can be exploited for variance reduction in the computation of observables. Three different proof-of-concept applications are demonstrated using a novel residu… ▽ More

    Submitted 28 May, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: 12 pages, 2 tables, 5 figures. v2: accepted for publication

    Report number: MIT-CTP/5658, FERMILAB-PUB-24-0014-T

  4. arXiv:2310.15940  [pdf, other

    cs.AI cs.LG

    Combining Behaviors with the Successor Features Keyboard

    Authors: Wilka Carvalho, Andre Saraiva, Angelos Filos, Andrew Kyle Lampinen, Loic Matthey, Richard L. Lewis, Honglak Lee, Satinder Singh, Danilo J. Rezende, Daniel Zoran

    Abstract: The Option Keyboard (OK) was recently proposed as a method for transferring behavioral knowledge across tasks. OK transfers knowledge by adaptively combining subsets of known behaviors using Successor Features (SFs) and Generalized Policy Improvement (GPI). However, it relies on hand-designed state-features and task encodings which are cumbersome to design for every new environment. In this work,… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  5. Advances in machine-learning-based sampling motivated by lattice quantum chromodynamics

    Authors: Kyle Cranmer, Gurtej Kanwar, Sébastien Racanière, Danilo J. Rezende, Phiala E. Shanahan

    Abstract: Sampling from known probability distributions is a ubiquitous task in computational science, underlying calculations in domains from linguistics to biology and physics. Generative machine-learning (ML) models have emerged as a promising tool in this space, building on the success of this approach in applications such as image, text, and audio generation. Often, however, generative tasks in scienti… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: 11 pages, 5 figures

    Journal ref: Nature Reviews Physics 5, 526-535 (2023)

  6. arXiv:2305.02402  [pdf, other

    hep-lat cond-mat.stat-mech cs.LG

    Normalizing flows for lattice gauge theory in arbitrary space-time dimension

    Authors: Ryan Abbott, Michael S. Albergo, Aleksandar Botev, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Gurtej Kanwar, Alexander G. D. G. Matthews, Sébastien Racanière, Ali Razavi, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

    Abstract: Applications of normalizing flows to the sampling of field configurations in lattice gauge theory have so far been explored almost exclusively in two space-time dimensions. We report new algorithmic developments of gauge-equivariant flow architectures facilitating the generalization to higher-dimensional lattice geometries. Specifically, we discuss masked autoregressive transformations with tracta… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

  7. arXiv:2304.05823  [pdf, other

    q-bio.MN cs.LG q-bio.GN

    DiscoGen: Learning to Discover Gene Regulatory Networks

    Authors: Nan Rosemary Ke, Sara-Jane Dunn, Jorg Bornschein, Silvia Chiappa, Melanie Rey, Jean-Baptiste Lespiau, Albin Cassirer, Jane Wang, Theophane Weber, David Barrett, Matthew Botvinick, Anirudh Goyal, Mike Mozer, Danilo Rezende

    Abstract: Accurately inferring Gene Regulatory Networks (GRNs) is a critical and challenging task in biology. GRNs model the activatory and inhibitory interactions between genes and are inherently causal in nature. To accurately identify GRNs, perturbational data is required. However, most GRN discovery methods only operate on observational data. Recent advances in neural network-based causal discovery meth… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

  8. arXiv:2301.05747  [pdf, other

    cs.CV cs.AI

    Laser: Latent Set Representations for 3D Generative Modeling

    Authors: Pol Moreno, Adam R. Kosiorek, Heiko Strathmann, Daniel Zoran, Rosalia G. Schneider, Björn Winckler, Larisa Markeeva, Théophane Weber, Danilo J. Rezende

    Abstract: NeRF provides unparalleled fidelity of novel view synthesis: rendering a 3D scene from an arbitrary viewpoint. NeRF requires training on a large number of views that fully cover a scene, which limits its applicability. While these issues can be addressed by learning a prior over scenes in various forms, previous approaches have been either applied to overly simple scenes or struggling to render un… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: See https://laser-nv-paper.github.io/ for video results

  9. arXiv:2211.07541  [pdf, other

    hep-lat cond-mat.stat-mech cs.LG

    Aspects of scaling and scalability for flow-based sampling of lattice QCD

    Authors: Ryan Abbott, Michael S. Albergo, Aleksandar Botev, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Alexander G. D. G. Matthews, Sébastien Racanière, Ali Razavi, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

    Abstract: Recent applications of machine-learned normalizing flows to sampling in lattice field theory suggest that such methods may be able to mitigate critical slowing down and topological freezing. However, these demonstrations have been at the scale of toy models, and it remains to be determined whether they can be applied to state-of-the-art lattice quantum chromodynamics calculations. Assessing the vi… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: 22 pages, 8 figures

    Report number: MIT-CTP/5496

  10. arXiv:2208.03832  [pdf, other

    hep-lat

    Sampling QCD field configurations with gauge-equivariant flow models

    Authors: Ryan Abbott, Michael S. Albergo, Aleksandar Botev, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Gurtej Kanwar, Alexander G. D. G. Matthews, Sébastien Racanière, Ali Razavi, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

    Abstract: Machine learning methods based on normalizing flows have been shown to address important challenges, such as critical slowing-down and topological freezing, in the sampling of gauge field configurations in simple lattice field theories. A critical question is whether this success will translate to studies of QCD. This Proceedings presents a status update on advances in this area. In particular, it… ▽ More

    Submitted 20 August, 2022; v1 submitted 7 August, 2022; originally announced August 2022.

    Comments: Submitted as a proceedings to the 39th International Symposium on Lattice Field Theory (Lattice 2022)

  11. Gauge-equivariant flow models for sampling in lattice field theories with pseudofermions

    Authors: Ryan Abbott, Michael S. Albergo, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Gurtej Kanwar, Sébastien Racanière, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Betsy Tian, Julian M. Urban

    Abstract: This work presents gauge-equivariant architectures for flow-based sampling in fermionic lattice field theories using pseudofermions as stochastic estimators for the fermionic determinant. This is the default approach in state-of-the-art lattice field theory calculations, making this development critical to the practical application of flow models to theories such as QCD. Methods by which flow-base… ▽ More

    Submitted 16 October, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: 15 pages, 7 figures. v3: accepted version for publication. New appendix C

    Report number: MIT-CTP/5446, INT-PUB-22-017

    Journal ref: Phys.Rev.D 106 (2022) 7, 074506

  12. arXiv:2206.11846  [pdf, other

    cs.SI

    Analysis of account behaviors in Ethereum during an economic impact event

    Authors: Pedro Henrique F. S. Oliveira, Daniel Muller Rezende, Heder Soares Bernardino, Saulo Moraes Villela, Alex Borges Vieira

    Abstract: One of the main events that involve the world economy in 2022 is the conflict between Russia and Ukraine. This event offers a rare opportunity to analyze how events of this magnitude can reflect the use of cryptocurrencies. This work aims to investigate the behavior of accounts and their transactions on the Ethereum cryptocurrency during this event. To this end, we collected all transactions that… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: 13 pages, 5 figures

  13. arXiv:2204.04875  [pdf, other

    stat.ML cs.LG

    Learning to Induce Causal Structure

    Authors: Nan Rosemary Ke, Silvia Chiappa, Jane Wang, Anirudh Goyal, Jorg Bornschein, Melanie Rey, Theophane Weber, Matthew Botvinic, Michael Mozer, Danilo Jimenez Rezende

    Abstract: The fundamental challenge in causal induction is to infer the underlying graph structure given observational and/or interventional data. Most existing causal induction algorithms operate by generating candidate graphs and evaluating them using either score-based methods (including continuous optimization) or independence tests. In our work, we instead treat the inference process as a black box and… ▽ More

    Submitted 7 October, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

  14. arXiv:2203.09250  [pdf, other

    q-bio.NC cs.AI cs.LG cs.NE stat.ML

    Symmetry-Based Representations for Artificial and Biological General Intelligence

    Authors: Irina Higgins, Sébastien Racanière, Danilo Rezende

    Abstract: Biological intelligence is remarkable in its ability to produce complex behaviour in many diverse situations through data efficient, generalisable and transferable skill acquisition. It is believed that learning "good" sensory representations is important for enabling this, however there is little agreement as to what a good representation should look like. In this review article we are going to a… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

  15. arXiv:2202.11712  [pdf, other

    hep-lat cond-mat.stat-mech cs.LG

    Flow-based sampling in the lattice Schwinger model at criticality

    Authors: Michael S. Albergo, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Gurtej Kanwar, Sébastien Racanière, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

    Abstract: Recent results suggest that flow-based algorithms may provide efficient sampling of field distributions for lattice field theory applications, such as studies of quantum chromodynamics and the Schwinger model. In this work, we provide a numerical demonstration of robust flow-based sampling in the Schwinger model at the critical value of the fermion mass. In contrast, at the same parameters, conven… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: 5 pages main text, 3 pages supplementary material. 4 figures

    Report number: MIT-CTP/5409

  16. arXiv:2201.13117  [pdf, other

    stat.ML cond-mat.stat-mech cs.LG hep-lat

    Continual Repeated Annealed Flow Transport Monte Carlo

    Authors: Alexander G. D. G. Matthews, Michael Arbel, Danilo J. Rezende, Arnaud Doucet

    Abstract: We propose Continual Repeated Annealed Flow Transport Monte Carlo (CRAFT), a method that combines a sequential Monte Carlo (SMC) sampler (itself a generalization of Annealed Importance Sampling) with variational inference using normalizing flows. The normalizing flows are directly trained to transport between annealing temperatures using a KL divergence for each transition. This optimization objec… ▽ More

    Submitted 6 April, 2023; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: 21 pages, 6 figures Published at International Conference on Machine Learning (ICML) 2022

  17. arXiv:2201.12204  [pdf, other

    cs.LG

    From data to functa: Your data point is a function and you can treat it like one

    Authors: Emilien Dupont, Hyunjik Kim, S. M. Ali Eslami, Danilo Rezende, Dan Rosenbaum

    Abstract: It is common practice in deep learning to represent a measurement of the world on a discrete grid, e.g. a 2D grid of pixels. However, the underlying signal represented by these measurements is often continuous, e.g. the scene depicted in an image. A powerful continuous alternative is then to represent these measurements using an implicit neural representation, a neural function trained to output t… ▽ More

    Submitted 10 November, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

  18. arXiv:2110.01288  [pdf, other

    stat.ML cs.LG

    Implicit Riemannian Concave Potential Maps

    Authors: Danilo J. Rezende, Sébastien Racanière

    Abstract: We are interested in the challenging problem of modelling densities on Riemannian manifolds with a known symmetry group using normalising flows. This has many potential applications in physical sciences such as molecular dynamics and quantum simulations. In this work we combine ideas from implicit neural layers and optimal transport theory to propose a generalisation of existing work on exponentia… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

  19. arXiv:2107.00848  [pdf, other

    stat.ML cs.LG

    Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning

    Authors: Nan Rosemary Ke, Aniket Didolkar, Sarthak Mittal, Anirudh Goyal, Guillaume Lajoie, Stefan Bauer, Danilo Rezende, Yoshua Bengio, Michael Mozer, Christopher Pal

    Abstract: Inducing causal relationships from observations is a classic problem in machine learning. Most work in causality starts from the premise that the causal variables themselves are observed. However, for AI agents such as robots trying to make sense of their environment, the only observables are low-level variables like pixels in images. To generalize well, an agent must induce high-level variables,… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

  20. arXiv:2106.05934  [pdf, other

    hep-lat cond-mat.stat-mech cs.LG

    Flow-based sampling for fermionic lattice field theories

    Authors: Michael S. Albergo, Gurtej Kanwar, Sébastien Racanière, Danilo J. Rezende, Julian M. Urban, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Phiala E. Shanahan

    Abstract: Algorithms based on normalizing flows are emerging as promising machine learning approaches to sampling complicated probability distributions in a way that can be made asymptotically exact. In the context of lattice field theory, proof-of-principle studies have demonstrated the effectiveness of this approach for scalar theories, gauge theories, and statistical systems. This work develops approache… ▽ More

    Submitted 28 December, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: 26 pages, 5 figures

    Report number: MIT-CTP/5307

    Journal ref: Phys. Rev. D 104, 114507 (2021)

  21. arXiv:2104.00587  [pdf, other

    stat.ML cs.LG

    NeRF-VAE: A Geometry Aware 3D Scene Generative Model

    Authors: Adam R. Kosiorek, Heiko Strathmann, Daniel Zoran, Pol Moreno, Rosalia Schneider, Soňa Mokrá, Danilo J. Rezende

    Abstract: We propose NeRF-VAE, a 3D scene generative model that incorporates geometric structure via NeRF and differentiable volume rendering. In contrast to NeRF, our model takes into account shared structure across scenes, and is able to infer the structure of a novel scene -- without the need to re-train -- using amortized inference. NeRF-VAE's explicit 3D rendering process further contrasts previous gen… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

    Comments: 17 pages, 15 figures, under review

  22. arXiv:2101.08176  [pdf, other

    hep-lat cond-mat.stat-mech cs.LG

    Introduction to Normalizing Flows for Lattice Field Theory

    Authors: Michael S. Albergo, Denis Boyda, Daniel C. Hackett, Gurtej Kanwar, Kyle Cranmer, Sébastien Racanière, Danilo Jimenez Rezende, Phiala E. Shanahan

    Abstract: This notebook tutorial demonstrates a method for sampling Boltzmann distributions of lattice field theories using a class of machine learning models known as normalizing flows. The ideas and approaches proposed in arXiv:1904.12072, arXiv:2002.02428, and arXiv:2003.06413 are reviewed and a concrete implementation of the framework is presented. We apply this framework to a lattice scalar field theor… ▽ More

    Submitted 6 August, 2021; v1 submitted 20 January, 2021; originally announced January 2021.

    Comments: 38 pages, 5 numbered figures, Jupyter notebook included as ancillary file

    Report number: MIT-CTP/5272

  23. arXiv:2012.02035  [pdf, other

    stat.ML cs.LG

    Integrable Nonparametric Flows

    Authors: David Pfau, Danilo Rezende

    Abstract: We introduce a method for reconstructing an infinitesimal normalizing flow given only an infinitesimal change to a (possibly unnormalized) probability distribution. This reverses the conventional task of normalizing flows -- rather than being given samples from a unknown target distribution and learning a flow that approximates the distribution, we are given a perturbation to an initial distributi… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: Accepted to 3rd NeurIPS Workshop on Machine Learning and Physical Sciences

  24. arXiv:2008.09301  [pdf, other

    stat.ML cs.LG

    Amortized learning of neural causal representations

    Authors: Nan Rosemary Ke, Jane. X. Wang, Jovana Mitrovic, Martin Szummer, Danilo J. Rezende

    Abstract: Causal models can compactly and efficiently encode the data-generating process under all interventions and hence may generalize better under changes in distribution. These models are often represented as Bayesian networks and learning them scales poorly with the number of variables. Moreover, these approaches cannot leverage previously learned knowledge to help with learning new causal models. In… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: ICLR 2020 causal learning for decision making workshop

  25. arXiv:2008.05456  [pdf, other

    hep-lat cs.LG stat.ML

    Sampling using $SU(N)$ gauge equivariant flows

    Authors: Denis Boyda, Gurtej Kanwar, Sébastien Racanière, Danilo Jimenez Rezende, Michael S. Albergo, Kyle Cranmer, Daniel C. Hackett, Phiala E. Shanahan

    Abstract: We develop a flow-based sampling algorithm for $SU(N)$ lattice gauge theories that is gauge-invariant by construction. Our key contribution is constructing a class of flows on an $SU(N)$ variable (or on a $U(N)$ variable by a simple alternative) that respect matrix conjugation symmetry. We apply this technique to sample distributions of single $SU(N)$ variables and to construct flow-based samplers… ▽ More

    Submitted 18 September, 2020; v1 submitted 12 August, 2020; originally announced August 2020.

    Comments: 24 pages, 19 figures

    Report number: MIT-CTP/5228

    Journal ref: Phys. Rev. D 103, 074504 (2021)

  26. arXiv:2006.16841  [pdf, other

    cs.CV cs.LG

    Conditional Set Generation with Transformers

    Authors: Adam R Kosiorek, Hyunjik Kim, Danilo J Rezende

    Abstract: A set is an unordered collection of unique elements--and yet many machine learning models that generate sets impose an implicit or explicit ordering. Since model performance can depend on the choice of order, any particular ordering can lead to sub-optimal results. An alternative solution is to use a permutation-equivariant set generator, which does not specify an order-ing. An example of such a g… ▽ More

    Submitted 1 July, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: 6 pages, 6 figures, ICML 2020 Workshop on Object-Oriented Learning

  27. arXiv:2003.13367  [pdf, other

    cs.LG cs.IT stat.ML

    Neural Communication Systems with Bandwidth-limited Channel

    Authors: Karen Ullrich, Fabio Viola, Danilo Jimenez Rezende

    Abstract: Reliably transmitting messages despite information loss due to a noisy channel is a core problem of information theory. One of the most important aspects of real world communication, e.g. via wifi, is that it may happen at varying levels of information transfer. The bandwidth-limited channel models this phenomenon. In this study we consider learning coding with the bandwidth-limited channel (BWLC)… ▽ More

    Submitted 1 April, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

  28. arXiv:2003.06413  [pdf, other

    hep-lat cond-mat.stat-mech cs.LG

    Equivariant flow-based sampling for lattice gauge theory

    Authors: Gurtej Kanwar, Michael S. Albergo, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Sébastien Racanière, Danilo Jimenez Rezende, Phiala E. Shanahan

    Abstract: We define a class of machine-learned flow-based sampling algorithms for lattice gauge theories that are gauge-invariant by construction. We demonstrate the application of this framework to U(1) gauge theory in two spacetime dimensions, and find that near critical points in parameter space the approach is orders of magnitude more efficient at sampling topological quantities than more traditional sa… ▽ More

    Submitted 13 March, 2020; originally announced March 2020.

    Comments: 6 pages, 4 figures

    Report number: MIT-CTP/5181

    Journal ref: Phys. Rev. Lett. 125, 121601 (2020)

  29. arXiv:2002.04913  [pdf, other

    physics.comp-ph physics.chem-ph stat.ML

    Targeted free energy estimation via learned mappings

    Authors: Peter Wirnsberger, Andrew J. Ballard, George Papamakarios, Stuart Abercrombie, Sébastien Racanière, Alexander Pritzel, Danilo Jimenez Rezende, Charles Blundell

    Abstract: Free energy perturbation (FEP) was proposed by Zwanzig more than six decades ago as a method to estimate free energy differences, and has since inspired a huge body of related methods that use it as an integral building block. Being an importance sampling based estimator, however, FEP suffers from a severe limitation: the requirement of sufficient overlap between distributions. One strategy to mit… ▽ More

    Submitted 18 August, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Added figure 3, added data augmentation for octahedral symmetries, updated experimental results and revised text (11 pages, 6 figures)

  30. arXiv:2002.02836  [pdf, other

    cs.LG cs.AI stat.ML

    Causally Correct Partial Models for Reinforcement Learning

    Authors: Danilo J. Rezende, Ivo Danihelka, George Papamakarios, Nan Rosemary Ke, Ray Jiang, Theophane Weber, Karol Gregor, Hamza Merzic, Fabio Viola, Jane Wang, Jovana Mitrovic, Frederic Besse, Ioannis Antonoglou, Lars Buesing

    Abstract: In reinforcement learning, we can learn a model of future observations and rewards, and use it to plan the agent's next actions. However, jointly modeling future observations can be computationally expensive or even intractable if the observations are high-dimensional (e.g. images). For this reason, previous works have considered partial models, which model only part of the observation. In this pa… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

  31. arXiv:2002.02428  [pdf, other

    stat.ML cs.LG

    Normalizing Flows on Tori and Spheres

    Authors: Danilo Jimenez Rezende, George Papamakarios, Sébastien Racanière, Michael S. Albergo, Gurtej Kanwar, Phiala E. Shanahan, Kyle Cranmer

    Abstract: Normalizing flows are a powerful tool for building expressive distributions in high dimensions. So far, most of the literature has concentrated on learning flows on Euclidean spaces. Some problems however, such as those involving angles, are defined on spaces with more complex geometries, such as tori or spheres. In this paper, we propose and compare expressive and numerically stable flows on such… ▽ More

    Submitted 1 July, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: Accepted to the International Conference on Machine Learning (ICML) 2020

  32. arXiv:1912.02762  [pdf, other

    stat.ML cs.LG

    Normalizing Flows for Probabilistic Modeling and Inference

    Authors: George Papamakarios, Eric Nalisnick, Danilo Jimenez Rezende, Shakir Mohamed, Balaji Lakshminarayanan

    Abstract: Normalizing flows provide a general mechanism for defining expressive probability distributions, only requiring the specification of a (usually simple) base distribution and a series of bijective transformations. There has been much recent work on normalizing flows, ranging from improving their expressive power to expanding their application. We believe the field has now matured and is in need of… ▽ More

    Submitted 8 April, 2021; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: Review article, 64 pages, 9 figures. Published in the Journal of Machine Learning Research (see https://jmlr.org/papers/v22/19-1028.html)

    Journal ref: Journal of Machine Learning Research, 22(57):1-64, 2021

  33. arXiv:1912.00830  [pdf, other

    cs.CV

    Information bottleneck through variational glasses

    Authors: Slava Voloshynovskiy, Mouad Kondah, Shideh Rezaeifar, Olga Taran, Taras Holotyak, Danilo Jimenez Rezende

    Abstract: Information bottleneck (IB) principle [1] has become an important element in information-theoretic analysis of deep models. Many state-of-the-art generative models of both Variational Autoencoder (VAE) [2; 3] and Generative Adversarial Networks (GAN) [4] families use various bounds on mutual information terms to introduce certain regularization constraints [5; 6; 7; 8; 9; 10]. Accordingly, the mai… ▽ More

    Submitted 5 December, 2019; v1 submitted 2 December, 2019; originally announced December 2019.

  34. arXiv:1909.13789  [pdf, other

    cs.LG stat.ML

    Hamiltonian Generative Networks

    Authors: Peter Toth, Danilo Jimenez Rezende, Andrew Jaegle, Sébastien Racanière, Aleksandar Botev, Irina Higgins

    Abstract: The Hamiltonian formalism plays a central role in classical and quantum physics. Hamiltonians are the main tool for modelling the continuous time evolution of systems with conserved quantities, and they come equipped with many useful properties, like time reversibility and smooth interpolation in time. These properties are important for many machine learning problems - from sequence prediction to… ▽ More

    Submitted 14 February, 2020; v1 submitted 30 September, 2019; originally announced September 2019.

  35. arXiv:1909.13739  [pdf, other

    stat.ML cs.LG

    Equivariant Hamiltonian Flows

    Authors: Danilo Jimenez Rezende, Sébastien Racanière, Irina Higgins, Peter Toth

    Abstract: This paper introduces equivariant hamiltonian flows, a method for learning expressive densities that are invariant with respect to a known Lie-algebra of local symmetry transformations while providing an equivariant representation of the data. We provide proof of principle demonstrations of how such flows can be learnt, as well as how the addition of symmetry invariance constraints can improve dat… ▽ More

    Submitted 30 September, 2019; originally announced September 2019.

  36. arXiv:1906.09237  [pdf, other

    cs.LG cs.AI stat.ML

    Shaping Belief States with Generative Environment Models for RL

    Authors: Karol Gregor, Danilo Jimenez Rezende, Frederic Besse, Yan Wu, Hamza Merzic, Aaron van den Oord

    Abstract: When agents interact with a complex environment, they must form and maintain beliefs about the relevant aspects of that environment. We propose a way to efficiently train expressive generative models in complex environments. We show that a predictive algorithm with an expressive generative model can form stable belief-states in visually rich and dynamic 3D environments. More precisely, we show tha… ▽ More

    Submitted 24 June, 2019; v1 submitted 21 June, 2019; originally announced June 2019.

    Comments: pre-print

  37. arXiv:1906.02500  [pdf, other

    cs.LG stat.ML

    Towards Interpretable Reinforcement Learning Using Attention Augmented Agents

    Authors: Alex Mott, Daniel Zoran, Mike Chrzanowski, Daan Wierstra, Danilo J. Rezende

    Abstract: Inspired by recent work in attention models for image captioning and question answering, we present a soft attention model for the reinforcement learning domain. This model uses a soft, top-down attention mechanism to create a bottleneck in the agent, forcing it to focus on task-relevant information by sequentially querying its view of the environment. The output of the attention mechanism allows… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

  38. arXiv:1905.13077  [pdf, other

    cs.CV

    A Hierarchical Probabilistic U-Net for Modeling Multi-Scale Ambiguities

    Authors: Simon A. A. Kohl, Bernardino Romera-Paredes, Klaus H. Maier-Hein, Danilo Jimenez Rezende, S. M. Ali Eslami, Pushmeet Kohli, Andrew Zisserman, Olaf Ronneberger

    Abstract: Medical imaging only indirectly measures the molecular identity of the tissue within each voxel, which often produces only ambiguous image evidence for target measures of interest, like semantic segmentation. This diversity and the variations of plausible interpretations are often specific to given image regions and may thus manifest on various scales, spanning all the way from the pixel to the im… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

    Comments: 25 pages, 15 figures

  39. arXiv:1812.02230  [pdf, other

    cs.LG stat.ML

    Towards a Definition of Disentangled Representations

    Authors: Irina Higgins, David Amos, David Pfau, Sebastien Racaniere, Loic Matthey, Danilo Rezende, Alexander Lerchner

    Abstract: How can intelligent agents solve a diverse set of tasks in a data-efficient manner? The disentangled representation learning approach posits that such an agent would benefit from separating out (disentangling) the underlying structure of the world into disjoint parts of its representation. However, there is no generally agreed-upon definition of disentangling, not least because it is unclear how t… ▽ More

    Submitted 5 December, 2018; originally announced December 2018.

  40. arXiv:1810.00597  [pdf, other

    stat.ML cs.LG

    Taming VAEs

    Authors: Danilo Jimenez Rezende, Fabio Viola

    Abstract: In spite of remarkable progress in deep latent variable generative modeling, training still remains a challenge due to a combination of optimization and generalization issues. In practice, a combination of heuristic algorithms (such as hand-crafted annealing of KL-terms) is often used in order to achieve the desired results, but such solutions are not robust to changes in model architecture or dat… ▽ More

    Submitted 1 October, 2018; originally announced October 2018.

  41. arXiv:1807.03149  [pdf, other

    cs.CV cs.LG stat.ML

    Learning models for visual 3D localization with implicit mapping

    Authors: Dan Rosenbaum, Frederic Besse, Fabio Viola, Danilo J. Rezende, S. M. Ali Eslami

    Abstract: We consider learning based methods for visual localization that do not require the construction of explicit maps in the form of point clouds or voxels. The goal is to learn an implicit representation of the environment at a higher, more abstract level. We propose to use a generative approach based on Generative Query Networks (GQNs, Eslami et al. 2018), asking the following questions: 1) Can GQN c… ▽ More

    Submitted 12 December, 2018; v1 submitted 4 July, 2018; originally announced July 2018.

  42. arXiv:1807.02033  [pdf, other

    cs.CV cs.LG stat.ML

    Consistent Generative Query Networks

    Authors: Ananya Kumar, S. M. Ali Eslami, Danilo J. Rezende, Marta Garnelo, Fabio Viola, Edward Lockhart, Murray Shanahan

    Abstract: Stochastic video prediction models take in a sequence of image frames, and generate a sequence of consecutive future image frames. These models typically generate future frames in an autoregressive fashion, which is slow and requires the input and output frames to be consecutive. We introduce a model that overcomes these drawbacks by generating a latent representation from an arbitrary set of fram… ▽ More

    Submitted 21 April, 2019; v1 submitted 5 July, 2018; originally announced July 2018.

  43. arXiv:1807.01622  [pdf, other

    cs.LG stat.ML

    Neural Processes

    Authors: Marta Garnelo, Jonathan Schwarz, Dan Rosenbaum, Fabio Viola, Danilo J. Rezende, S. M. Ali Eslami, Yee Whye Teh

    Abstract: A neural network (NN) is a parameterised function that can be tuned via gradient descent to approximate a labelled collection of data with high precision. A Gaussian process (GP), on the other hand, is a probabilistic model that defines a distribution over possible functions, and is updated in light of data via the rules of probabilistic inference. GPs are probabilistic, data-efficient and flexibl… ▽ More

    Submitted 4 July, 2018; originally announced July 2018.

  44. arXiv:1807.01613  [pdf, other

    cs.LG stat.ML

    Conditional Neural Processes

    Authors: Marta Garnelo, Dan Rosenbaum, Chris J. Maddison, Tiago Ramalho, David Saxton, Murray Shanahan, Yee Whye Teh, Danilo J. Rezende, S. M. Ali Eslami

    Abstract: Deep neural networks excel at function approximation, yet they are typically trained from scratch for each new function. On the other hand, Bayesian methods, such as Gaussian Processes (GPs), exploit prior knowledge to quickly infer the shape of a new function at test time. Yet GPs are computationally expensive, and it can be hard to design appropriate priors. In this paper we propose a family of… ▽ More

    Submitted 4 July, 2018; originally announced July 2018.

  45. arXiv:1806.05034  [pdf, other

    cs.CV cs.LG cs.NE stat.ML

    A Probabilistic U-Net for Segmentation of Ambiguous Images

    Authors: Simon A. A. Kohl, Bernardino Romera-Paredes, Clemens Meyer, Jeffrey De Fauw, Joseph R. Ledsam, Klaus H. Maier-Hein, S. M. Ali Eslami, Danilo Jimenez Rezende, Olaf Ronneberger

    Abstract: Many real-world vision problems suffer from inherent ambiguities. In clinical applications for example, it might not be clear from a CT scan alone which particular region is cancer tissue. Therefore a group of graders typically produces a set of diverse but plausible segmentations. We consider the task of learning a distribution over segmentations given an input. To this end we propose a generativ… ▽ More

    Submitted 29 January, 2019; v1 submitted 13 June, 2018; originally announced June 2018.

    Comments: Last update: added further details about the LIDC experiment. 11 pages for the main paper, 28 pages including appendix. 5 figures in the main paper, 18 figures in total, Advances in Neural Information Processing Systems (NeurIPS), 2018

  46. arXiv:1804.09401  [pdf, other

    stat.ML cs.LG

    Generative Temporal Models with Spatial Memory for Partially Observed Environments

    Authors: Marco Fraccaro, Danilo Jimenez Rezende, Yori Zwols, Alexander Pritzel, S. M. Ali Eslami, Fabio Viola

    Abstract: In model-based reinforcement learning, generative and temporal models of environments can be leveraged to boost agent performance, either by tuning the agent's representations during training or via use as part of an explicit planning mechanism. However, their application in practice has been limited to simplistic environments, due to the difficulty of training such models in larger, potentially p… ▽ More

    Submitted 19 July, 2018; v1 submitted 25 April, 2018; originally announced April 2018.

    Comments: ICML 2018

  47. arXiv:1803.10760  [pdf, other

    cs.LG stat.ML

    Unsupervised Predictive Memory in a Goal-Directed Agent

    Authors: Greg Wayne, Chia-Chun Hung, David Amos, Mehdi Mirza, Arun Ahuja, Agnieszka Grabska-Barwinska, Jack Rae, Piotr Mirowski, Joel Z. Leibo, Adam Santoro, Mevlana Gemici, Malcolm Reynolds, Tim Harley, Josh Abramson, Shakir Mohamed, Danilo Rezende, David Saxton, Adam Cain, Chloe Hillier, David Silver, Koray Kavukcuoglu, Matt Botvinick, Demis Hassabis, Timothy Lillicrap

    Abstract: Animals execute goal-directed behaviours despite the limited range and scope of their sensors. To cope, they explore environments and store memories maintaining estimates of important information that is not presently available. Recently, progress has been made with artificial intelligence (AI) agents that learn to perform tasks from sensory input, even at a human level, by merging reinforcement l… ▽ More

    Submitted 28 March, 2018; originally announced March 2018.

  48. arXiv:1803.01682  [pdf, other

    stat.ML cs.LG

    Beyond Greedy Ranking: Slate Optimization via List-CVAE

    Authors: Ray Jiang, Sven Gowal, Timothy A. Mann, Danilo J. Rezende

    Abstract: The conventional solution to the recommendation problem greedily ranks individual document candidates by prediction scores. However, this method fails to optimize the slate as a whole, and hence, often struggles to capture biases caused by the page layout and document interdepedencies. The slate recommendation problem aims to directly find the optimally ordered subset of documents (i.e. slates) th… ▽ More

    Submitted 23 February, 2019; v1 submitted 5 March, 2018; originally announced March 2018.

  49. arXiv:1802.03006  [pdf, other

    cs.LG

    Learning and Querying Fast Generative Models for Reinforcement Learning

    Authors: Lars Buesing, Theophane Weber, Sebastien Racaniere, S. M. Ali Eslami, Danilo Rezende, David P. Reichert, Fabio Viola, Frederic Besse, Karol Gregor, Demis Hassabis, Daan Wierstra

    Abstract: A key challenge in model-based reinforcement learning (RL) is to synthesize computationally efficient and accurate environment models. We show that carefully designed generative models that learn and operate on compact state representations, so-called state-space models, substantially reduce the computational costs for predicting outcomes of sequences of actions. Extensive experiments establish th… ▽ More

    Submitted 8 February, 2018; originally announced February 2018.

  50. arXiv:1711.08378  [pdf

    cs.AI

    Building Machines that Learn and Think for Themselves: Commentary on Lake et al., Behavioral and Brain Sciences, 2017

    Authors: M. Botvinick, D. G. T. Barrett, P. Battaglia, N. de Freitas, D. Kumaran, J. Z Leibo, T. Lillicrap, J. Modayil, S. Mohamed, N. C. Rabinowitz, D. J. Rezende, A. Santoro, T. Schaul, C. Summerfield, G. Wayne, T. Weber, D. Wierstra, S. Legg, D. Hassabis

    Abstract: We agree with Lake and colleagues on their list of key ingredients for building humanlike intelligence, including the idea that model-based reasoning is essential. However, we favor an approach that centers on one additional ingredient: autonomy. In particular, we aim toward agents that can both build and exploit their own internal models, with minimal human hand-engineering. We believe an approac… ▽ More

    Submitted 22 November, 2017; originally announced November 2017.