Zum Hauptinhalt springen

Showing 1–18 of 18 results for author: Brekelmans, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.17546  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo

    Authors: Stephen Zhao, Rob Brekelmans, Alireza Makhzani, Roger Grosse

    Abstract: Numerous capability and safety techniques of Large Language Models (LLMs), including RLHF, automated red-teaming, prompt engineering, and infilling, can be cast as sampling from an unnormalized target distribution defined by a given reward or potential function over the full sequence. In this work, we leverage the rich toolkit of Sequential Monte Carlo (SMC) for these probabilistic inference probl… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  2. arXiv:2310.10649  [pdf, other

    cs.LG math.OC stat.ML

    A Computational Framework for Solving Wasserstein Lagrangian Flows

    Authors: Kirill Neklyudov, Rob Brekelmans, Alexander Tong, Lazar Atanackovic, Qiang Liu, Alireza Makhzani

    Abstract: The dynamical formulation of the optimal transport can be extended through various choices of the underlying geometry (kinetic energy), and the regularization of density paths (potential energy). These combinations yield different variational problems (Lagrangians), encompassing many variations of the optimal transport problem such as the Schrödinger bridge, unbalanced optimal transport, and optim… ▽ More

    Submitted 3 July, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

  3. arXiv:2303.06992  [pdf, other

    cs.LG stat.ML

    Improving Mutual Information Estimation with Annealed and Energy-Based Bounds

    Authors: Rob Brekelmans, Sicong Huang, Marzyeh Ghassemi, Greg Ver Steeg, Roger Grosse, Alireza Makhzani

    Abstract: Mutual information (MI) is a fundamental quantity in information theory and machine learning. However, direct estimation of MI is intractable, even if the true joint probability density for the variables of interest is known, as it involves estimating a potentially high-dimensional log partition function. In this work, we present a unifying view of existing MI bounds from the perspective of import… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: A shorter version appeared in the International Conference on Learning Representations (ICLR) 2022

    Journal ref: ICLR 2022 https://openreview.net/forum?id=T0B9AoM_bFg

  4. arXiv:2302.03792  [pdf, other

    cs.LG cs.IT

    Information-Theoretic Diffusion

    Authors: Xianghao Kong, Rob Brekelmans, Greg Ver Steeg

    Abstract: Denoising diffusion models have spurred significant gains in density modeling and image generation, precipitating an industrial revolution in text-guided AI art generation. We introduce a new mathematical foundation for diffusion models inspired by classic results in information theory that connect Information with Minimum Mean Square Error regression, the so-called I-MMSE relations. We generalize… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: 26 pages, 7 figures, International Conference on Learning Representations (ICLR), 2023. Code is at http://github.com/kxh001/ITdiffusion and http://github.com/gregversteeg/InfoDiffusionSimple

  5. arXiv:2210.06662  [pdf, other

    cs.LG

    Action Matching: Learning Stochastic Dynamics from Samples

    Authors: Kirill Neklyudov, Rob Brekelmans, Daniel Severo, Alireza Makhzani

    Abstract: Learning the continuous dynamics of a system from snapshots of its temporal marginals is a problem which appears throughout natural sciences and machine learning, including in quantum systems, single-cell biological data, and generative modeling. In these settings, we assume access to cross-sectional samples that are uncorrelated over time, rather than full trajectories of samples. In order to bet… ▽ More

    Submitted 8 June, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Published in ICML 2023

  6. arXiv:2209.07481  [pdf, other

    cs.LG cs.IT math.ST stat.ML

    Variational Representations of Annealing Paths: Bregman Information under Monotonic Embedding

    Authors: Rob Brekelmans, Frank Nielsen

    Abstract: Markov Chain Monte Carlo methods for sampling from complex distributions and estimating normalization constants often simulate samples from a sequence of intermediate distributions along an annealing path, which bridges between a tractable initial distribution and a target density of interest. Prior works have constructed annealing paths using quasi-arithmetic means, and interpreted the resulting… ▽ More

    Submitted 6 February, 2024; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: Published in Information Geometry (Info. Geo. 2024)

  7. arXiv:2203.12592  [pdf, other

    cs.LG stat.ML

    Your Policy Regularizer is Secretly an Adversary

    Authors: Rob Brekelmans, Tim Genewein, Jordi Grau-Moya, Grégoire Delétang, Markus Kunesch, Shane Legg, Pedro Ortega

    Abstract: Policy regularization methods such as maximum entropy regularization are widely used in reinforcement learning to improve the robustness of a learned policy. In this paper, we show how this robustness arises from hedging against worst-case perturbations of the reward function, which are chosen from a limited set by an imagined adversary. Using convex duality, we characterize this robust set of adv… ▽ More

    Submitted 8 July, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: Transactions on Machine Learning Research

    Journal ref: TMLR (2022) https://openreview.net/forum?id=berNQMTYWZ

  8. arXiv:2111.02907  [pdf, other

    cs.LG

    Model-Free Risk-Sensitive Reinforcement Learning

    Authors: Grégoire Delétang, Jordi Grau-Moya, Markus Kunesch, Tim Genewein, Rob Brekelmans, Shane Legg, Pedro A. Ortega

    Abstract: We extend temporal-difference (TD) learning in order to obtain risk-sensitive, model-free reinforcement learning algorithms. This extension can be regarded as modification of the Rescorla-Wagner rule, where the (sigmoidal) stimulus is taken to be either the event of over- or underestimating the TD target. As a result, one obtains a stochastic approximation rule for estimating the free energy from… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: DeepMind Tech Report: 13 pages, 4 figures

  9. arXiv:2107.00745  [pdf, other

    cs.LG cs.AI stat.ML

    q-Paths: Generalizing the Geometric Annealing Path using Power Means

    Authors: Vaden Masrani, Rob Brekelmans, Thang Bui, Frank Nielsen, Aram Galstyan, Greg Ver Steeg, Frank Wood

    Abstract: Many common machine learning methods involve the geometric annealing path, a sequence of intermediate densities between two distributions of interest constructed using the geometric average. While alternatives such as the moment-averaging path have demonstrated performance gains in some settings, their practical applicability remains limited by exponential family endpoint assumptions and a lack of… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: arXiv admin note: text overlap with arXiv:2012.07823

  10. arXiv:2012.15480  [pdf, other

    cs.LG cs.IT stat.ML

    Likelihood Ratio Exponential Families

    Authors: Rob Brekelmans, Frank Nielsen, Alireza Makhzani, Aram Galstyan, Greg Ver Steeg

    Abstract: The exponential family is well known in machine learning and statistical physics as the maximum entropy distribution subject to a set of observed constraints, while the geometric mixture path is common in MCMC methods such as annealed importance sampling. Linking these two ideas, recent work has interpreted the geometric mixture path as an exponential family of distributions to analyze the thermod… ▽ More

    Submitted 15 January, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

    Comments: NeurIPS Workshop on Deep Learning through Information Geometry

  11. arXiv:2012.07823  [pdf, other

    cs.LG

    Annealed Importance Sampling with q-Paths

    Authors: Rob Brekelmans, Vaden Masrani, Thang Bui, Frank Wood, Aram Galstyan, Greg Ver Steeg, Frank Nielsen

    Abstract: Annealed importance sampling (AIS) is the gold standard for estimating partition functions or marginal likelihoods, corresponding to importance sampling over a path of distributions between a tractable base and an unnormalized target. While AIS yields an unbiased estimator for any path, existing literature has been primarily limited to the geometric mixture or moment-averaged paths associated with… ▽ More

    Submitted 14 December, 2020; originally announced December 2020.

    Comments: NeurIPS Workshop on Deep Learning through Information Geometry (Best Paper Award)

    Journal ref: Published at UAI 2021 https://arxiv.org/abs/2107.00745

  12. arXiv:2010.15750  [pdf, other

    cs.LG

    Gaussian Process Bandit Optimization of the Thermodynamic Variational Objective

    Authors: Vu Nguyen, Vaden Masrani, Rob Brekelmans, Michael A. Osborne, Frank Wood

    Abstract: Achieving the full promise of the Thermodynamic Variational Objective (TVO), a recently proposed variational lower bound on the log evidence involving a one-dimensional Riemann integral approximation, requires choosing a "schedule" of sorted discretization points. This paper introduces a bespoke Gaussian process bandit optimization method for automatically choosing these points. Our approach not o… ▽ More

    Submitted 20 November, 2020; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: NeurIPS 2020

  13. arXiv:2007.00642  [pdf, other

    cs.LG stat.ML

    All in the Exponential Family: Bregman Duality in Thermodynamic Variational Inference

    Authors: Rob Brekelmans, Vaden Masrani, Frank Wood, Greg Ver Steeg, Aram Galstyan

    Abstract: The recently proposed Thermodynamic Variational Objective (TVO) leverages thermodynamic integration to provide a family of variational inference objectives, which both tighten and generalize the ubiquitous Evidence Lower Bound (ELBO). However, the tightness of TVO bounds was not previously known, an expensive grid search was used to choose a "schedule" of intermediate distributions, and model lear… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: ICML 2020

  14. arXiv:1912.00646  [pdf, other

    cs.LG stat.ML

    Discovery and Separation of Features for Invariant Representation Learning

    Authors: Ayush Jaiswal, Rob Brekelmans, Daniel Moyer, Greg Ver Steeg, Wael AbdAlmageed, Premkumar Natarajan

    Abstract: Supervised machine learning models often associate irrelevant nuisance factors with the prediction target, which hurts generalization. We propose a framework for training robust neural networks that induces invariance to nuisances through learning to discover and separate predictive and nuisance factors of data. We present an information theoretic formulation of our approach, from which we derive… ▽ More

    Submitted 2 December, 2019; originally announced December 2019.

    Comments: 10 pages, 3 figures

  15. arXiv:1904.07199  [pdf, other

    cs.LG cs.IT stat.ML

    Exact Rate-Distortion in Autoencoders via Echo Noise

    Authors: Rob Brekelmans, Daniel Moyer, Aram Galstyan, Greg Ver Steeg

    Abstract: Compression is at the heart of effective representation learning. However, lossy compression is typically achieved through simple parametric models like Gaussian noise to preserve analytic tractability, and the limitations this imposes on learning are largely unexplored. Further, the Gaussian prior assumptions in models such as variational autoencoders (VAEs) provide only an upper bound on the com… ▽ More

    Submitted 14 November, 2019; v1 submitted 15 April, 2019; originally announced April 2019.

    Comments: NeurIPS 2019; updated Gaussian baseline results, added disentanglement

  16. arXiv:1805.09458  [pdf, other

    cs.LG stat.ML

    Invariant Representations without Adversarial Training

    Authors: Daniel Moyer, Shuyang Gao, Rob Brekelmans, Greg Ver Steeg, Aram Galstyan

    Abstract: Representations of data that are invariant to changes in specified factors are useful for a wide range of problems: removing potential biases in prediction problems, controlling the effects of covariates, and disentangling meaningful factors of variation. Unfortunately, learning representations that exhibit invariance to arbitrary nuisance factors yet remain useful for other tasks is challenging.… ▽ More

    Submitted 2 December, 2019; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: NeurIPS 2018, with corrections

  17. arXiv:1802.05822  [pdf, other

    cs.LG stat.ML

    Auto-Encoding Total Correlation Explanation

    Authors: Shuyang Gao, Rob Brekelmans, Greg Ver Steeg, Aram Galstyan

    Abstract: Advances in unsupervised learning enable reconstruction and generation of samples from complex distributions, but this success is marred by the inscrutability of the representations learned. We propose an information-theoretic approach to characterizing disentanglement and dependence in representation learning using multivariate mutual information, also called total correlation. The principle of t… ▽ More

    Submitted 15 February, 2018; originally announced February 2018.

  18. arXiv:1710.03839  [pdf, other

    cs.LG cs.IT

    Disentangled Representations via Synergy Minimization

    Authors: Greg Ver Steeg, Rob Brekelmans, Hrayr Harutyunyan, Aram Galstyan

    Abstract: Scientists often seek simplified representations of complex systems to facilitate prediction and understanding. If the factors comprising a representation allow us to make accurate predictions about our system, but obscuring any subset of the factors destroys our ability to make predictions, we say that the representation exhibits informational synergy. We argue that synergy is an undesirable feat… ▽ More

    Submitted 10 October, 2017; originally announced October 2017.

    Comments: 8 pages, 4 figures, 55th Annual Allerton Conference on Communication, Control, and Computing, 2017