Zum Hauptinhalt springen

Showing 1–21 of 21 results for author: Gutmann, M U

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.03069  [pdf, other

    cs.LG stat.ML

    Improving Variational Autoencoder Estimation from Incomplete Data with Mixture Variational Families

    Authors: Vaidotas Simkus, Michael U. Gutmann

    Abstract: We consider the task of estimating variational autoencoders (VAEs) when the training data is incomplete. We show that missing data increases the complexity of the model's posterior distribution over the latent variables compared to the fully-observed case. The increased complexity may adversely affect the fit of the model due to a mismatch between the variational and model posterior distributions.… ▽ More

    Submitted 27 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Published in Transactions on Machine Learning Research (TMLR), 2024

    MSC Class: 62D10 ACM Class: I.2.6; G.3

  2. arXiv:2308.09078  [pdf, other

    cs.LG stat.ML

    Conditional Sampling of Variational Autoencoders via Iterated Approximate Ancestral Sampling

    Authors: Vaidotas Simkus, Michael U. Gutmann

    Abstract: Conditional sampling of variational autoencoders (VAEs) is needed in various applications, such as missing data imputation, but is computationally intractable. A principled choice for asymptotically exact conditional sampling is Metropolis-within-Gibbs (MWG). However, we observe that the tendency of VAEs to learn a structured latent space, a commonly desired property, can cause the MWG sampler to… ▽ More

    Submitted 8 November, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: Published in Transactions on Machine Learning Research (TMLR), 2023

    MSC Class: 62D10 ACM Class: G.3

  3. arXiv:2305.07721  [pdf, other

    cs.LG stat.ME

    Designing Optimal Behavioral Experiments Using Machine Learning

    Authors: Simon Valentin, Steven Kleinegesse, Neil R. Bramley, Peggy Seriès, Michael U. Gutmann, Christopher G. Lucas

    Abstract: Computational models are powerful tools for understanding human cognition and behavior. They let us express our theories clearly and precisely, and offer predictions that can be subtle and often counter-intuitive. However, this same richness and ability to surprise means our scientific intuitions and traditional tools are ill-suited to designing experiments to test and compare these models. To avo… ▽ More

    Submitted 26 November, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: Accepted in eLife

  4. arXiv:2305.00869  [pdf, other

    stat.ML cs.LG

    Estimating the Density Ratio between Distributions with High Discrepancy using Multinomial Logistic Regression

    Authors: Akash Srivastava, Seungwook Han, Kai Xu, Benjamin Rhodes, Michael U. Gutmann

    Abstract: Functions of the ratio of the densities $p/q$ are widely used in machine learning to quantify the discrepancy between the two distributions $p$ and $q$. For high-dimensional distributions, binary classification-based density ratio estimators have shown great promise. However, when densities are well separated, estimating the density ratio with a binary classifier is challenging. In this work, we s… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Journal ref: TMLR 2023

  5. arXiv:2208.02704  [pdf, other

    cs.LG stat.ML

    Bayesian Optimization with Informative Covariance

    Authors: Afonso Eduardo, Michael U. Gutmann

    Abstract: Bayesian optimization is a methodology for global optimization of unknown and expensive objectives. It combines a surrogate Bayesian regression model with an acquisition function to decide where to evaluate the objective. Typical regression models are given by Gaussian processes with stationary covariance functions. However, these functions are unable to express prior input-dependent information,… ▽ More

    Submitted 1 April, 2023; v1 submitted 4 August, 2022; originally announced August 2022.

    Journal ref: Transactions on Machine Learning Research (TMLR), 2023. URL: https://openreview.net/forum?id=JwgVBv18RG

  6. arXiv:2206.13446  [pdf, other

    cs.LG stat.ML

    Pen and Paper Exercises in Machine Learning

    Authors: Michael U. Gutmann

    Abstract: This is a collection of (mostly) pen-and-paper exercises in machine learning. The exercises are on the following topics: linear algebra, optimisation, directed graphical models, undirected graphical models, expressive power of graphical models, factor graphs and message passing, inference for hidden Markov models, model-based learning (including ICA and unnormalised models), sampling and Monte-Car… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: The associated github page is https://github.com/michaelgutmann/ml-pen-and-paper-exercises

  7. arXiv:2204.13999  [pdf, other

    cs.LG stat.ML

    Statistical applications of contrastive learning

    Authors: Michael U. Gutmann, Steven Kleinegesse, Benjamin Rhodes

    Abstract: The likelihood function plays a crucial role in statistical inference and experimental design. However, it is computationally intractable for several important classes of statistical models, including energy-based models and simulator-based models. Contrastive learning is an intuitive and computationally feasible alternative to likelihood-based learning. We here first provide an introduction to co… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: Accepted to Behaviormetrika

  8. arXiv:2111.13180  [pdf, other

    cs.LG stat.ME stat.ML

    Variational Gibbs Inference for Statistical Model Estimation from Incomplete Data

    Authors: Vaidotas Simkus, Benjamin Rhodes, Michael U. Gutmann

    Abstract: Statistical models are central to machine learning with broad applicability across a range of downstream tasks. The models are controlled by free parameters that are typically estimated from data by maximum-likelihood estimation or approximations thereof. However, when faced with real-world data sets many of the models run into a critical issue: they are formulated in terms of fully-observed data,… ▽ More

    Submitted 15 August, 2023; v1 submitted 25 November, 2021; originally announced November 2021.

    Comments: Published at Journal of Machine Learning Research (JMLR)

    MSC Class: 62D10 ACM Class: I.2.6; G.3

    Journal ref: Journal of Machine Learning Research, 24(196), 1-72, 2023

  9. arXiv:2111.02329  [pdf, other

    stat.ML cs.AI cs.LG stat.CO

    Implicit Deep Adaptive Design: Policy-Based Experimental Design without Likelihoods

    Authors: Desi R. Ivanova, Adam Foster, Steven Kleinegesse, Michael U. Gutmann, Tom Rainforth

    Abstract: We introduce implicit Deep Adaptive Design (iDAD), a new method for performing adaptive experiments in real-time with implicit models. iDAD amortizes the cost of Bayesian optimal experimental design (BOED) by learning a design policy network upfront, which can then be deployed quickly at the time of the experiment. The iDAD network can be trained on any model which simulates differentiable samples… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 33 pages, 8 figures. Published as a conference paper at NeurIPS 2021

  10. arXiv:2110.15632  [pdf, other

    cs.LG

    Bayesian Optimal Experimental Design for Simulator Models of Cognition

    Authors: Simon Valentin, Steven Kleinegesse, Neil R. Bramley, Michael U. Gutmann, Christopher G. Lucas

    Abstract: Bayesian optimal experimental design (BOED) is a methodology to identify experiments that are expected to yield informative data. Recent work in cognitive science considered BOED for computational models of human behavior with tractable and known likelihood functions. However, tractability often comes at the cost of realism; simulator models that can capture the richness of human behavior are ofte… ▽ More

    Submitted 29 October, 2021; originally announced October 2021.

    Comments: Accepted as a poster at the NeurIPS 2021 Workshop "AI for Science"

  11. arXiv:2105.04379  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Gradient-based Bayesian Experimental Design for Implicit Models using Mutual Information Lower Bounds

    Authors: Steven Kleinegesse, Michael U. Gutmann

    Abstract: We introduce a framework for Bayesian experimental design (BED) with implicit models, where the data-generating distribution is intractable but sampling from it is still possible. In order to find optimal experimental designs for such models, our approach maximises mutual information lower bounds that are parametrised by neural networks. By training a neural network on sampled data, we simultaneou… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: Under review

    MSC Class: 62K05;

  12. arXiv:2006.12204  [pdf, other

    stat.ML cs.LG

    Telescoping Density-Ratio Estimation

    Authors: Benjamin Rhodes, Kai Xu, Michael U. Gutmann

    Abstract: Density-ratio estimation via classification is a cornerstone of unsupervised learning. It has provided the foundation for state-of-the-art methods in representation learning and generative modelling, with the number of use-cases continuing to proliferate. However, it suffers from a critical limitation: it fails to accurately estimate ratios p/q for which the two densities differ significantly. Emp… ▽ More

    Submitted 24 November, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

  13. arXiv:2003.09379  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Sequential Bayesian Experimental Design for Implicit Models via Mutual Information

    Authors: Steven Kleinegesse, Christopher Drovandi, Michael U. Gutmann

    Abstract: Bayesian experimental design (BED) is a framework that uses statistical models and decision making under uncertainty to optimise the cost and performance of a scientific experiment. Sequential BED, as opposed to static BED, considers the scenario where we can sequentially update our beliefs about the model parameters through data gathered in the experiment. A class of models of particular interest… ▽ More

    Submitted 20 March, 2020; originally announced March 2020.

    MSC Class: 62K05; 62L05

  14. arXiv:2002.08129  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Bayesian Experimental Design for Implicit Models by Mutual Information Neural Estimation

    Authors: Steven Kleinegesse, Michael U. Gutmann

    Abstract: Implicit stochastic models, where the data-generation distribution is intractable but sampling is possible, are ubiquitous in the natural sciences. The models typically have free parameters that need to be inferred from data collected in scientific experiments. A fundamental question is how to design the experiments so that the collected data are most useful. The field of Bayesian experimental des… ▽ More

    Submitted 14 August, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: Accepted at the thirty-seventh International Conference on Machine Learning (ICML) 2020. Camera-ready version

    MSC Class: 62K05 (Primary) ACM Class: G.3

  15. arXiv:1904.02431  [pdf, other

    cs.RO

    To Stir or Not to Stir: Online Estimation of Liquid Properties for Pouring Actions

    Authors: Tatiana Lopez-Guevara, Rita Pucci, Nicholas Taylor, Michael U. Gutmann, Subramanian Ramamoorthy, Kartic Subr

    Abstract: Our brains are able to exploit coarse physical models of fluids to solve everyday manipulation tasks. There has been considerable interest in developing such a capability in robots so that they can autonomously manipulate fluids adapting to different conditions. In this paper, we investigate the problem of adaptation to liquids with different characteristics. We develop a simple calibration task (… ▽ More

    Submitted 4 April, 2019; originally announced April 2019.

    Comments: Presented at the Modeling the Physical World: Perception, Learning, and Control Workshop (NeurIPS) 2018

  16. arXiv:1904.00670  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Robust Optimisation Monte Carlo

    Authors: Borislav Ikonomov, Michael U. Gutmann

    Abstract: This paper is on Bayesian inference for parametric statistical models that are defined by a stochastic simulator which specifies how data is generated. Exact sampling is then possible but evaluating the likelihood function is typically prohibitively expensive. Approximate Bayesian Computation (ABC) is a framework to perform approximate inference in such situations. While basic ABC algorithms are w… ▽ More

    Submitted 28 February, 2020; v1 submitted 1 April, 2019; originally announced April 2019.

    Comments: 8 pages + 6 page appendix; v2: made clarifications, added a second possible algorithm implementation and its results; v3: small clarifications, to be published in AISTATS 2020

  17. arXiv:1902.10704  [pdf, other

    stat.CO cs.LG stat.ME

    Adaptive Gaussian Copula ABC

    Authors: Yanzhi Chen, Michael U. Gutmann

    Abstract: Approximate Bayesian computation (ABC) is a set of techniques for Bayesian inference when the likelihood is intractable but sampling from the model is possible. This work presents a simple yet effective ABC algorithm based on the combination of two classical ABC approaches --- regression ABC and sequential ABC. The key idea is that rather than learning the posterior directly, we first target anoth… ▽ More

    Submitted 27 February, 2019; originally announced February 2019.

    Comments: 8 pages, 5 figures, accepted to AISTATS 2019

  18. arXiv:1810.09899  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Dynamic Likelihood-free Inference via Ratio Estimation (DIRE)

    Authors: Traiko Dinev, Michael U. Gutmann

    Abstract: Parametric statistical models that are implicitly defined in terms of a stochastic data generating process are used in a wide range of scientific disciplines because they enable accurate modeling. However, learning the parameters from observed data is generally very difficult because their likelihood function is typically intractable. Likelihood-free Bayesian inference methods have been proposed w… ▽ More

    Submitted 23 October, 2018; originally announced October 2018.

    Comments: For a demo, see https://traiko.com/pages/research/lfire/

  19. arXiv:1806.03664  [pdf, other

    stat.ML cs.LG

    Conditional Noise-Contrastive Estimation of Unnormalised Models

    Authors: Ciwan Ceylan, Michael U. Gutmann

    Abstract: Many parametric statistical models are not properly normalised and only specified up to an intractable partition function, which renders parameter estimation difficult. Examples of unnormalised models are Gibbs distributions, Markov random fields, and neural network models in unsupervised deep learning. In previous work, the estimation principle called noise-contrastive estimation (NCE) was introd… ▽ More

    Submitted 10 June, 2018; originally announced June 2018.

    Comments: Accepted to ICML 2018

  20. arXiv:1806.00101  [pdf, other

    stat.ML cs.LG

    Generative Ratio Matching Networks

    Authors: Akash Srivastava, Kai Xu, Michael U. Gutmann, Charles Sutton

    Abstract: Deep generative models can learn to generate realistic-looking images, but many of the most effective methods are adversarial and involve a saddlepoint optimization, which requires a careful balancing of training between a generator network and a critic network. Maximum mean discrepancy networks (MMD-nets) avoid this issue by using kernel as a fixed adversary, but unfortunately, they have not on t… ▽ More

    Submitted 14 February, 2020; v1 submitted 31 May, 2018; originally announced June 2018.

    Comments: ICLR 2020; Code: https://github.com/GRAM-nets

  21. arXiv:1708.00707  [pdf, other

    stat.ML cs.MS stat.CO

    ELFI: Engine for Likelihood-Free Inference

    Authors: Jarno Lintusaari, Henri Vuollekoski, Antti Kangasrääsiö, Kusti Skytén, Marko Järvenpää, Pekka Marttinen, Michael U. Gutmann, Aki Vehtari, Jukka Corander, Samuel Kaski

    Abstract: Engine for Likelihood-Free Inference (ELFI) is a Python software library for performing likelihood-free inference (LFI). ELFI provides a convenient syntax for arranging components in LFI, such as priors, simulators, summaries or distances, to a network called ELFI graph. The components can be implemented in a wide variety of languages. The stand-alone ELFI graph can be used with any of the availab… ▽ More

    Submitted 5 July, 2018; v1 submitted 2 August, 2017; originally announced August 2017.

    Journal ref: Journal of Machine Learning Research, 19(16):1-7, 2018. http://jmlr.org/papers/v19/17-374.html