Zum Hauptinhalt springen

Showing 1–22 of 22 results for author: Vehtari, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.11191  [pdf, other

    cs.LG physics.data-an

    Active Learning of Molecular Data for Task-Specific Objectives

    Authors: Kunal Ghosh, Milica Todorović, Aki Vehtari, Patrick Rinke

    Abstract: Active learning (AL) has shown promise for being a particularly data-efficient machine learning approach. Yet, its performance depends on the application and it is not clear when AL practitioners can expect computational savings. Here, we carry out a systematic AL performance assessment for three diverse molecular datasets and two common scientific tasks: compiling compact, informative datasets an… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  2. arXiv:2203.15945  [pdf, other

    stat.ML cs.LG stat.ME

    A Framework for Improving the Reliability of Black-box Variational Inference

    Authors: Manushi Welandawe, Michael Riis Andersen, Aki Vehtari, Jonathan H. Huggins

    Abstract: Black-box variational inference (BBVI) now sees widespread use in machine learning and statistics as a fast yet flexible alternative to Markov chain Monte Carlo methods for approximate Bayesian inference. However, stochastic optimization methods for BBVI remain unreliable and require substantial expertise and hand-tuning to apply effectively. In this paper, we propose Robust and Automated Black-bo… ▽ More

    Submitted 16 May, 2024; v1 submitted 29 March, 2022; originally announced March 2022.

  3. arXiv:2108.03782  [pdf, other

    stat.ML cs.LG

    Pathfinder: Parallel quasi-Newton variational inference

    Authors: Lu Zhang, Bob Carpenter, Andrew Gelman, Aki Vehtari

    Abstract: We propose Pathfinder, a variational method for approximately sampling from differentiable log densities. Starting from a random initialization, Pathfinder locates normal approximations to the target density along a quasi-Newton optimization path, with local covariance estimated using the inverse Hessian estimates produced by the optimizer. Pathfinder returns draws from the approximation with the… ▽ More

    Submitted 16 May, 2022; v1 submitted 8 August, 2021; originally announced August 2021.

    Comments: 46 pages, 21 figures

  4. arXiv:2103.01085  [pdf, other

    cs.LG stat.ME stat.ML

    Challenges and Opportunities in High-dimensional Variational Inference

    Authors: Akash Kumar Dhaka, Alejandro Catalina, Manushi Welandawe, Michael Riis Andersen, Jonathan Huggins, Aki Vehtari

    Abstract: Current black-box variational inference (BBVI) methods require the user to make numerous design choices -- such as the selection of variational objective and approximating family -- yet there is little principled guidance on how to do so. We develop a conceptual framework and set of experimental tools to understand the effects of these choices, which we leverage to propose best practices for maxim… ▽ More

    Submitted 30 June, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

  5. arXiv:2101.08954  [pdf, other

    stat.ME cs.LG stat.ML

    Bayesian hierarchical stacking: Some models are (somewhere) useful

    Authors: Yuling Yao, Gregor Pirš, Aki Vehtari, Andrew Gelman

    Abstract: Stacking is a widely used model averaging technique that asymptotically yields optimal predictions among linear averages. We show that stacking is most effective when model predictive performance is heterogeneous in inputs, and we can further improve the stacked mixture with a hierarchical model. We generalize stacking to Bayesian hierarchical stacking. The model weights are varying as a function… ▽ More

    Submitted 20 May, 2021; v1 submitted 22 January, 2021; originally announced January 2021.

    Comments: minor revision

    Journal ref: Bayesian Analysis. (2021)

  6. arXiv:2012.15471  [pdf, other

    cs.LG

    Good practices for Bayesian Optimization of high dimensional structured spaces

    Authors: Eero Siivola, Javier Gonzalez, Andrei Paleyes, Aki Vehtari

    Abstract: The increasing availability of structured but high dimensional data has opened new opportunities for optimization. One emerging and promising avenue is the exploration of unsupervised methods for projecting structured high dimensional data into low dimensional continuous representations, simplifying the optimization problem and enabling the application of traditional optimization methods. However,… ▽ More

    Submitted 6 January, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

  7. arXiv:2009.00666  [pdf, other

    cs.LG stat.ME stat.ML

    Robust, Accurate Stochastic Optimization for Variational Inference

    Authors: Akash Kumar Dhaka, Alejandro Catalina, Michael Riis Andersen, Måns Magnusson, Jonathan H. Huggins, Aki Vehtari

    Abstract: We consider the problem of fitting variational posterior approximations using stochastic optimization methods. The performance of these approximations depends on (1) how well the variational family matches the true posterior distribution,(2) the choice of divergence, and (3) the optimization of the variational objective. We show that even in the best-case scenario when the exact posterior belongs… ▽ More

    Submitted 3 September, 2020; v1 submitted 1 September, 2020; originally announced September 2020.

    Journal ref: NeurIPS 2020

  8. arXiv:2003.11435  [pdf, other

    cs.LG stat.ML

    Preferential Batch Bayesian Optimization

    Authors: Eero Siivola, Akash Kumar Dhaka, Michael Riis Andersen, Javier Gonzalez, Pablo Garcia Moreno, Aki Vehtari

    Abstract: Most research in Bayesian optimization (BO) has focused on \emph{direct feedback} scenarios, where one has access to exact values of some expensive-to-evaluate objective. This direction has been mainly driven by the use of BO in machine learning hyper-parameter configuration problems. However, in domains such as modelling human preferences, A/B tests, or recommender systems, there is a need for me… ▽ More

    Submitted 31 August, 2021; v1 submitted 25 March, 2020; originally announced March 2020.

    Comments: 6 pages + 7 pages in supplementary material

  9. arXiv:1912.03549  [pdf, other

    stat.ML cs.LG q-bio.QM stat.ME

    lgpr: An interpretable nonparametric method for inferring covariate effects from longitudinal data

    Authors: Juho Timonen, Henrik Mannerström, Aki Vehtari, Harri Lähdesmäki

    Abstract: Longitudinal study designs are indispensable for studying disease progression. Inferring covariate effects from longitudinal data, however, requires interpretable methods that can model complicated covariance structures and detect nonlinear effects of both categorical and continuous covariates, as well as their interactions. Detecting disease effects is hindered by the fact that they often occur r… ▽ More

    Submitted 8 October, 2020; v1 submitted 7 December, 2019; originally announced December 2019.

    Comments: Contains main manuscript and supplementary material. Tables S1-S3 are in ancillary files

    Journal ref: Bioinformatics, 37(13), 2021, 1860-1867

  10. arXiv:1910.09358  [pdf, other

    cs.LG cs.AI cs.HC stat.ML

    A Decision-Theoretic Approach for Model Interpretability in Bayesian Framework

    Authors: Homayun Afrabandpey, Tomi Peltola, Juho Piironen, Aki Vehtari, Samuel Kaski

    Abstract: A salient approach to interpretable machine learning is to restrict modeling to simple models. In the Bayesian framework, this can be pursued by restricting the model structure and prior to favor interpretable models. Fundamentally, however, interpretability is about users' preferences, not the data generation mechanism; it is more natural to formulate interpretability as a utility function. In th… ▽ More

    Submitted 3 August, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: This version contains more experiments including a comparison with baseline methods from the literature and complemented some of the existing results in the previous version

    Journal ref: Machine Learning (2020)

  11. arXiv:1910.06121  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Batch simulations and uncertainty quantification in Gaussian process surrogate approximate Bayesian computation

    Authors: Marko Järvenpää, Aki Vehtari, Pekka Marttinen

    Abstract: The computational efficiency of approximate Bayesian computation (ABC) has been improved by using surrogate models such as Gaussian processes (GP). In one such promising framework the discrepancy between the simulated and observed data is modelled with a GP which is further used to form a model-based estimator for the intractable posterior. In this article we improve this approach in several ways.… ▽ More

    Submitted 6 August, 2020; v1 submitted 14 October, 2019; originally announced October 2019.

    Comments: Minor improvements and clarifications to the text over the previous version. 20 pages, 15 figures

  12. arXiv:1905.01252  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Parallel Gaussian process surrogate Bayesian inference with noisy likelihood evaluations

    Authors: Marko Järvenpää, Michael Gutmann, Aki Vehtari, Pekka Marttinen

    Abstract: We consider Bayesian inference when only a limited number of noisy log-likelihood evaluations can be obtained. This occurs for example when complex simulator-based statistical models are fitted to data, and synthetic likelihood (SL) method is used to form the noisy log-likelihood estimates using computationally costly forward simulations. We frame the inference task as a sequential Bayesian experi… ▽ More

    Submitted 6 March, 2020; v1 submitted 3 May, 2019; originally announced May 2019.

    Comments: Minor changes to the text. 37 pages, 18 figures

  13. arXiv:1904.10679  [pdf, other

    stat.ML cs.LG

    Bayesian leave-one-out cross-validation for large data

    Authors: Måns Magnusson, Michael Riis Andersen, Johan Jonasson, Aki Vehtari

    Abstract: Model inference, such as model comparison, model checking, and model selection, is an important part of model development. Leave-one-out cross-validation (LOO) is a general approach for assessing the generalizability of a model, but unfortunately, LOO does not scale well to large datasets. We propose a combination of using approximate inference techniques and probability-proportional-to-size-sampl… ▽ More

    Submitted 24 April, 2019; originally announced April 2019.

    Comments: Accepted to ICML 2019. This version is the submitted paper

    Journal ref: Thirty-sixth International Conference on Machine Learning, PMLR 97:4244-4253, 2019

  14. arXiv:1904.05268  [pdf, other

    stat.ML cs.LG

    Active Learning for Decision-Making from Imbalanced Observational Data

    Authors: Iiris Sundin, Peter Schulam, Eero Siivola, Aki Vehtari, Suchi Saria, Samuel Kaski

    Abstract: Machine learning can help personalized decision support by learning models to predict individual treatment effects (ITE). This work studies the reliability of prediction-based decision-making in a task of deciding which action $a$ to take for a target unit after observing its covariates $\tilde{x}$ and predicted outcomes $\hat{p}(\tilde{y} \mid \tilde{x}, a)$. An example case is personalized medic… ▽ More

    Submitted 6 June, 2019; v1 submitted 10 April, 2019; originally announced April 2019.

    Comments: Published in Proceedings of the 36th International Conference on Machine Learning (ICML) 2019. 15 pages (10 paper + 5 supplementary), 7 figures

  15. arXiv:1810.02406  [pdf, other

    stat.ML cs.LG

    Projective Inference in High-dimensional Problems: Prediction and Feature Selection

    Authors: Juho Piironen, Markus Paasiniemi, Aki Vehtari

    Abstract: This paper discusses predictive inference and feature selection for generalized linear models with scarce but high-dimensional data. We argue that in many cases one can benefit from a decision theoretically justified two-stage approach: first, construct a possibly non-sparse model that predicts well, and then find a minimal subset of features that characterize the predictions. The model built in t… ▽ More

    Submitted 4 October, 2018; originally announced October 2018.

    Journal ref: Electronic Journal of Statistics, 14(1):2155-2197, 2020. https://projecteuclid.org/euclid.ejs/1589335310

  16. arXiv:1710.04881  [pdf, other

    cs.HC cs.LG stat.ML

    User Modelling for Avoiding Overfitting in Interactive Knowledge Elicitation for Prediction

    Authors: Pedram Daee, Tomi Peltola, Aki Vehtari, Samuel Kaski

    Abstract: In human-in-the-loop machine learning, the user provides information beyond that in the training data. Many algorithms and user interfaces have been designed to optimize and facilitate this human--machine interaction; however, fewer studies have addressed the potential defects the designs can cause. Effective interaction often requires exposing the user to the training data or its statistics. The… ▽ More

    Submitted 8 March, 2018; v1 submitted 13 October, 2017; originally announced October 2017.

    Comments: 9 pages, 2 figures. The paper is published in the proceedings of IUI 2018. Codes and data available at https://github.com/HIIT/human-overfitting-in-IML

    ACM Class: H.1.2; I.2.6; H.3.3

  17. arXiv:1708.00707  [pdf, other

    stat.ML cs.MS stat.CO

    ELFI: Engine for Likelihood-Free Inference

    Authors: Jarno Lintusaari, Henri Vuollekoski, Antti Kangasrääsiö, Kusti Skytén, Marko Järvenpää, Pekka Marttinen, Michael U. Gutmann, Aki Vehtari, Jukka Corander, Samuel Kaski

    Abstract: Engine for Likelihood-Free Inference (ELFI) is a Python software library for performing likelihood-free inference (LFI). ELFI provides a convenient syntax for arranging components in LFI, such as priors, simulators, summaries or distances, to a network called ELFI graph. The components can be implemented in a wide variety of languages. The stand-alone ELFI graph can be used with any of the availab… ▽ More

    Submitted 5 July, 2018; v1 submitted 2 August, 2017; originally announced August 2017.

    Journal ref: Journal of Machine Learning Research, 19(16):1-7, 2018. http://jmlr.org/papers/v19/17-374.html

  18. arXiv:1604.05263  [pdf, other

    stat.ML cs.LG

    Chained Gaussian Processes

    Authors: Alan D. Saul, James Hensman, Aki Vehtari, Neil D. Lawrence

    Abstract: Gaussian process models are flexible, Bayesian non-parametric approaches to regression. Properties of multivariate Gaussians mean that they can be combined linearly in the manner of additive models and via a link function (like in generalized linear models) to handle non-Gaussian data. However, the link function formalism is restrictive, link functions are always invertible and must convert a para… ▽ More

    Submitted 18 April, 2016; originally announced April 2016.

    Comments: Appearing in Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS) 2016, Cadiz, Spain

  19. Comparison of Bayesian predictive methods for model selection

    Authors: Juho Piironen, Aki Vehtari

    Abstract: The goal of this paper is to compare several widely used Bayesian model selection methods in practical model selection problems, highlight their differences and give recommendations about the preferred approaches. We focus on the variable subset selection for regression and classification and perform several numerical experiments using both simulated and real world data. The results show that the… ▽ More

    Submitted 23 March, 2016; v1 submitted 30 March, 2015; originally announced March 2015.

    Comments: A few minor changes; added a few sentences, corrected some grammatical errors and modified Figure 7

    Journal ref: Statistics and Computing, 2017, Volume 27, Issue 3, 711-735

  20. arXiv:1206.5754  [pdf, other

    stat.ML cs.AI cs.MS

    Bayesian Modeling with Gaussian Processes using the GPstuff Toolbox

    Authors: Jarno Vanhatalo, Jaakko Riihimäki, Jouni Hartikainen, Pasi Jylänki, Ville Tolvanen, Aki Vehtari

    Abstract: Gaussian processes (GP) are powerful tools for probabilistic modeling purposes. They can be used to define prior distributions over latent functions in hierarchical Bayesian models. The prior over functions is defined implicitly by the mean and covariance function, which determine the smoothness and variability of the function. The inference can then be conducted directly in the function space by… ▽ More

    Submitted 15 July, 2015; v1 submitted 25 June, 2012; originally announced June 2012.

    Comments: - Updated according to GPstuff 4.6. Added, e.g., Pareto smoothed importance sampling

  21. arXiv:1206.3290  [pdf

    cs.LG stat.ML

    Modelling local and global phenomena with sparse Gaussian processes

    Authors: Jarno Vanhatalo, Aki Vehtari

    Abstract: Much recent work has concerned sparse approximations to speed up the Gaussian process regression from the unfavorable O(n3) scaling in computational time to O(nm2). Thus far, work has concentrated on models with one covariance function. However, in many practical situations additive models with multiple covariance functions may perform better, since the data may contain both long and short length-… ▽ More

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI2008)

    Report number: UAI-P-2008-PG-571-578

  22. arXiv:1203.3524  [pdf

    stat.ML cs.LG

    Speeding up the binary Gaussian process classification

    Authors: Jarno Vanhatalo, Aki Vehtari

    Abstract: Gaussian processes (GP) are attractive building blocks for many probabilistic models. Their drawbacks, however, are the rapidly increasing inference time and memory requirement alongside increasing data. The problem can be alleviated with compactly supported (CS) covariance functions, which produce sparse covariance matrices that are fast in computations and cheap to store. CS functions have previ… ▽ More

    Submitted 15 March, 2012; originally announced March 2012.

    Comments: Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

    Report number: UAI-P-2010-PG-623-631