Zum Hauptinhalt springen

Showing 1–10 of 10 results for author: Kucukelbir, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.07658  [pdf, other

    cs.LG stat.ML

    Treeffuser: Probabilistic Predictions via Conditional Diffusions with Gradient-Boosted Trees

    Authors: Nicolas Beltran-Velez, Alessandro Antonio Grande, Achille Nazaret, Alp Kucukelbir, David Blei

    Abstract: Probabilistic prediction aims to compute predictive distributions rather than single-point predictions. These distributions enable practitioners to quantify uncertainty, compute risk, and detect outliers. However, most probabilistic methods assume parametric responses, such as Gaussian or Poisson distributions. When these assumptions fail, such models lead to bad predictions and poorly calibrated… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2006.07549  [pdf, other

    cs.LG stat.ML

    Hindsight Expectation Maximization for Goal-conditioned Reinforcement Learning

    Authors: Yunhao Tang, Alp Kucukelbir

    Abstract: We propose a graphical model framework for goal-conditioned RL, with an EM algorithm that operates on the lower bound of the RL objective. The E-step provides a natural interpretation of how 'learning in hindsight' techniques, such as HER, to handle extremely sparse goal-conditioned rewards. The M-step reduces policy optimization to supervised learning updates, which greatly stabilizes end-to-end… ▽ More

    Submitted 26 February, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: Accepted at International Conference on Artificial Intelligence and Statistics (AISTATS), 2021

  3. arXiv:1906.08868  [pdf, other

    cs.NE cs.LG stat.ML

    Variance Reduction for Evolution Strategies via Structured Control Variates

    Authors: Yunhao Tang, Krzysztof Choromanski, Alp Kucukelbir

    Abstract: Evolution Strategies (ES) are a powerful class of blackbox optimization techniques that recently became a competitive alternative to state-of-the-art policy gradient (PG) algorithms for reinforcement learning (RL). We propose a new method for improving accuracy of the ES algorithms, that as opposed to recent approaches utilizing only Monte Carlo structure of the gradient estimator, takes advantage… ▽ More

    Submitted 13 March, 2020; v1 submitted 29 May, 2019; originally announced June 2019.

    Comments: Accepted to AISTATS (International Conference on Artificial Intelligence and Statistics), 2020 in Palermo, Italy

  4. arXiv:1711.11225  [pdf, other

    cs.LG cs.AI stat.ML

    Variational Deep Q Network

    Authors: Yunhao Tang, Alp Kucukelbir

    Abstract: We propose a framework that directly tackles the probability distribution of the value function parameters in Deep Q Network (DQN), with powerful variational inference subroutines to approximate the posterior of the parameters. We will establish the equivalence between our proposed surrogate objective and variational inference loss. Our new algorithm achieves efficient exploration and performs wel… ▽ More

    Submitted 29 November, 2017; originally announced November 2017.

    Comments: 12 pages, 5 figures, Second workshop on Bayesian Deep Learning (NIPS 2017)

  5. arXiv:1610.09787  [pdf, other

    stat.CO cs.AI cs.PL stat.AP stat.ML

    Edward: A library for probabilistic modeling, inference, and criticism

    Authors: Dustin Tran, Alp Kucukelbir, Adji B. Dieng, Maja Rudolph, Dawen Liang, David M. Blei

    Abstract: Probabilistic modeling is a powerful approach for analyzing empirical information. We describe Edward, a library for probabilistic modeling. Edward's design reflects an iterative process pioneered by George Box: build a model of a phenomenon, make inferences about the model given data, and criticize the model's fit to the data. Edward supports a broad class of probabilistic models, efficient algor… ▽ More

    Submitted 31 January, 2017; v1 submitted 31 October, 2016; originally announced October 2016.

  6. arXiv:1606.03860  [pdf, other

    stat.ML cs.AI cs.LG

    Robust Probabilistic Modeling with Bayesian Data Reweighting

    Authors: Yixin Wang, Alp Kucukelbir, David M. Blei

    Abstract: Probabilistic models analyze data by relying on a set of assumptions. Data that exhibit deviations from these assumptions can undermine inference and prediction quality. Robust models offer protection against mismatch between a model's assumptions and reality. We propose a way to systematically detect and mitigate mismatch of a large class of probabilistic models. The idea is to raise the likeliho… ▽ More

    Submitted 19 June, 2018; v1 submitted 13 June, 2016; originally announced June 2016.

    Comments: In ICML 2017. Updated related work

  7. arXiv:1605.07604  [pdf, other

    stat.ML cs.AI stat.CO

    Posterior Dispersion Indices

    Authors: Alp Kucukelbir, David M. Blei

    Abstract: Probabilistic modeling is cyclical: we specify a model, infer its posterior, and evaluate its performance. Evaluation drives the cycle, as we revise our model based on how it performs. This requires a metric. Traditionally, predictive accuracy prevails. Yet, predictive accuracy does not tell the whole story. We propose to evaluate a model through posterior dispersion. The idea is to analyze how ea… ▽ More

    Submitted 24 May, 2016; originally announced May 2016.

  8. arXiv:1603.00788  [pdf, other

    stat.ML cs.AI cs.LG stat.CO

    Automatic Differentiation Variational Inference

    Authors: Alp Kucukelbir, Dustin Tran, Rajesh Ranganath, Andrew Gelman, David M. Blei

    Abstract: Probabilistic modeling is iterative. A scientist posits a simple model, fits it to her data, refines it according to her analysis, and repeats. However, fitting complex models to large data is a bottleneck in this process. Deriving algorithms for new models can be both mathematically and computationally challenging, which makes it difficult to efficiently cycle through the steps. To this end, we d… ▽ More

    Submitted 2 March, 2016; originally announced March 2016.

  9. arXiv:1601.00670  [pdf, other

    stat.CO cs.LG stat.ML

    Variational Inference: A Review for Statisticians

    Authors: David M. Blei, Alp Kucukelbir, Jon D. McAuliffe

    Abstract: One of the core problems of modern statistics is to approximate difficult-to-compute probability densities. This problem is especially important in Bayesian statistics, which frames all inference about unknown quantities as a calculation involving the posterior density. In this paper, we review variational inference (VI), a method from machine learning that approximates probability densities throu… ▽ More

    Submitted 9 May, 2018; v1 submitted 4 January, 2016; originally announced January 2016.

    Journal ref: Journal of the American Statistical Association, Vol. 112 , Iss. 518, 2017

  10. arXiv:1411.0292  [pdf

    stat.ML cs.LG

    Population Empirical Bayes

    Authors: Alp Kucukelbir, David M. Blei

    Abstract: Bayesian predictive inference analyzes a dataset to make predictions about new observations. When a model does not match the data, predictive accuracy suffers. We develop population empirical Bayes (POP-EB), a hierarchical framework that explicitly models the empirical population distribution as part of Bayesian analysis. We introduce a new concept, the latent dataset, as a hierarchical variable a… ▽ More

    Submitted 8 June, 2015; v1 submitted 2 November, 2014; originally announced November 2014.

    Comments: UAI 2015