Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Tatzel, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03334  [pdf, other

    cs.LG stat.ML

    Reparameterization invariance in approximate Bayesian inference

    Authors: Hrittik Roy, Marco Miani, Carl Henrik Ek, Philipp Hennig, Marvin Pförtner, Lukas Tatzel, Søren Hauberg

    Abstract: Current approximate posteriors in Bayesian neural networks (BNNs) exhibit a crucial limitation: they fail to maintain invariance under reparameterization, i.e. BNNs assign different posterior densities to different parametrizations of identical functions. This creates a fundamental flaw in the application of Bayesian principles as it breaks the correspondence between uncertainty over the parameter… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2310.20285  [pdf, other

    cs.LG stat.ML

    Accelerating Generalized Linear Models by Trading off Computation for Uncertainty

    Authors: Lukas Tatzel, Jonathan Wenger, Frank Schneider, Philipp Hennig

    Abstract: Bayesian Generalized Linear Models (GLMs) define a flexible probabilistic framework to model categorical, ordinal and continuous data, and are widely used in practice. However, exact inference in GLMs is prohibitively expensive for large datasets, thus requiring approximations in practice. The resulting approximation error adversely impacts the reliability of the model and is not accounted for in… ▽ More

    Submitted 7 February, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: Main text: 11 pages, 6 figures; Supplements: 13 pages, 2 figures

  3. arXiv:2106.02624  [pdf, other

    cs.LG stat.ML

    ViViT: Curvature access through the generalized Gauss-Newton's low-rank structure

    Authors: Felix Dangel, Lukas Tatzel, Philipp Hennig

    Abstract: Curvature in form of the Hessian or its generalized Gauss-Newton (GGN) approximation is valuable for algorithms that rely on a local model for the loss to train, compress, or explain deep networks. Existing methods based on implicit multiplication via automatic differentiation or Kronecker-factored block diagonal approximations do not consider noise in the mini-batch. We present ViViT, a curvature… ▽ More

    Submitted 10 February, 2022; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: Main text: 10 pages, 6 figures; Supplements: 26 pages, 27 figures, 5 tables