Zum Hauptinhalt springen

Showing 1–11 of 11 results for author: Nagler, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.02354  [pdf, other

    cs.LG stat.ML

    Label-wise Aleatoric and Epistemic Uncertainty Quantification

    Authors: Yusuf Sale, Paul Hofman, Timo Löhr, Lisa Wimmer, Thomas Nagler, Eyke Hüllermeier

    Abstract: We present a novel approach to uncertainty quantification in classification tasks based on label-wise decomposition of uncertainty measures. This label-wise perspective allows uncertainty to be quantified at the individual class level, thereby improving cost-sensitive decision-making and helping understand the sources of uncertainty. Furthermore, it allows to define total, aleatoric, and epistemic… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Uncertainty in Artificial Intelligence. arXiv admin note: substantial text overlap with arXiv:2401.00276

  2. arXiv:2405.15393  [pdf, other

    stat.ML cs.LG

    Reshuffling Resampling Splits Can Improve Generalization of Hyperparameter Optimization

    Authors: Thomas Nagler, Lennart Schneider, Bernd Bischl, Matthias Feurer

    Abstract: Hyperparameter optimization is crucial for obtaining peak performance of machine learning models. The standard protocol evaluates various hyperparameter configurations using a resampling estimate of the generalization error to guide optimization and select a final hyperparameter configuration. Without much evidence, paired resampling splits, i.e., either a fixed train-validation split or a fixed c… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 39 pages, 4 tables, 29 figures

  3. arXiv:2405.02475  [pdf, other

    cs.LG cs.AI stat.CO stat.ME

    Generalizing Orthogonalization for Models with Non-Linearities

    Authors: David Rügamer, Chris Kolb, Tobias Weber, Lucas Kook, Thomas Nagler

    Abstract: The complexity of black-box algorithms can lead to various challenges, including the introduction of biases. These biases present immediate risks in the algorithms' application. It was, for instance, shown that neural networks can deduce racial information solely from a patient's X-ray scan, a task beyond the capability of medical experts. If this fact is not known to the medical expert, automatic… ▽ More

    Submitted 2 June, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  4. arXiv:2403.10923  [pdf, other

    cs.LG cs.AI stat.CO stat.ML

    Interpretable Machine Learning for TabPFN

    Authors: David Rundel, Julius Kobialka, Constantin von Crailsheim, Matthias Feurer, Thomas Nagler, David Rügamer

    Abstract: The recently developed Prior-Data Fitted Networks (PFNs) have shown very promising results for applications in low-data regimes. The TabPFN model, a special case of PFNs for tabular data, is able to achieve state-of-the-art performance on a variety of classification tasks while producing posterior predictive distributions in mere seconds by in-context learning without the need for learning paramet… ▽ More

    Submitted 23 July, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

    Comments: This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this contribution is published in Explainable Artificial Intelligence, and is available online at https://doi.org/10.1007/978-3-031-63797-1_23

  5. arXiv:2401.00276  [pdf, other

    cs.LG stat.ML

    Second-Order Uncertainty Quantification: Variance-Based Measures

    Authors: Yusuf Sale, Paul Hofman, Lisa Wimmer, Eyke Hüllermeier, Thomas Nagler

    Abstract: Uncertainty quantification is a critical aspect of machine learning models, providing important insights into the reliability of predictions and aiding the decision-making process in real-world applications. This paper proposes a novel way to use variance-based measures to quantify uncertainty on the basis of second-order distributions in classification problems. A distinctive feature of the measu… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: 22 pages, 10 figures

  6. arXiv:2310.19683  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    An Online Bootstrap for Time Series

    Authors: Nicolai Palm, Thomas Nagler

    Abstract: Resampling methods such as the bootstrap have proven invaluable in the field of machine learning. However, the applicability of traditional bootstrap methods is limited when dealing with large streams of dependent data, such as time series or spatially correlated observations. In this paper, we propose a novel bootstrap method that is designed to account for data dependencies and can be executed o… ▽ More

    Submitted 26 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

  7. arXiv:2306.00541  [pdf, other

    stat.ML cs.LG

    Decomposing Global Feature Effects Based on Feature Interactions

    Authors: Julia Herbinger, Marvin N. Wright, Thomas Nagler, Bernd Bischl, Giuseppe Casalicchio

    Abstract: Global feature effect methods, such as partial dependence plots, provide an intelligible visualization of the expected marginal feature effect. However, such global feature effect methods can be misleading, as they do not represent local feature effects of single observations well when feature interactions are present. We formally introduce generalized additive decomposition of global effects (GAD… ▽ More

    Submitted 1 July, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

  8. arXiv:2305.11097  [pdf, other

    stat.ML cs.LG

    Statistical Foundations of Prior-Data Fitted Networks

    Authors: Thomas Nagler

    Abstract: Prior-data fitted networks (PFNs) were recently proposed as a new paradigm for machine learning. Instead of training the network to an observed training set, a fixed model is pre-trained offline on small, simulated training sets from a variety of tasks. The pre-trained model is then used to infer class probabilities in-context on fresh training sets with arbitrary size and distribution. Empiricall… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA. PMLR 202, 2023

  9. arXiv:2302.08883  [pdf, other

    stat.ML cs.AI cs.LG stat.ME

    Approximately Bayes-Optimal Pseudo Label Selection

    Authors: Julian Rodemann, Jann Goschenhofer, Emilio Dorigatti, Thomas Nagler, Thomas Augustin

    Abstract: Semi-supervised learning by self-training heavily relies on pseudo-label selection (PLS). The selection often depends on the initial model fit on labeled data. Early overfitting might thus be propagated to the final model by selecting instances with overconfident but erroneous predictions, often referred to as confirmation bias. This paper introduces BPLS, a Bayesian framework for PLS that aims to… ▽ More

    Submitted 26 June, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: UAI 2023

  10. arXiv:2102.06416  [pdf, other

    stat.ME cs.LG stat.ML

    Explaining predictive models using Shapley values and non-parametric vine copulas

    Authors: Kjersti Aas, Thomas Nagler, Martin Jullum, Anders Løland

    Abstract: The original development of Shapley values for prediction explanation relied on the assumption that the features being described were independent. If the features in reality are dependent this may lead to incorrect explanations. Hence, there have recently been attempts of appropriately modelling/estimating the dependence between the features. Although the proposed methods clearly outperform the tr… ▽ More

    Submitted 12 February, 2021; originally announced February 2021.

  11. arXiv:2012.09037  [pdf

    cs.LG physics.ao-ph

    Copula-based synthetic data augmentation for machine-learning emulators

    Authors: David Meyer, Thomas Nagler, Robin J. Hogan

    Abstract: Can we improve machine-learning (ML) emulators with synthetic data? If data are scarce or expensive to source and a physical model is available, statistically generated data may be useful for augmenting training sets cheaply. Here we explore the use of copula-based models for generating synthetically augmented datasets in weather and climate by testing the method on a toy physical model of downwel… ▽ More

    Submitted 26 September, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: Published version

    Journal ref: Geoscientific Model Development, 14(8), 5205--5215 (2021)