Showing 1–22 of 22 results for author: Wever, M

Searching in archive cs.
  1. arXiv:2406.17322  [pdf, other]

    cs.LG cs.AI

    ALPBench: A Benchmark for Active Learning Pipelines on Tabular Data

    Authors: Valentin Margraf, Marcel Wever, Sandra Gilhuber, Gabriel Marques Tavares, Thomas Seidl, Eyke Hüllermeier

    Abstract: In settings where only a budgeted amount of labeled data can be afforded, active learning seeks to devise query strategies for selecting the most informative data points to be labeled, aiming to enhance learning algorithms' efficiency and performance. Numerous such query strategies have been proposed and compared in the active learning literature. However, the community still lacks standardized be…

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2405.02200  [pdf, other]

    cs.LG stat.ML

    Position: Why We Must Rethink Empirical Research in Machine Learning

    Authors: Moritz Herrmann, F. Julian D. Lange, Katharina Eggensperger, Giuseppe Casalicchio, Marcel Wever, Matthias Feurer, David Rügamer, Eyke Hüllermeier, Anne-Laure Boulesteix, Bernd Bischl

    Abstract: We warn against a common but incomplete understanding of empirical research in machine learning that leads to non-replicable results, makes findings unreliable, and threatens to undermine progress in the field. To overcome this alarming situation, we call for more awareness of the plurality of ways of gaining knowledge experimentally but also of some epistemic limitations. In particular, we argue…

    Submitted 25 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: 20 pages, accepted for publication at ICML 2024, camera-ready version

  3. Automated Machine Learning for Multi-Label Classification

    Authors: Marcel Wever

    Abstract: Automated machine learning (AutoML) aims to select and configure machine learning algorithms and combine them into machine learning pipelines tailored to a dataset at hand. For supervised learning tasks, most notably binary and multinomial classification, aka single-label classification (SLC), such AutoML approaches have shown promising results. However, the task of multi-label classification (MLC…

    Submitted 28 February, 2024; originally announced February 2024.

  4. arXiv:2401.14283  [pdf, ps, other]

    stat.ML cs.LG

    Information Leakage Detection through Approximate Bayes-optimal Prediction

    Authors: Pritha Gupta, Marcel Wever, Eyke Hüllermeier

    Abstract: In today's data-driven world, the proliferation of publicly available information raises security concerns due to the information leakage (IL) problem. IL involves unintentionally exposing sensitive information to unauthorized parties via observable system information. Conventional statistical approaches rely on estimating mutual information (MI) between observable and secret information for detec…

    Submitted 29 July, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Under submission in Information Sciences

    MSC Class: 94A15; 62H30; 94A60

    ACM Class: I.5.1; G.3; E.3

  5. arXiv:2302.00511  [pdf, other]

    cs.LG cs.AI

    Iterative Deepening Hyperband

    Authors: Jasmin Brandt, Marcel Wever, Dimitrios Iliadis, Viktor Bengs, Eyke Hüllermeier

    Abstract: Hyperparameter optimization (HPO) is concerned with the automated search for the most appropriate hyperparameter configuration (HPC) of a parameterized machine learning algorithm. A state-of-the-art HPO method is Hyperband, which, however, has its own parameters that influence its performance. One of these parameters, the maximal budget, is especially problematic: If chosen too small, the budget n…

    Submitted 6 February, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

  6. PyExperimenter: Easily distribute experiments and track results

    Authors: Tanja Tornede, Alexander Tornede, Lukas Fehring, Lukas Gehring, Helena Graf, Jonas Hanselle, Felix Mohr, Marcel Wever

    Abstract: PyExperimenter is a tool to facilitate the setup, documentation, execution, and subsequent evaluation of results from an empirical study of algorithms and in particular is designed to reduce the involved manual effort significantly. It is intended to be used by researchers in the field of artificial intelligence, but is not limited to those.

    Submitted 21 April, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

    Comments: Published in Journal of Open Source Software

  7. arXiv:2211.13681  [pdf, other]

    cs.LG

    Meta-Learning for Automated Selection of Anomaly Detectors for Semi-Supervised Datasets

    Authors: David Schubert, Pritha Gupta, Marcel Wever

    Abstract: In anomaly detection, a prominent task is to induce a model to identify anomalies learned solely based on normal data. Generally, one is interested in finding an anomaly detector that correctly identifies anomalies, i.e., data points that do not belong to the normal class, without raising too many false alarms. Which anomaly detector is best suited depends on the dataset at hand and thus needs to…

    Submitted 24 November, 2022; originally announced November 2022.

  8. arXiv:2211.04362  [pdf, other]

    cs.LG

    Hyperparameter optimization in deep multi-target prediction

    Authors: Dimitrios Iliadis, Marcel Wever, Bernard De Baets, Willem Waegeman

    Abstract: As a result of the ever increasing complexity of configuring and fine-tuning machine learning models, the field of automated machine learning (AutoML) has emerged over the past decade. However, software implementations like Auto-WEKA and Auto-sklearn typically focus on classical machine learning (ML) tasks such as classification and regression. Our work can be seen as the first attempt at offering…

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: 17 pages, 4 figures, 1 table

  9. A Survey of Methods for Automated Algorithm Configuration

    Authors: Elias Schede, Jasmin Brandt, Alexander Tornede, Marcel Wever, Viktor Bengs, Eyke Hüllermeier, Kevin Tierney

    Abstract: Algorithm configuration (AC) is concerned with the automated search of the most suitable parameter configuration of a parametrized algorithm. There is currently a wide variety of AC problem variants and methods proposed in the literature. Existing reviews do not take into account all derivatives of the AC problem, nor do they offer a complete classification scheme. To this end, we introduce taxono…

    Submitted 13 October, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    ACM Class: I.2.6

    Journal ref: Journal of Artificial Intelligence Research (JAIR) 75 (2022) 425-487

  10. arXiv:2111.14514  [pdf, other]

    cs.LG

    Naive Automated Machine Learning

    Authors: Felix Mohr, Marcel Wever

    Abstract: An essential task of Automated Machine Learning (AutoML) is the problem of automatically finding the pipeline with the best generalization performance on a given dataset. This problem has been addressed with sophisticated black-box optimization techniques such as Bayesian Optimization, Grammar-Based Genetic Algorithms, and tree search algorithms. Most of the current approaches are motivated by the…

    Submitted 29 November, 2021; originally announced November 2021.

  11. Towards Green Automated Machine Learning: Status Quo and Future Directions

    Authors: Tanja Tornede, Alexander Tornede, Jonas Hanselle, Marcel Wever, Felix Mohr, Eyke Hüllermeier

    Abstract: Automated machine learning (AutoML) strives for the automatic configuration of machine learning algorithms and their composition into an overall (software) solution - a machine learning pipeline - tailored to the learning task (dataset) at hand. Over the last decade, AutoML has developed into an independent research field with hundreds of contributions. At the same time, AutoML is being criticised…

    Submitted 13 June, 2023; v1 submitted 10 November, 2021; originally announced November 2021.

    Comments: Published in Journal of Artificial Intelligence Research

  12. arXiv:2109.04744  [pdf, ps, other]

    cs.AI cs.LG

    Automated Machine Learning, Bounded Rationality, and Rational Metareasoning

    Authors: Eyke Hüllermeier, Felix Mohr, Alexander Tornede, Marcel Wever

    Abstract: The notion of bounded rationality originated from the insight that perfectly rational behavior cannot be realized by agents with limited cognitive or computational resources. Research on bounded rationality, mainly initiated by Herbert Simon, has a longstanding tradition in economics and the social sciences, but also plays a major role in modern AI and intelligent agent design. Taking actions unde…

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: Accepted at ECMLPKDD WORKSHOP ON AUTOMATING DATA SCIENCE (ADS2021) - https://sites.google.com/view/autods

  13. arXiv:2107.09414  [pdf, other]

    cs.LG cs.AI

    Algorithm Selection on a Meta Level

    Authors: Alexander Tornede, Lukas Gehring, Tanja Tornede, Marcel Wever, Eyke Hüllermeier

    Abstract: The problem of selecting an algorithm that appears most suitable for a specific instance of an algorithmic problem class, such as the Boolean satisfiability problem, is called instance-specific algorithm selection. Over the past decade, the problem has received considerable attention, resulting in a number of different methods for algorithm selection. Although most of these methods are based on ma…

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: under review for a special issue @ MLJ

  14. arXiv:2105.07270  [pdf, other]

    cs.CL cs.AI

    Annotation Uncertainty in the Context of Grammatical Change

    Authors: Marie-Luis Merten, Marcel Wever, Michaela Geierhos, Doris Tophinke, Eyke Hüllermeier

    Abstract: This paper elaborates on the notion of uncertainty in the context of annotation in large text corpora, specifically focusing on (but not limited to) historical languages. Such uncertainty might be due to inherent properties of the language, for example, linguistic ambiguity and overlapping categories of linguistic description, but could also be caused by lacking annotation expertise. By examining…

    Submitted 28 May, 2021; v1 submitted 15 May, 2021; originally announced May 2021.

  15. arXiv:2103.10496  [pdf, other]

    cs.LG

    Naive Automated Machine Learning -- A Late Baseline for AutoML

    Authors: Felix Mohr, Marcel Wever

    Abstract: Automated Machine Learning (AutoML) is the problem of automatically finding the pipeline with the best generalization performance on some given dataset. AutoML has received enormous attention in the last decade and has been addressed with sophisticated black-box optimization techniques such as Bayesian Optimization, Grammar-Based Genetic Algorithms, and tree search algorithms. In contrast to those…

    Submitted 18 March, 2021; originally announced March 2021.

  16. arXiv:2011.08784  [pdf, other]

    cs.LG stat.ML

    Towards Meta-Algorithm Selection

    Authors: Alexander Tornede, Marcel Wever, Eyke Hüllermeier

    Abstract: Instance-specific algorithm selection (AS) deals with the automatic selection of an algorithm from a fixed set of candidates most suitable for a specific instance of an algorithmic problem class, where "suitability" often refers to an algorithm's runtime. Over the past years, a plethora of algorithm selectors have been proposed. As an algorithm selector is again an algorithm solving a specific pro…

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: Accepted at 4th Workshop on Meta-Learning at NeurIPS 2020, Vancouver, Canada

  17. arXiv:2011.00792  [pdf, other]

    cs.LG cs.AI

    A Flexible Class of Dependence-aware Multi-Label Loss Functions

    Authors: Eyke Hüllermeier, Marcel Wever, Eneldo Loza Mencia, Johannes Fürnkranz, Michael Rapp

    Abstract: Multi-label classification is the task of assigning a subset of labels to a given query instance. For evaluating such predictions, the set of predicted labels needs to be compared to the ground-truth label set associated with that instance, and various loss functions have been proposed for this purpose. In addition to assessing predictive accuracy, a key concern in this regard is to foster and to…

    Submitted 2 November, 2020; originally announced November 2020.

  18. arXiv:2008.01377  [pdf, other]

    cs.CL cs.IR cs.LG stat.ML

    Reliable Part-of-Speech Tagging of Historical Corpora through Set-Valued Prediction

    Authors: Stefan Heid, Marcel Wever, Eyke Hüllermeier

    Abstract: Syntactic annotation of corpora in the form of part-of-speech (POS) tags is a key requirement for both linguistic research and subsequent automated natural language processing (NLP) tasks. This problem is commonly tackled using machine learning methods, i.e., by training a POS tagger on a sufficiently large corpus of labeled data. While the problem of POS tagging can essentially be considered as s…

    Submitted 16 August, 2021; v1 submitted 4 August, 2020; originally announced August 2020.

    Comments: 14 pages, 8 figures

    ACM Class: I.2.7

  19. arXiv:2007.02816  [pdf, other]

    cs.LG stat.ML

    Run2Survive: A Decision-theoretic Approach to Algorithm Selection based on Survival Analysis

    Authors: Alexander Tornede, Marcel Wever, Stefan Werner, Felix Mohr, Eyke Hüllermeier

    Abstract: Algorithm selection (AS) deals with the automatic selection of an algorithm from a fixed set of candidate algorithms most suitable for a specific instance of an algorithmic problem class, where "suitability" often refers to an algorithm's runtime. Due to possibly extremely long runtimes of candidate algorithms, training data for algorithm selection models is usually generated under time constraint…

    Submitted 10 July, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

  20. Extreme Algorithm Selection With Dyadic Feature Representation

    Authors: Alexander Tornede, Marcel Wever, Eyke Hüllermeier

    Abstract: Algorithm selection (AS) deals with selecting an algorithm from a fixed set of candidate algorithms most suitable for a specific instance of an algorithmic problem, e.g., choosing solvers for SAT problems. Benchmark suites for AS usually comprise candidate sets consisting of at most tens of algorithms, whereas in combined algorithm selection and hyperparameter optimization problems the number of c…

    Submitted 22 October, 2020; v1 submitted 29 January, 2020; originally announced January 2020.

    Comments: Published at Discovery Science 2020

  21. arXiv:1811.04060  [pdf, other]

    cs.LG stat.ML

    Automated Multi-Label Classification based on ML-Plan

    Authors: Marcel Wever, Felix Mohr, Eyke Hüllermeier

    Abstract: Automated machine learning (AutoML) has received increasing attention in the recent past. While the main tools for AutoML, such as Auto-WEKA, TPOT, and auto-sklearn, mainly deal with single-label classification and regression, there is very little work on other types of machine learning tasks. In particular, there is almost no work on automating the engineering of machine learning applications for…

    Submitted 9 November, 2018; originally announced November 2018.

  22. arXiv:1809.00486  [pdf, other]

    cs.SE

    Automated Machine Learning Service Composition

    Authors: Felix Mohr, Marcel Wever, Eyke Hüllermeier

    Abstract: Automated service composition as the process of creating new software in an automated fashion has been studied in many different ways over the last decade. However, the impact of automated service composition has been rather small as its utility in real-world applications has not been demonstrated so far. This paper presents \tool, an algorithm for automated service composition applied to the area…

    Submitted 3 September, 2018; originally announced September 2018.