Zum Hauptinhalt springen

Showing 1–24 of 24 results for author: van Rijn, J N

.
  1. arXiv:2408.06302  [pdf, ps, other

    cs.LG cs.CV

    Finding Patterns in Ambiguity: Interpretable Stress Testing in the Decision~Boundary

    Authors: Inês Gomes, Luís F. Teixeira, Jan N. van Rijn, Carlos Soares, André Restivo, Luís Cunha, Moisés Santos

    Abstract: The increasing use of deep learning across various domains highlights the importance of understanding the decision-making processes of these black-box models. Recent research focusing on the decision boundaries of deep classifiers, relies on generated synthetic instances in areas of low confidence, uncovering samples that challenge both models and humans. We propose a novel approach to enhance the… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: To be published in the Responsible Generative AI workshop at CVPR

  2. arXiv:2406.10154  [pdf, other

    cs.LG cs.AI cs.LO

    Automated Design of Linear Bounding Functions for Sigmoidal Nonlinearities in Neural Networks

    Authors: Matthias König, Xiyue Zhang, Holger H. Hoos, Marta Kwiatkowska, Jan N. van Rijn

    Abstract: The ubiquity of deep learning algorithms in various applications has amplified the need for assuring their robustness against small input perturbations such as those occurring in adversarial attacks. Existing complete verification techniques offer provable guarantees for all robustness queries but struggle to scale beyond small neural networks. To overcome this computational intractability, incomp… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2404.09703  [pdf, other

    cs.LG stat.ML

    AI Competitions and Benchmarks: Dataset Development

    Authors: Romain Egele, Julio C. S. Jacques Junior, Jan N. van Rijn, Isabelle Guyon, Xavier Baró, Albert Clapés, Prasanna Balaprakash, Sergio Escalera, Thomas Moeslund, Jun Wan

    Abstract: Machine learning is now used in many applications thanks to its ability to predict, generate, or discover patterns from large quantities of data. However, the process of collecting and transforming data for practical use is intricate. Even in today's digital era, where substantial data is generated daily, it is uncommon for it to be readily usable; most often, it necessitates meticulous manual dat… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Preprint version of the 3rd Chapter of the book: Competitions and Benchmarks, the science behind the contests (https://sites.google.com/chalearn.org/book/home)

  4. arXiv:2310.14139  [pdf, other

    cs.LG cs.AI stat.ML

    Are LSTMs Good Few-Shot Learners?

    Authors: Mike Huisman, Thomas M. Moerland, Aske Plaat, Jan N. van Rijn

    Abstract: Deep learning requires large amounts of data to learn new tasks well, limiting its applicability to domains where such data is available. Meta-learning overcomes this limitation by learning how to learn. In 2001, Hochreiter et al. showed that an LSTM trained with backpropagation across different tasks is capable of meta-learning. Despite promising results of this approach on small problems, and mo… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: Accepted at Machine Learning Journal, Special Issue of the ECML PKDD 2023 Journal Track

  5. arXiv:2310.09028  [pdf, other

    cs.LG cs.AI stat.ML

    Subspace Adaptation Prior for Few-Shot Learning

    Authors: Mike Huisman, Aske Plaat, Jan N. van Rijn

    Abstract: Gradient-based meta-learning techniques aim to distill useful prior knowledge from a set of training tasks such that new tasks can be learned more efficiently with gradient descent. While these methods have achieved successes in various scenarios, they commonly adapt all parameters of trainable layers when learning new tasks. This neglects potentially more efficient learning strategies for a given… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: Accepted at Machine Learning Journal, Special Issue of the ECML PKDD 2023 Journal Track

  6. arXiv:2310.06148  [pdf, other

    cs.LG cs.AI stat.ML

    Understanding Transfer Learning and Gradient-Based Meta-Learning Techniques

    Authors: Mike Huisman, Aske Plaat, Jan N. van Rijn

    Abstract: Deep neural networks can yield good performance on various tasks but often require large amounts of data to train them. Meta-learning received considerable attention as one approach to improve the generalization of these networks from a limited amount of data. Whilst meta-learning techniques have been observed to be successful at this in various scenarios, recent results suggest that when evaluate… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted at Machine Learning Journal, Special Issue on Discovery Science 2021

  7. arXiv:2305.08413  [pdf, other

    cs.CV eess.IV stat.AP

    Artificial intelligence to advance Earth observation: a perspective

    Authors: Devis Tuia, Konrad Schindler, Begüm Demir, Gustau Camps-Valls, Xiao Xiang Zhu, Mrinalini Kochupillai, Sašo Džeroski, Jan N. van Rijn, Holger H. Hoos, Fabio Del Frate, Mihai Datcu, Jorge-Arnulfo Quiané-Ruiz, Volker Markl, Bertrand Le Saux, Rochelle Schneider

    Abstract: Earth observation (EO) is a prime instrument for monitoring land and ocean processes, studying the dynamics at work, and taking the pulse of our planet. This article gives a bird's eye view of the essential scientific tools and approaches informing and supporting the transition from raw EO data to usable EO-based information. The promises, as well as the current challenges of these developments, a… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  8. arXiv:2302.08909  [pdf, other

    cs.CV

    Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification

    Authors: Ihsan Ullah, Dustin Carrión-Ojeda, Sergio Escalera, Isabelle Guyon, Mike Huisman, Felix Mohr, Jan N van Rijn, Haozhe Sun, Joaquin Vanschoren, Phan Anh Vu

    Abstract: We introduce Meta-Album, an image classification meta-dataset designed to facilitate few-shot learning, transfer learning, meta-learning, among other tasks. It includes 40 open datasets, each having at least 20 classes with 40 examples per class, with verified licences. They stem from diverse domains, such as ecology (fauna and flora), manufacturing (textures, vehicles), human actions, and optical… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Journal ref: 36th Conference on Neural Information Processing Systems (NeurIPS 2022) Track on Datasets and Benchmarks., NeurIPS, Nov 2022, New Orleans, United States

  9. Hyperparameter Importance of Quantum Neural Networks Across Small Datasets

    Authors: Charles Moussa, Jan N. van Rijn, Thomas Bäck, Vedran Dunjko

    Abstract: As restricted quantum computers are slowly becoming a reality, the search for meaningful first applications intensifies. In this domain, one of the more investigated approaches is the use of a special type of quantum circuit - a so-called quantum neural network -- to serve as a basis for a machine learning model. Roughly speaking, as the name suggests, a quantum neural network can play a similar r… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

    Comments: Submitted to Discovery Science 2022

  10. arXiv:2206.08138  [pdf, other

    cs.LG cs.AI cs.CV cs.NE

    Lessons learned from the NeurIPS 2021 MetaDL challenge: Backbone fine-tuning without episodic meta-learning dominates for few-shot learning image classification

    Authors: Adrian El Baz, Ihsan Ullah, Edesio Alcobaça, André C. P. L. F. Carvalho, Hong Chen, Fabio Ferreira, Henry Gouk, Chaoyu Guan, Isabelle Guyon, Timothy Hospedales, Shell Hu, Mike Huisman, Frank Hutter, Zhengying Liu, Felix Mohr, Ekrem Öztürk, Jan N. van Rijn, Haozhe Sun, Xin Wang, Wenwu Zhu

    Abstract: Although deep neural networks are capable of achieving performance superior to humans on various tasks, they are notorious for requiring large amounts of data and computing resources, restricting their success to domains where such resources are available. Metalearning methods can address this problem by transferring knowledge from related tasks, thus reducing the amount of data and computing reso… ▽ More

    Submitted 11 July, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: version 2 is the correct version, including supplementary material at the end

    Journal ref: NeurIPS 2021 Competition and Demonstration Track, Dec 2021, On-line, United States

  11. arXiv:2201.12150  [pdf, other

    cs.LG

    Learning Curves for Decision Making in Supervised Machine Learning -- A Survey

    Authors: Felix Mohr, Jan N. van Rijn

    Abstract: Learning curves are a concept from social sciences that has been adopted in the context of machine learning to assess the performance of a learning algorithm with respect to a certain resource, e.g. the number of training examples or the number of training iterations. Learning curves have important applications in several contexts of machine learning, most importantly for the context of data acqui… ▽ More

    Submitted 28 January, 2022; originally announced January 2022.

  12. arXiv:2111.13914  [pdf, other

    cs.LG

    Fast and Informative Model Selection using Learning Curve Cross-Validation

    Authors: Felix Mohr, Jan N. van Rijn

    Abstract: Common cross-validation (CV) methods like k-fold cross-validation or Monte-Carlo cross-validation estimate the predictive performance of a learner by repeatedly training it on a large portion of the given data and testing on the remaining data. These techniques have two major drawbacks. First, they can be unnecessarily slow on large datasets. Second, beyond an estimation of the final performance,… ▽ More

    Submitted 27 November, 2021; originally announced November 2021.

  13. Meta-Learning for Symbolic Hyperparameter Defaults

    Authors: Pieter Gijsbers, Florian Pfisterer, Jan N. van Rijn, Bernd Bischl, Joaquin Vanschoren

    Abstract: Hyperparameter optimization in machine learning (ML) deals with the problem of empirically learning an optimal algorithm configuration from data, usually formulated as a black-box optimization problem. In this work, we propose a zero-shot method to meta-learn symbolic default hyperparameter configurations that are expressed in terms of the properties of the dataset. This enables a much faster, but… ▽ More

    Submitted 11 June, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: Pieter Gijsbers and Florian Pfisterer contributed equally to the paper. V1: Two page GECCO poster paper accepted at GECCO 2021. V2: The original full length paper (8 pages) with appendix

  14. arXiv:2104.10527  [pdf, other

    cs.LG cs.AI stat.ML

    Stateless Neural Meta-Learning using Second-Order Gradients

    Authors: Mike Huisman, Aske Plaat, Jan N. van Rijn

    Abstract: Deep learning typically requires large data sets and much compute power for each new problem that is learned. Meta-learning can be used to learn a good prior that facilitates quick learning, thereby relaxing these requirements so that new tasks can be learned quicker; two popular approaches are MAML and the meta-learner LSTM. In this work, we compare the two and formally show that the meta-learner… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

    Journal ref: Machine Learning, 2022

  15. arXiv:2010.03522  [pdf, other

    cs.LG cs.AI stat.ML

    A Survey of Deep Meta-Learning

    Authors: Mike Huisman, Jan N. van Rijn, Aske Plaat

    Abstract: Deep neural networks can achieve great successes when presented with large data sets and sufficient computational resources. However, their ability to learn new concepts quickly is limited. Meta-learning is one approach to address this issue, by enabling the network to learn how to learn. The field of Deep Meta-Learning advances at great speed, but lacks a unified, in-depth overview of current tec… ▽ More

    Submitted 21 April, 2021; v1 submitted 7 October, 2020; originally announced October 2020.

    Comments: Published in the AI Review (AIRE) Journal (2021)

  16. arXiv:1911.02490  [pdf, other

    cs.LG stat.ML

    OpenML-Python: an extensible Python API for OpenML

    Authors: Matthias Feurer, Jan N. van Rijn, Arlind Kadra, Pieter Gijsbers, Neeratyoy Mallik, Sahithya Ravi, Andreas Müller, Joaquin Vanschoren, Frank Hutter

    Abstract: OpenML is an online platform for open science collaboration in machine learning, used to share datasets and results of machine learning experiments. In this paper we introduce OpenML-Python, a client API for Python, opening up the OpenML platform for a wide range of Python-based tools. It provides easy access to all datasets, tasks and experiments on OpenML from within Python. It also provides fun… ▽ More

    Submitted 23 June, 2021; v1 submitted 6 November, 2019; originally announced November 2019.

    Journal ref: Journal of Machine Learning Research 22(100), 2021

  17. arXiv:1811.09409  [pdf, other

    stat.ML cs.LG

    Learning Multiple Defaults for Machine Learning Algorithms

    Authors: Florian Pfisterer, Jan N. van Rijn, Philipp Probst, Andreas Müller, Bernd Bischl

    Abstract: The performance of modern machine learning methods highly depends on their hyperparameter configurations. One simple way of selecting a configuration is to use default settings, often proposed along with the publication and implementation of a new algorithm. Those default values are usually chosen in an ad-hoc manner to work good enough on a wide variety of datasets. To address this problem, diffe… ▽ More

    Submitted 30 April, 2021; v1 submitted 23 November, 2018; originally announced November 2018.

  18. arXiv:1805.01214  [pdf, other

    cs.AI

    The Algorithm Selection Competitions 2015 and 2017

    Authors: Marius Lindauer, Jan N. van Rijn, Lars Kotthoff

    Abstract: The algorithm selection problem is to choose the most suitable algorithm for solving a given problem instance. It leverages the complementarity between different approaches that is present in many areas of AI. We report on the state of the art in algorithm selection, as defined by the Algorithm Selection competitions in 2015 and 2017. The results of these competitions show how the state of the art… ▽ More

    Submitted 4 October, 2018; v1 submitted 3 May, 2018; originally announced May 2018.

  19. Hyperparameter Importance Across Datasets

    Authors: J. N. van Rijn, F. Hutter

    Abstract: With the advent of automated machine learning, automated hyperparameter optimization methods are by now routinely used in data mining. However, this progress is not yet matched by equal progress on automatic analyses that yield information beyond performance-optimizing hyperparameter settings. In this work, we aim to answer the following two questions: Given an algorithm, what are generally its mo… ▽ More

    Submitted 29 May, 2018; v1 submitted 12 October, 2017; originally announced October 2017.

    Comments: \c{opyright} 2018. Copyright is held by the owner/author(s). Publication rights licensed to ACM. This is the author's version of the work. It is posted here for your personal use, not for redistribution. The definitive Version of Record was published in Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

  20. arXiv:1708.03731  [pdf, other

    stat.ML cs.LG

    OpenML Benchmarking Suites

    Authors: Bernd Bischl, Giuseppe Casalicchio, Matthias Feurer, Pieter Gijsbers, Frank Hutter, Michel Lang, Rafael G. Mantovani, Jan N. van Rijn, Joaquin Vanschoren

    Abstract: Machine learning research depends on objectively interpretable, comparable, and reproducible algorithm benchmarks. We advocate the use of curated, comprehensive suites of machine learning tasks to standardize the setup, execution, and reporting of benchmarks. We enable this through software tools that help to create and leverage these benchmarking suites. These are seamlessly integrated into the O… ▽ More

    Submitted 22 November, 2021; v1 submitted 11 August, 2017; originally announced August 2017.

    Comments: Accepted for publication in the Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS 2021)

    Journal ref: Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (2021)

  21. arXiv:1604.07553  [pdf, other

    cs.CC

    The Complexity of Rummikub Problems

    Authors: Jan N. van Rijn, Frank W. Takes, Jonathan K. Vis

    Abstract: Rummikub is a tile-based game in which each player starts with a hand of $14$ tiles. A tile has a value and a suit. The players form sets consisting of tiles with the same suit and consecutive values (runs) or tiles with the same value and different suits (groups). The corresponding optimization problem is, given a hand of tiles, to form valid sets such that the score (sum of tile values) is maxim… ▽ More

    Submitted 26 April, 2016; originally announced April 2016.

    Comments: First appeared in proceedings of BNAIC 2015 (http://bnaic2015.org)

  22. arXiv:1604.07312  [pdf, ps, other

    cs.AI

    Endgame Analysis of Dou Shou Qi

    Authors: Jan N. van Rijn, Jonathan K. Vis

    Abstract: Dou Shou Qi is a game in which two players control a number of pieces, each of them aiming to move one of their pieces onto a given square. We implemented an engine for analyzing the game. Moreover, we created a series of endgame tablebases containing all configurations with up to four pieces. These tablebases are the first steps towards theoretically solving the game. Finally, we constructed deci… ▽ More

    Submitted 25 April, 2016; originally announced April 2016.

    Comments: 5 pages, ICGA Journal, Vol. 37, pp. 120-124, 2014

    Journal ref: ICGA Journal, Vol. 37, pp. 120-124, 2014

  23. arXiv:1604.05487  [pdf, ps, other

    cs.CC

    Acyclic Constraint Logic and Games

    Authors: Hendrik Jan Hoogeboom, Walter A. Kosters, Jan N. van Rijn, Jonathan K. Vis

    Abstract: Non-deterministic Constraint Logic is a family of graph games introduced by Demaine and Hearn that facilitates the construction of complexity proofs. It is convenient for the analysis of games, providing a uniform view. We focus on the acyclic version, apply this to Klondike, Mahjong Solitaire and Nonogram (that requires planarity), and discuss the more complicated game of Dou Shou Qi. While for t… ▽ More

    Submitted 19 April, 2016; originally announced April 2016.

    Comments: 14 pages, originally published at: ICGA Journal Vol. 37, pp. 3-16, 2014

    Journal ref: ICGA Journal, Vol. 37, pp. 3-16, 2014

  24. OpenML: networked science in machine learning

    Authors: Joaquin Vanschoren, Jan N. van Rijn, Bernd Bischl, Luis Torgo

    Abstract: Many sciences have made significant breakthroughs by adopting online tools that help organize, structure and mine information that is too detailed to be printed in journals. In this paper, we introduce OpenML, a place for machine learning researchers to share and organize data in fine detail, so that they can work more effectively, be more visible, and collaborate with others to tackle harder prob… ▽ More

    Submitted 1 August, 2014; v1 submitted 29 July, 2014; originally announced July 2014.

    Comments: 12 pages, 10 figures

    Journal ref: SIGKDD Explor. Newsl. 15, 2 (June 2014), 49-60