Showing 1–9 of 9 results for author: Irurozki, E

Searching in archive cs.
  1. arXiv:2305.10284

    cs.CL cs.AI

    Towards More Robust NLP System Evaluation: Handling Missing Scores in Benchmarks

    Authors: Anas Himmi, Ekhine Irurozki, Nathan Noiry, Stephan Clemencon, Pierre Colombo

    Abstract: The evaluation of natural language processing (NLP) systems is crucial for advancing the field, but current benchmarking approaches often assume that all systems have scores available for all tasks, which is not always practical. In reality, several factors such as the cost of running baseline, private systems, computational limitations, or incomplete data may prevent some systems from being evalu…

    Submitted 17 May, 2023; originally announced May 2023.

  2. arXiv:2305.07345

    cs.PF cs.DS math.OC stat.AP

    On the Fair Comparison of Optimization Algorithms in Different Machines

    Authors: Etor Arza, Josu Ceberio, Ekhiñe Irurozki, Aritz Pérez

    Abstract: An experimental comparison of two or more optimization algorithms requires the same computational resources to be assigned to each algorithm. When a maximum runtime is set as the stopping criterion, all algorithms need to be executed in the same machine if they are to use the same resources. Unfortunately, the implementation code of the algorithms is not always available, which means that running…

    Submitted 7 August, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Journal ref: Ann. Appl. Stat. 18(1): 42-62 (March 2024)

  3. arXiv:2303.12878

    cs.LG stat.ML

    Robust Consensus in Ranking Data Analysis: Definitions, Properties and Computational Issues

    Authors: Morgane Goibert, Clément Calauzènes, Ekhine Irurozki, Stéphan Clémençon

    Abstract: As the issue of robustness in AI systems becomes vital, statistical learning techniques that are reliable even in presence of partly contaminated data have to be developed. Preference data, in the form of (complete) rankings in the simplest situations, are no exception and the demand for appropriate concepts and tools is all the more pressing given that technologies fed by or producing this type o…

    Submitted 22 March, 2023; originally announced March 2023.

  4. arXiv:2203.07889

    stat.ML cs.LG stat.ME

    Comparing Two Samples Through Stochastic Dominance: A Graphical Approach

    Authors: Etor Arza, Josu Ceberio, Ekhiñe Irurozki, Aritz Pérez

    Abstract: Non-deterministic measurements are common in real-world scenarios: the performance of a stochastic optimization algorithm or the total reward of a reinforcement learning agent in a chaotic environment are just two examples in which unpredictable outcomes are common. These measures can be modeled as random variables and compared among each other via their expected values or more sophisticated tools…

    Submitted 30 August, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

    Journal ref: Etor Arza, Josu Ceberio, Ekhiñe Irurozki & Aritz Pérez (2022) Comparing Two Samples Through Stochastic Dominance: A Graphical Approach, Journal of Computational and Graphical Statistics

  5. arXiv:2202.03799

    cs.CL cs.AI

    What are the best systems? New perspectives on NLP Benchmarking

    Authors: Pierre Colombo, Nathan Noiry, Ekhine Irurozki, Stephan Clemencon

    Abstract: In Machine Learning, a benchmark refers to an ensemble of datasets associated with one or multiple metrics together with a way to aggregate different systems performances. They are instrumental in (i) assessing the progress of new methods along different axes and (ii) selecting the best systems for practical use. This is particularly the case for NLP with the development of large pre-trained model…

    Submitted 7 October, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

  6. arXiv:2201.10453

    cs.AI

    The First AI4TSP Competition: Learning to Solve Stochastic Routing Problems

    Authors: Laurens Bliek, Paulo da Costa, Reza Refaei Afshar, Yingqian Zhang, Tom Catshoek, Daniël Vos, Sicco Verwer, Fynn Schmitt-Ulms, André Hottung, Tapan Shah, Meinolf Sellmann, Kevin Tierney, Carl Perreault-Lafleur, Caroline Leboeuf, Federico Bobbio, Justine Pepin, Warley Almeida Silva, Ricardo Gama, Hugo L. Fernandes, Martin Zaefferer, Manuel López-Ibáñez, Ekhine Irurozki

    Abstract: This paper reports on the first international competition on AI for the traveling salesman problem (TSP) at the International Joint Conference on Artificial Intelligence 2021 (IJCAI-21). The TSP is one of the classical combinatorial optimization problems, with many variants inspired by real-world applications. This first competition asked the participants to develop algorithms to solve a time-depe…

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: 21 pages

    MSC Class: 68T05

  7. arXiv:2201.08105

    cs.LG stat.ML

    Statistical Depth Functions for Ranking Distributions: Definitions, Statistical Learning and Applications

    Authors: Morgane Goibert, Stéphan Clémençon, Ekhine Irurozki, Pavlo Mozharovskyi

    Abstract: The concept of median/consensus has been widely investigated in order to provide a statistical summary of ranking data, i.e. realizations of a random permutation $\Sigma$ of a finite set, $\{1,\; \ldots,\; n\}$ with $n\geq 1$ say. As it sheds light onto only one aspect of $\Sigma$'s distribution $P$, it may neglect other informative features. It is the purpose of this paper to define analogs of quantiles, r…

    Submitted 20 January, 2022; originally announced January 2022.

  8. Kernels of Mallows Models under the Hamming Distance for solving the Quadratic Assignment Problem

    Authors: Etor Arza, Aritz Perez, Ekhine Irurozki, Josu Ceberio

    Abstract: The Quadratic Assignment Problem (QAP) is a well-known permutation-based combinatorial optimization problem with real applications in industrial and logistics environments. Motivated by the challenge that this NP-hard problem represents, it has captured the attention of the optimization community for decades. As a result, a large number of algorithms have been proposed to tackle this problem. Amon…

    Submitted 18 August, 2020; v1 submitted 19 October, 2019; originally announced October 2019.

    Comments: 23 pages

  9. arXiv:1910.08795

    stat.ML cs.LG

    Rank aggregation for non-stationary data streams

    Authors: Ekhine Irurozki, Jesus Lobo, Aritz Perez, Javier Del Ser

    Abstract: We consider the problem of learning over non-stationary ranking streams. The rankings can be interpreted as the preferences of a population and the non-stationarity means that the distribution of preferences changes over time. Our goal is to learn, in an online manner, the current distribution of rankings. The bottleneck of this process is a rank aggregation problem. We propose a generalization…

    Submitted 27 October, 2020; v1 submitted 19 October, 2019; originally announced October 2019.

    Comments: 23 pages