
Showing 1–5 of 5 results for author: Darrin, M

  1. arXiv:2406.07640

    cs.LG cs.AI

    When is an Embedding Model More Promising than Another?

    Authors: Maxime Darrin, Philippe Formont, Ismail Ben Ayed, Jackie CK Cheung, Pablo Piantanida

    Abstract: Embedders play a central role in machine learning, projecting any object into numerical representations that can, in turn, be leveraged to perform various downstream tasks. The evaluation of embedding models typically depends on domain-specific empirical approaches utilizing downstream tasks, primarily because of the lack of a standardized framework for comparison. However, acquiring adequately la…

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2406.07359

    cs.CL

    GLIMPSE: Pragmatically Informative Multi-Document Summarization for Scholarly Reviews

    Authors: Maxime Darrin, Ines Arous, Pablo Piantanida, Jackie CK Cheung

    Abstract: Scientific peer review is essential for the quality of academic publications. However, the increasing number of paper submissions to conferences has strained the reviewing process. This surge poses a burden on area chairs who have to carefully read an ever-growing volume of reviews and discern each reviewer's main arguments as part of their decision process. In this paper, we introduce GLIMPSE, a sum…

    Submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2402.19457

    cs.CL cs.AI

    COSMIC: Mutual Information for Task-Agnostic Summarization Evaluation

    Authors: Maxime Darrin, Philippe Formont, Jackie Chi Kit Cheung, Pablo Piantanida

    Abstract: Assessing the quality of summarizers poses significant challenges. In response, we propose a novel task-oriented evaluation approach that assesses summarizers based on their capacity to produce summaries that are useful for downstream tasks, while preserving task outcomes. We theoretically establish a direct relationship between the resulting error probability of these tasks and the mutual informa…

    Submitted 14 August, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: ACL 2024

  4. arXiv:2302.09852

    cs.CL cs.AI

    Unsupervised Layer-wise Score Aggregation for Textual OOD Detection

    Authors: Maxime Darrin, Guillaume Staerman, Eduardo Dadalto Câmara Gomes, Jackie CK Cheung, Pablo Piantanida, Pierre Colombo

    Abstract: Out-of-distribution (OOD) detection is a rapidly growing field due to new robustness and security requirements driven by an increased number of AI-based systems. Existing OOD textual detectors often rely on an anomaly score (e.g., Mahalanobis distance) computed on the embedding output of the last layer of the encoder. In this work, we observe that OOD detection performance varies greatly depending…

    Submitted 21 February, 2024; v1 submitted 20 February, 2023; originally announced February 2023.

  5. Rainproof: An Umbrella To Shield Text Generators From Out-Of-Distribution Data

    Authors: Maxime Darrin, Pablo Piantanida, Pierre Colombo

    Abstract: Implementing effective control mechanisms to ensure the proper functioning and security of deployed NLP models, from translation to chatbots, is essential. A key ingredient to ensure safe system behaviour is Out-Of-Distribution (OOD) detection, which aims to detect whether an input sample is statistically far from the training distribution. Although OOD detection is a widely covered topic in class…

    Submitted 1 November, 2023; v1 submitted 18 December, 2022; originally announced December 2022.