Showing 1–7 of 7 results for author: Arnal, C

Searching in archive cs.
  1. arXiv:2407.15578  [pdf, other]

    math.DG cs.CG

    The distance function to a finite set is a topological Morse function

    Authors: Charles Arnal

    Abstract: In this short note, we show that the distance function to any finite set $X\subset \mathbb{R}^n$ is a topological Morse function, regardless of whether $X$ is in general position. We also precisely characterize its topological critical points and their indices, and relate them to the differential critical points of the function.

    Submitted 22 July, 2024; originally announced July 2024.

  2. arXiv:2407.10645  [pdf, other]

    cs.CL cs.AI cs.CY

    Prompt Selection Matters: Enhancing Text Annotations for Social Sciences with Large Language Models

    Authors: Louis Abraham, Charles Arnal, Antoine Marie

    Abstract: Large Language Models have recently been applied to text annotation tasks from social sciences, equalling or surpassing the performance of human workers at a fraction of the cost. However, no inquiry has yet been made on the impact of prompt selection on labelling accuracy. In this study, we show that performance greatly varies between prompts, and we apply the method of automatic prompt optimizat…

    Submitted 15 July, 2024; originally announced July 2024.

  3. arXiv:2406.14919  [pdf, other]

    cs.CG math.PR

    Wasserstein convergence of Čech persistence diagrams for samplings of submanifolds

    Authors: Charles Arnal, David Cohen-Steiner, Vincent Divol

    Abstract: Čech Persistence diagrams (PDs) are topological descriptors routinely used to capture the geometry of complex datasets. They are commonly compared using the Wasserstein distances $OT_{p}$; however, the extent to which PDs are stable with respect to these metrics remains poorly understood. We partially close this gap by focusing on the case where datasets are sampled on an $m$-dimensional submanifo…

    Submitted 12 July, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

    MSC Class: 55N31; 62R40

  4. arXiv:2406.02128  [pdf, other]

    cs.LG cs.AI cs.CL

    Iteration Head: A Mechanistic Study of Chain-of-Thought

    Authors: Vivien Cabannes, Charles Arnal, Wassim Bouaziz, Alice Yang, Francois Charton, Julia Kempe

    Abstract: Chain-of-Thought (CoT) reasoning is known to improve Large Language Models both empirically and in terms of theoretical approximation power. However, our understanding of the inner workings and conditions of apparition of CoT capabilities remains limited. This paper helps fill this gap by demonstrating how CoT reasoning emerges in transformers in a controlled and interpretable setting. In particul…

    Submitted 4 June, 2024; originally announced June 2024.

  5. arXiv:2402.13079  [pdf, ps, other]

    stat.ML cs.IR cs.IT cs.LG

    Mode Estimation with Partial Feedback

    Authors: Charles Arnal, Vivien Cabannes, Vianney Perchet

    Abstract: The combination of lightly supervised pre-training and online fine-tuning has played a key role in recent AI developments. These new learning pipelines call for new theoretical frameworks. In this paper, we formalize core aspects of weakly supervised and active learning with a simple problem: the estimation of the mode of a distribution using partial feedback. We show how entropy coding allows for…

    Submitted 20 February, 2024; originally announced February 2024.

    MSC Class: 62L05; 62B86; 62D10; 62B10

  6. arXiv:2311.13845  [pdf, other]

    cs.LG cs.AI stat.ML

    Touring sampling with pushforward maps

    Authors: Vivien Cabannes, Charles Arnal

    Abstract: The number of sampling methods could be daunting for a practitioner looking to cast powerful machine learning methods to their specific problem. This paper takes a theoretical stance to review and organize many sampling approaches in the ``generative modeling'' setting, where one wants to generate new data that are similar to some training examples. By revealing links between existing methods, it…

    Submitted 20 February, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: 5 pages

    Journal ref: ICASSP, 2024

  7. arXiv:2305.13271  [pdf, other]

    stat.ML cs.LG

    MAGDiff: Covariate Data Set Shift Detection via Activation Graphs of Deep Neural Networks

    Authors: Charles Arnal, Felix Hensel, Mathieu Carrière, Théo Lacombe, Hiroaki Kurihara, Yuichi Ike, Frédéric Chazal

    Abstract: Despite their successful application to a variety of tasks, neural networks remain limited, like other machine learning methods, by their sensitivity to shifts in the data: their performance can be severely impacted by differences in distribution between the data on which they were trained and that on which they are deployed. In this article, we propose a new family of representations, called MAGD…

    Submitted 12 May, 2024; v1 submitted 22 May, 2023; originally announced May 2023.