Showing 1–19 of 19 results for author: Servan, C

  1. arXiv:2404.11122  [pdf, other]

    cs.AI

    Small Language Models are Good Too: An Empirical Study of Zero-Shot Classification

    Authors: Pierre Lepagnol, Thomas Gerald, Sahar Ghannay, Christophe Servan, Sophie Rosset

    Abstract: This study is part of the debate on the efficiency of large versus small language models for text classification by prompting. We assess the performance of small language models in zero-shot text classification, challenging the prevailing dominance of large models. Across 15 datasets, our investigation benchmarks language models from 77M to 40B parameters using different architectures and scoring fu…

    Submitted 17 April, 2024; originally announced April 2024.

    Journal ref: LREC-COLING 2024, May 2024, TURIN, Italy

  2. arXiv:2403.19727  [pdf, ps, other]

    cs.CL cs.AI

    New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark

    Authors: Nadège Alavoine, Gaëlle Laperriere, Christophe Servan, Sahar Ghannay, Sophie Rosset

    Abstract: Intent classification and slot-filling are essential tasks of Spoken Language Understanding (SLU). In most SLU systems, those tasks are realized by independent modules. For about fifteen years, models achieving both of them jointly and exploiting their mutual enhancement have been proposed. A multilingual module using a joint model was envisioned to create a touristic dialogue system for a European p…

    Submitted 28 March, 2024; originally announced March 2024.

    Journal ref: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024, Torino, Italy

  3. arXiv:2403.19726  [pdf, other]

    cs.CL cs.AI q-bio.QM

    A Benchmark Evaluation of Clinical Named Entity Recognition in French

    Authors: Nesrine Bannour, Christophe Servan, Aurélie Névéol, Xavier Tannier

    Abstract: Background: Transformer-based language models have shown strong performance on many Natural Language Processing (NLP) tasks. Masked Language Models (MLMs) attract sustained interest because they can be adapted to different languages and sub-domains through training or fine-tuning on specific corpora while remaining lighter than modern Large Language Models (LLMs). Recently, several MLMs have been rel…

    Submitted 28 March, 2024; originally announced March 2024.

    Journal ref: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024, Torino, Italy

  4. arXiv:2403.18338  [pdf, other]

    cs.AI

    mALBERT: Is a Compact Multilingual BERT Model Still Worth It?

    Authors: Christophe Servan, Sahar Ghannay, Sophie Rosset

    Abstract: Within the current trend of Pretrained Language Models (PLM), emerge more and more criticisms about the ethical and ecological impact of such models. In this article, considering these critical remarks, we propose to focus on smaller models, such as compact models like ALBERT, which are more ecologically virtuous than these PLM. However, PLMs enable huge breakthroughs in Natural Language Processing ta…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, May 2024, Torino, Italy

  5. arXiv:2310.14392  [pdf, other]

    q-bio.PE cond-mat.stat-mech

    Effects of phylogeny on coexistence in model communities

    Authors: Carlos A. Servan, Jose A. Capitan, Zachary R. Miller, Stefano Allesina

    Abstract: Species' interactions are shaped by their traits. Thus, we expect traits -- in particular, trait (dis)similarity -- to play a central role in determining whether a particular set of species coexists. Traits are, in turn, the outcome of an eco-evolutionary process summarized by a phylogenetic tree. Therefore, the phylogenetic tree associated with a set of species should carry information about the…

    Submitted 10 August, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

    MSC Class: 92D40 (Primary); 60B20 (Secondary)

  6. arXiv:2305.04153  [pdf, ps, other]

    math.GT math.AG math.CV

    Isometric embeddings of Teichmüller spaces are covering constructions

    Authors: Frederik Benirschke, Carlos A. Serván

    Abstract: Pulling back complex structures along a branched covering induces a holomorphic isometric embedding of Teichmüller spaces. We show that for dimension at least $2$, all isometric embeddings arise from branched coverings. This generalizes a theorem of Royden. As a consequence we obtain that totally geodesic submanifolds of Teichmüller space, which are isometric to some Teichmüller space, are coverin…

    Submitted 6 May, 2023; originally announced May 2023.

    Comments: 17 pages

  7. arXiv:2207.09157  [pdf, ps, other]

    cs.CL

    On the cross-lingual transferability of multilingual prototypical models across NLU tasks

    Authors: Oralie Cattan, Christophe Servan, Sophie Rosset

    Abstract: Supervised deep learning-based approaches have been applied to task-oriented dialog and have proven to be effective for limited domain and language applications when a sufficient number of training examples are available. In practice, these approaches suffer from the drawbacks of domain-driven design and under-resourced languages. Domain and language models are supposed to grow and change as the p…

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted to the ACL workshop METANLP 2021

    MSC Class: 68T50; ACM Class: I.2.7

  8. arXiv:2207.09152  [pdf, ps, other]

    cs.CL cs.AI

    Benchmarking Transformers-based models on French Spoken Language Understanding tasks

    Authors: Oralie Cattan, Sahar Ghannay, Christophe Servan, Sophie Rosset

    Abstract: In the last five years, the rise of the self-attentional Transformer-based architectures led to state-of-the-art performances over many natural language tasks. Although these approaches are increasingly popular, they require large amounts of data and computational resources. There is still a substantial need for benchmarking methodologies ever upwards on under-resourced languages in data-scarce ap…

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted paper at INTERSPEECH 2022

    MSC Class: 68T50; ACM Class: I.2.7

  9. arXiv:2207.09150  [pdf, ps, other]

    cs.CL cs.AI

    On the Usability of Transformers-based models for a French Question-Answering task

    Authors: Oralie Cattan, Christophe Servan, Sophie Rosset

    Abstract: For many tasks, state-of-the-art results have been achieved with Transformer-based architectures, resulting in a paradigmatic shift in practices from the use of task-specific architectures to the fine-tuning of pre-trained language models. The ongoing trend consists in training models with an ever-increasing amount of data and parameters, which requires considerable resources. It leads to a strong…

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: French compact model paper: FrALBERT, Accepted to RANLP 2021

    MSC Class: 68T50; ACM Class: I.2.7

  10. arXiv:2207.01704  [pdf, other]

    math.AG math.GT

    On the uniqueness of the Prym map

    Authors: Carlos A. Serván

    Abstract: The classical Prym construction associates to a smooth, genus $g$ complex curve $X$ equipped with a nonzero cohomology class $θ\in H^1(X,\mathbb{Z}/2\mathbb{Z})$, a principally polarized abelian variety (PPAV) $\mbox{Prym}(X,θ)$. Denote the moduli space of pairs $(X,θ)$ by $\mathcal{R}_g$, and let $\mathcal{A}_h$ be the moduli space of PPAVs of dimension $h$. The Prym construction globalizes to a…

    Submitted 4 July, 2022; originally announced July 2022.

  11. arXiv:1910.07481  [pdf, ps, other]

    cs.CL

    Using Whole Document Context in Neural Machine Translation

    Authors: Valentin Macé, Christophe Servan

    Abstract: In Machine Translation, considering the document as a whole can help to resolve ambiguities and inconsistencies. In this paper, we propose a simple yet promising approach to add contextual information in Neural Machine Translation. We present a method to add source context that captures the whole document with accurate boundaries, taking every word into account. We provide this additional informati…

    Submitted 16 October, 2019; originally announced October 2019.

    Comments: Accepted paper to IWSLT2019

  12. arXiv:1907.05790  [pdf, other]

    cs.CL cs.IR cs.LG stat.ML

    Qwant Research @DEFT 2019: Document matching and information retrieval using clinical cases

    Authors: Estelle Maudet, Oralie Cattan, Maureen de Seyssel, Christophe Servan

    Abstract: This paper reports on Qwant Research contribution to tasks 2 and 3 of the DEFT 2019's challenge, focusing on French clinical cases analysis. Task 2 is a task on semantic similarity between clinical cases and discussions. For this task, we propose an approach based on language models and evaluate the impact on the results of different preprocessings and matching techniques. For task 3, we have deve…

    Submitted 6 July, 2019; originally announced July 2019.

    Comments: Article accepted at the workshop DEfi fouille de Texte (DEFT 2019). Article in French

    Journal ref: DEFT 2019

  13. arXiv:1903.11299  [pdf, other]

    cs.CV cs.CL

    Image search using multilingual texts: a cross-modal learning approach between image and text

    Authors: Maxime Portaz, Hicham Randrianarivo, Adrien Nivaggioli, Estelle Maudet, Christophe Servan, Sylvain Peyronnet

    Abstract: Multilingual (or cross-lingual) embeddings represent several languages in a unique vector space. Using a common embedding space enables for a shared semantic between words from different languages. In this paper, we propose to embed images and texts into a unique distributional vector space, enabling to search images by using text queries expressing information needs related to the (visual) conten…

    Submitted 14 May, 2019; v1 submitted 27 March, 2019; originally announced March 2019.

  14. arXiv:1902.08278  [pdf, other]

    cs.SI physics.soc-ph

    Thresholding normally distributed data creates complex networks

    Authors: George T. Cantwell, Yanchen Liu, Benjamin F. Maier, Alice C. Schwarze, Carlos A. Serván, Jordan Snyder, Guillaume St-Onge

    Abstract: Network data sets are often constructed by some kind of thresholding procedure. The resulting networks frequently possess properties such as heavy-tailed degree distributions, clustering, large connected components and short average shortest path lengths. These properties are considered typical of complex networks and appear in many contexts, prompting consideration of their universality. Here we…

    Submitted 29 May, 2020; v1 submitted 21 February, 2019; originally announced February 2019.

    Comments: incorporated referees' suggestions; to be published in Phys. Rev. E

    Journal ref: Phys. Rev. E 101, 062302 (2020)

  15. arXiv:1709.03814  [pdf, other]

    cs.CL

    SYSTRAN Purely Neural MT Engines for WMT2017

    Authors: Yongchao Deng, Jungi Kim, Guillaume Klein, Catherine Kobus, Natalia Segal, Christophe Servan, Bo Wang, Dakun Zhang, Josep Crego, Jean Senellart

    Abstract: This paper describes SYSTRAN's systems submitted to the WMT 2017 shared news translation task for English-German, in both translation directions. Our systems are built using OpenNMT, an open-source neural machine translation system, implementing sequence-to-sequence models with LSTM encoder/decoders and attention. We experimented using monolingual data automatically back-translated. Our resulting…

    Submitted 12 September, 2017; originally announced September 2017.

    Comments: Published in WMT 2017

  16. arXiv:1612.06141  [pdf, other]

    cs.CL

    Domain specialization: a post-training domain adaptation for Neural Machine Translation

    Authors: Christophe Servan, Josep Crego, Jean Senellart

    Abstract: Domain adaptation is a key feature in Machine Translation. It generally encompasses terminology, domain and style adaptation, especially for human post-editing workflows in Computer Assisted Translation (CAT). With Neural Machine Translation (NMT), we introduce a new notion of domain adaptation that we call "specialization" and which is showing promising results both in the learning speed and in a…

    Submitted 19 December, 2016; originally announced December 2016.

    Comments: Submitted to EACL 2017 short paper

  17. arXiv:1612.01744  [pdf, other]

    cs.CL

    Listen and Translate: A Proof of Concept for End-to-End Speech-to-Text Translation

    Authors: Alexandre Berard, Olivier Pietquin, Christophe Servan, Laurent Besacier

    Abstract: This paper proposes a first attempt to build an end-to-end speech-to-text translation system, which does not use source language transcription during learning or decoding. We propose a model for direct speech-to-text translation, which gives promising results on a small French-English synthetic corpus. Relaxing the need for source language transcription would drastically change the data collection…

    Submitted 6 December, 2016; originally announced December 2016.

    Comments: accepted to NIPS workshop on End-to-end Learning for Speech and Audio Processing

  18. arXiv:1610.05540  [pdf, ps, other]

    cs.CL

    SYSTRAN's Pure Neural Machine Translation Systems

    Authors: Josep Crego, Jungi Kim, Guillaume Klein, Anabel Rebollo, Kathy Yang, Jean Senellart, Egor Akhanov, Patrice Brunelle, Aurelien Coquard, Yongchao Deng, Satoshi Enoue, Chiyo Geiss, Joshua Johanson, Ardas Khalsa, Raoum Khiari, Byeongil Ko, Catherine Kobus, Jean Lorieux, Leidiana Martins, Dang-Chuan Nguyen, Alexandra Priori, Thomas Riccardi, Natalia Segal, Christophe Servan, Cyril Tiquet, et al. (5 additional authors not shown)

    Abstract: Since the first online demonstration of Neural Machine Translation (NMT) by LISA, NMT development has recently moved from laboratory to production systems as demonstrated by several entities announcing roll-out of NMT engines to replace their existing technologies. NMT systems have a large number of training configurations and the training process of such systems is usually very long, often a few…

    Submitted 18 October, 2016; originally announced October 2016.

  19. arXiv:1610.01291  [pdf, ps, other]

    cs.CL

    Word2Vec vs DBnary: Augmenting METEOR using Vector Representations or Lexical Resources?

    Authors: Christophe Servan, Alexandre Berard, Zied Elloumi, Hervé Blanchon, Laurent Besacier

    Abstract: This paper presents an approach combining lexico-semantic resources and distributed representations of words applied to the evaluation in machine translation (MT). This study is made through the enrichment of a well-known MT evaluation metric: METEOR. This metric enables an approximate match (synonymy or morphological similarity) between an automatic and a reference translation. Our experiments ar…

    Submitted 5 October, 2016; originally announced October 2016.

    Comments: accepted to COLING 2016 conference