Zum Hauptinhalt springen

Showing 1–7 of 7 results for author: Rita, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.21783  [pdf, other

    cs.AI cs.CL cs.CV

    The Llama 3 Herd of Models

    Authors: Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang , et al. (510 additional authors not shown)

    Abstract: Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical… ▽ More

    Submitted 15 August, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

  2. arXiv:2404.19409  [pdf, other

    cs.CL

    Countering Reward Over-optimization in LLM with Demonstration-Guided Reinforcement Learning

    Authors: Mathieu Rita, Florian Strub, Rahma Chaabouni, Paul Michel, Emmanuel Dupoux, Olivier Pietquin

    Abstract: While Reinforcement Learning (RL) has been proven essential for tuning large language models (LLMs), it can lead to reward over-optimization (ROO). Existing approaches address ROO by adding KL regularization, requiring computationally expensive hyperparameter tuning. Additionally, KL regularization focuses solely on regularizing the language policy, neglecting a potential source of regularization:… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  3. arXiv:2403.11958  [pdf, other

    cs.CL cs.MA

    Language Evolution with Deep Learning

    Authors: Mathieu Rita, Paul Michel, Rahma Chaabouni, Olivier Pietquin, Emmanuel Dupoux, Florian Strub

    Abstract: Computational modeling plays an essential role in the study of language emergence. It aims to simulate the conditions and learning processes that could trigger the emergence of a structured language within a simulated controlled environment. Several methods have been used to investigate the origin of our language, including agent-based systems, Bayesian agents, genetic algorithms, and rule-based s… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: to appear in the Oxford Handbook of Approaches to Language Evolution

  4. arXiv:2305.12941  [pdf, other

    cs.CL cs.NE

    On the Correspondence between Compositionality and Imitation in Emergent Neural Communication

    Authors: Emily Cheng, Mathieu Rita, Thierry Poibeau

    Abstract: Compositionality is a hallmark of human language that not only enables linguistic generalization, but also potentially facilitates acquisition. When simulating language emergence with neural networks, compositionality has been shown to improve communication performance; however, its impact on imitation learning has yet to be investigated. Our work explores the link between compositionality and imi… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023; 5 pages + 8 pages of supplementary materials

  5. arXiv:2209.15342  [pdf, other

    cs.MA cs.CL cs.IT

    Emergent Communication: Generalization and Overfitting in Lewis Games

    Authors: Mathieu Rita, Corentin Tallec, Paul Michel, Jean-Bastien Grill, Olivier Pietquin, Emmanuel Dupoux, Florian Strub

    Abstract: Lewis signaling games are a class of simple communication games for simulating the emergence of language. In these games, two agents must agree on a communication protocol in order to solve a cooperative task. Previous work has shown that agents trained to play this game with reinforcement learning tend to develop languages that display undesirable properties from a linguistic point of view (lack… ▽ More

    Submitted 15 October, 2022; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  6. arXiv:2204.12982  [pdf, other

    cs.MA

    On the role of population heterogeneity in emergent communication

    Authors: Mathieu Rita, Florian Strub, Jean-Bastien Grill, Olivier Pietquin, Emmanuel Dupoux

    Abstract: Populations have often been perceived as a structuring component for language to emerge and evolve: the larger the population, the more structured the language. While this observation is widespread in the sociolinguistic literature, it has not been consistently reproduced in computer simulations with neural agents. In this paper, we thus aim to clarify this apparent contradiction. We explore emerg… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: International Conference on Learning Representations (ICLR) 2022

  7. arXiv:2010.01878  [pdf, other

    cs.CL cs.AI cs.MA

    "LazImpa": Lazy and Impatient neural agents learn to communicate efficiently

    Authors: Mathieu Rita, Rahma Chaabouni, Emmanuel Dupoux

    Abstract: Previous work has shown that artificial neural agents naturally develop surprisingly non-efficient codes. This is illustrated by the fact that in a referential game involving a speaker and a listener neural networks optimizing accurate transmission over a discrete channel, the emergent messages fail to achieve an optimal length. Furthermore, frequent messages tend to be longer than infrequent ones… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted to CoNLL 2020

    MSC Class: I.2 ACM Class: I.2

    Journal ref: Proceedings of CoNLL 2020