Zum Hauptinhalt springen

Showing 1–26 of 26 results for author: Colas, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.04503  [pdf, other

    physics.soc-ph cs.AI cs.MA

    When LLMs Play the Telephone Game: Cumulative Changes and Attractors in Iterated Cultural Transmissions

    Authors: Jérémy Perez, Corentin Léger, Grgur Kovač, Cédric Colas, Gaia Molinaro, Maxime Derex, Pierre-Yves Oudeyer, Clément Moulin-Frier

    Abstract: As large language models (LLMs) start interacting with each other and generating an increasing amount of text online, it becomes crucial to better understand how information is transformed as it passes from one LLM to the next. While significant research has examined individual LLM behaviors, existing studies have largely overlooked the collective behaviors and information distortions arising from… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Code available at https://github.com/jeremyperez2/TelephoneGameLLM. Companion website with a Data Explorer tool at https://sites.google.com/view/telephone-game-llm

    MSC Class: 68T50 ACM Class: I.2.7

  2. arXiv:2405.04118  [pdf, other

    cs.LG cs.AI cs.CL

    Policy Learning with a Language Bottleneck

    Authors: Megha Srivastava, Cedric Colas, Dorsa Sadigh, Jacob Andreas

    Abstract: Modern AI systems such as self-driving cars and game-playing agents achieve superhuman performance, but often lack human-like features such as generalization, interpretability and human inter-operability. Inspired by the rich interactions between language and decision-making in humans, we introduce Policy Learning with a Language Bottleneck (PLLB), a framework enabling AI agents to generate lingui… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 18 pages, 13 figures

  3. arXiv:2311.00344  [pdf, other

    cs.AI

    A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents

    Authors: Olivier Sigaud, Gianluca Baldassarre, Cedric Colas, Stephane Doncieux, Richard Duro, Pierre-Yves Oudeyer, Nicolas Perrin-Gilbert, Vieri Giuliano Santucci

    Abstract: A lot of recent machine learning research papers have ``open-ended learning'' in their title. But very few of them attempt to define what they mean when using the term. Even worse, when looking more closely there seems to be no consensus on what distinguishes open-ended learning from related concepts such as continual learning, lifelong learning or autotelic learning. In this paper, we contribute… ▽ More

    Submitted 7 June, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

  4. arXiv:2310.10692  [pdf, other

    cs.LG cs.AI

    ACES: Generating Diverse Programming Puzzles with with Autotelic Generative Models

    Authors: Julien Pourcel, Cédric Colas, Gaia Molinaro, Pierre-Yves Oudeyer, Laetitia Teodorescu

    Abstract: The ability to invent novel and interesting problems is a remarkable feature of human intelligence that drives innovation, art, and science. We propose a method that aims to automate this process by harnessing the power of state-of-the-art generative models to produce a diversity of challenging yet solvable problems, here in the context of Python programming puzzles. Inspired by the intrinsically… ▽ More

    Submitted 29 May, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

  5. arXiv:2307.07870  [pdf, other

    cs.CL cs.AI cs.LG

    Large Language Models as Superpositions of Cultural Perspectives

    Authors: Grgur Kovač, Masataka Sawayama, Rémy Portelas, Cédric Colas, Peter Ford Dominey, Pierre-Yves Oudeyer

    Abstract: Large Language Models (LLMs) are often misleadingly recognized as having a personality or a set of values. We argue that an LLM can be seen as a superposition of perspectives with different values and personality traits. LLMs exhibit context-dependent values and personality traits that change based on the induced perspective (as opposed to humans, who tend to have more coherent values and personal… ▽ More

    Submitted 7 November, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: Preprint

    MSC Class: 68T07 ACM Class: I.2.7

  6. arXiv:2305.12487  [pdf, other

    cs.AI cs.CL cs.LG

    Augmenting Autotelic Agents with Large Language Models

    Authors: Cédric Colas, Laetitia Teodorescu, Pierre-Yves Oudeyer, Xingdi Yuan, Marc-Alexandre Côté

    Abstract: Humans learn to master open-ended repertoires of skills by imagining and practicing their own goals. This autotelic learning process, literally the pursuit of self-generated (auto) goals (telos), becomes more and more open-ended as the goals become more diverse, abstract and creative. The resulting exploration of the space of possible skills is supported by an inter-individual exploration: goal re… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

  7. arXiv:2302.06692  [pdf, other

    cs.LG cs.AI cs.CL

    Guiding Pretraining in Reinforcement Learning with Large Language Models

    Authors: Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas

    Abstract: Reinforcement learning algorithms typically struggle in the absence of a dense, well-shaped reward function. Intrinsically motivated exploration methods address this limitation by rewarding agents for visiting novel states or transitions, but these methods offer limited benefits in large environments where most discovered novelty is irrelevant for downstream tasks. We describe a method that uses b… ▽ More

    Submitted 14 September, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  8. Language and Culture Internalisation for Human-Like Autotelic AI

    Authors: Cédric Colas, Tristan Karch, Clément Moulin-Frier, Pierre-Yves Oudeyer

    Abstract: Building autonomous agents able to grow open-ended repertoires of skills across their lives is a fundamental goal of artificial intelligence (AI). A promising developmental approach recommends the design of intrinsically motivated agents that learn new skills by generating and pursuing their own goals - autotelic agents. But despite recent progress, existing algorithms still show serious limitatio… ▽ More

    Submitted 16 November, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

    Journal ref: Nature Machine Intelligence 4, 1068-1076 (2022)

  9. arXiv:2202.05129  [pdf, other

    cs.AI cs.HC

    Help Me Explore: Minimal Social Interventions for Graph-Based Autotelic Agents

    Authors: Ahmed Akakzia, Olivier Serris, Olivier Sigaud, Cédric Colas

    Abstract: In the quest for autonomous agents learning open-ended repertoires of skills, most works take a Piagetian perspective: learning trajectories are the results of interactions between developmental agents and their physical environment. The Vygotskian perspective, on the other hand, emphasizes the centrality of the socio-cultural environment: higher cognitive functions emerge from transmissions of so… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: 18 pages, 11 figures

  10. Towards Teachable Autotelic Agents

    Authors: Olivier Sigaud, Ahmed Akakzia, Hugo Caselles-Dupré, Cédric Colas, Pierre-Yves Oudeyer, Mohamed Chetouani

    Abstract: Autonomous discovery and direct instruction are two distinct sources of learning in children but education sciences demonstrate that mixed approaches such as assisted discovery or guided play result in improved skill acquisition. In the field of Artificial Intelligence, these extremes respectively map to autonomous agents learning from their own signals and interactive learning agents fully taught… ▽ More

    Submitted 20 March, 2023; v1 submitted 25 May, 2021; originally announced May 2021.

    Journal ref: Sigaud, O., Akakzia, A., Caselles-Dupré, H., Colas, C., Oudeyer, P. Y., & Chetouani, M. (2022). Towards Teachable Autotelic Agents. IEEE Transactions on Cognitive and Developmental Systems

  11. arXiv:2012.09830  [pdf, other

    cs.LG cs.AI

    Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey

    Authors: Cédric Colas, Tristan Karch, Olivier Sigaud, Pierre-Yves Oudeyer

    Abstract: Building autonomous machines that can explore open-ended environments, discover possible interactions and build repertoires of skills is a general objective of artificial intelligence. Developmental approaches argue that this can only be achieved by $autotelic$ $agents$: intrinsically motivated learning agents that can learn to represent, generate, select and solve their own problems. In recent ye… ▽ More

    Submitted 12 July, 2022; v1 submitted 17 December, 2020; originally announced December 2020.

  12. arXiv:2010.04452  [pdf, other

    cs.LG math.OC q-bio.PE

    EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models

    Authors: Cédric Colas, Boris Hejblum, Sébastien Rouillon, Rodolphe Thiébaut, Pierre-Yves Oudeyer, Clément Moulin-Frier, Mélanie Prague

    Abstract: Epidemiologists model the dynamics of epidemics in order to propose control strategies based on pharmaceutical and non-pharmaceutical interventions (contact limitation, lock down, vaccination, etc). Hand-designing such strategies is not trivial because of the number of possible interventions and the difficulty to predict long-term effects. This task can be cast as an optimization problem where sta… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

    Journal ref: Journal of Artificial Intelligence Research-2021

  13. arXiv:2006.07185  [pdf, other

    cs.AI cs.LG stat.ML

    Grounding Language to Autonomously-Acquired Skills via Goal Generation

    Authors: Ahmed Akakzia, Cédric Colas, Pierre-Yves Oudeyer, Mohamed Chetouani, Olivier Sigaud

    Abstract: We are interested in the autonomous acquisition of repertoires of skills. Language-conditioned reinforcement learning (LC-RL) approaches are great tools in this quest, as they allow to express abstract goals as sets of constraints on the states. However, most LC-RL agents are not autonomous and cannot learn without external instructions and feedback. Besides, their direct language condition cannot… ▽ More

    Submitted 25 January, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: Published at ICLR 2021

  14. arXiv:2006.07043  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Language-Conditioned Goal Generation: a New Approach to Language Grounding for RL

    Authors: Cédric Colas, Ahmed Akakzia, Pierre-Yves Oudeyer, Mohamed Chetouani, Olivier Sigaud

    Abstract: In the real world, linguistic agents are also embodied agents: they perceive and act in the physical world. The notion of Language Grounding questions the interactions between language and embodiment: how do learning agents connect or ground linguistic representations to the physical world ? This question has recently been approached by the Reinforcement Learning community under the framework of i… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

  15. arXiv:2003.09443  [pdf, other

    cs.LG cs.AI stat.ML

    Deep Sets for Generalization in RL

    Authors: Tristan Karch, Cédric Colas, Laetitia Teodorescu, Clément Moulin-Frier, Pierre-Yves Oudeyer

    Abstract: This paper investigates the idea of encoding object-centered representations in the design of the reward function and policy architectures of a language-guided reinforcement learning agent. This is done using a combination of object-wise permutation invariant networks inspired from Deep Sets and gated-attention mechanisms. In a 2D procedurally-generated world where agents targeting goals in natura… ▽ More

    Submitted 20 March, 2020; originally announced March 2020.

    Comments: 15 pages, 10 figures, published as a workshop Paper at ICLR: Beyond tabula rasa in RL (BeTR-RL). arXiv admin note: substantial text overlap with arXiv:2002.09253

  16. arXiv:2003.04664  [pdf, other

    cs.LG cs.AI stat.ML

    Automatic Curriculum Learning For Deep RL: A Short Survey

    Authors: Rémy Portelas, Cédric Colas, Lilian Weng, Katja Hofmann, Pierre-Yves Oudeyer

    Abstract: Automatic Curriculum Learning (ACL) has become a cornerstone of recent successes in Deep Reinforcement Learning (DRL).These methods shape the learning trajectories of agents by challenging them with tasks adapted to their capacities. In recent years, they have been used to improve sample efficiency and asymptotic performance, to organize exploration, to encourage generalization or to solve sparse… ▽ More

    Submitted 28 May, 2020; v1 submitted 10 March, 2020; originally announced March 2020.

    Comments: Accepted at IJCAI2020

  17. arXiv:2003.01825  [pdf, other

    cs.NE cs.AI cs.LG

    Scaling MAP-Elites to Deep Neuroevolution

    Authors: Cédric Colas, Joost Huizinga, Vashisht Madhavan, Jeff Clune

    Abstract: Quality-Diversity (QD) algorithms, and MAP-Elites (ME) in particular, have proven very useful for a broad range of applications including enabling real robots to recover quickly from joint damage, solving strongly deceptive maze tasks or evolving robot morphologies to discover new gaits. However, present implementations of MAP-Elites and other QD algorithms seem to be limited to low-dimensional co… ▽ More

    Submitted 5 June, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: Accepted to GECCO 2020

  18. arXiv:2002.09253  [pdf, other

    cs.AI cs.CL cs.LG

    Language as a Cognitive Tool to Imagine Goals in Curiosity-Driven Exploration

    Authors: Cédric Colas, Tristan Karch, Nicolas Lair, Jean-Michel Dussoux, Clément Moulin-Frier, Peter Ford Dominey, Pierre-Yves Oudeyer

    Abstract: Developmental machine learning studies how artificial agents can model the way children learn open-ended repertoires of skills. Such agents need to create and represent goals, select which ones to pursue and learn to achieve them. Recent approaches have considered goal spaces that were either fixed and hand-defined or learned using generative models of states. This limited agents to sample goals w… ▽ More

    Submitted 21 October, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: Contains main article and supplementaries

    Journal ref: NeurIPS 2020

  19. arXiv:1911.03219  [pdf, other

    cs.LG cs.CL stat.ML

    Language Grounding through Social Interactions and Curiosity-Driven Multi-Goal Learning

    Authors: Nicolas Lair, Cédric Colas, Rémy Portelas, Jean-Michel Dussoux, Peter Ford Dominey, Pierre-Yves Oudeyer

    Abstract: Autonomous reinforcement learning agents, like children, do not have access to predefined goals and reward functions. They must discover potential goals, learn their own reward functions and engage in their own learning trajectory. Children, however, benefit from exposure to language, helping to organize and mediate their thought. We propose LE2 (Language Enhanced Exploration), a learning algorith… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: NeurIPS 2019 Workshop ViGIL : Visually Grounded Interaction and Language

  20. arXiv:1910.07224  [pdf, other

    cs.LG cs.RO stat.ML

    Teacher algorithms for curriculum learning of Deep RL in continuously parameterized environments

    Authors: Rémy Portelas, Cédric Colas, Katja Hofmann, Pierre-Yves Oudeyer

    Abstract: We consider the problem of how a teacher algorithm can enable an unknown Deep Reinforcement Learning (DRL) student to become good at a skill over a wide range of diverse environments. To do so, we study how a teacher algorithm can learn to generate a learning curriculum, whereby it sequentially samples parameters controlling a stochastic procedural generation of environments. Because it does not i… ▽ More

    Submitted 16 October, 2019; originally announced October 2019.

    Comments: Accepted at CoRL 2019

  21. arXiv:1904.06979  [pdf, other

    stat.ME cs.LG

    A Hitchhiker's Guide to Statistical Comparisons of Reinforcement Learning Algorithms

    Authors: Cédric Colas, Olivier Sigaud, Pierre-Yves Oudeyer

    Abstract: Consistently checking the statistical significance of experimental results is the first mandatory step towards reproducible science. This paper presents a hitchhiker's guide to rigorous comparisons of reinforcement learning algorithms. After introducing the concepts of statistical testing, we review the relevant statistical tests and compare them empirically in terms of false positive rate and sta… ▽ More

    Submitted 29 August, 2022; v1 submitted 15 April, 2019; originally announced April 2019.

    Comments: 8 pages + supplementary material

  22. arXiv:1901.09720  [pdf, other

    cs.LG stat.ML

    CLIC: Curriculum Learning and Imitation for object Control in non-rewarding environments

    Authors: Pierre Fournier, Olivier Sigaud, Cédric Colas, Mohamed Chetouani

    Abstract: In this paper we study a new reinforcement learning setting where the environment is non-rewarding, contains several possibly related objects of various controllability, and where an apt agent Bob acts independently, with non-observable intentions. We argue that this setting defines a realistic scenario and we present a generic discrete-state discrete-action model of such environments. To learn in… ▽ More

    Submitted 25 March, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

  23. arXiv:1810.06284  [pdf, other

    cs.AI

    CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning

    Authors: Cédric Colas, Pierre Fournier, Olivier Sigaud, Mohamed Chetouani, Pierre-Yves Oudeyer

    Abstract: In open-ended environments, autonomous learning agents must set their own goals and build their own curriculum through an intrinsically motivated exploration. They may consider a large diversity of goals, aiming to discover what is controllable in their environments, and what is not. Because some goals might prove easy and some impossible, agents must actively select which goal to practice at any… ▽ More

    Submitted 29 May, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: Accepted at ICML 2019

    Report number: PMLR 97:1331-1340

    Journal ref: Proceedings of the 36th International Conference on Machine Learning 2019

  24. arXiv:1807.11752  [pdf, other

    cs.HC

    Compact Convolutional Neural Networks for Multi-Class, Personalised, Closed-Loop EEG-BCI

    Authors: Pablo Ortega, Cedric Colas, Aldo Faisal

    Abstract: For many people suffering from motor disabilities, assistive devices controlled with only brain activity are the only way to interact with their environment. Natural tasks often require different kinds of interactions, involving different controllers the user should be able to select in a self-paced way. We developed a Brain-Computer Interface (BCI) allowing users to switch between four control mo… ▽ More

    Submitted 31 July, 2018; originally announced July 2018.

  25. arXiv:1806.08295  [pdf, other

    cs.LG stat.ML

    How Many Random Seeds? Statistical Power Analysis in Deep Reinforcement Learning Experiments

    Authors: Cédric Colas, Olivier Sigaud, Pierre-Yves Oudeyer

    Abstract: Consistently checking the statistical significance of experimental results is one of the mandatory methodological steps to address the so-called "reproducibility crisis" in deep reinforcement learning. In this tutorial paper, we explain how the number of random seeds relates to the probabilities of statistical errors. For both the t-test and the bootstrap confidence interval test, we recall theore… ▽ More

    Submitted 5 July, 2018; v1 submitted 21 June, 2018; originally announced June 2018.

  26. arXiv:1802.05054  [pdf, other

    cs.LG

    GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms

    Authors: Cédric Colas, Olivier Sigaud, Pierre-Yves Oudeyer

    Abstract: In continuous action domains, standard deep reinforcement learning algorithms like DDPG suffer from inefficient exploration when facing sparse or deceptive reward problems. Conversely, evolutionary and developmental methods focusing on exploration like Novelty Search, Quality-Diversity or Goal Exploration Processes explore more robustly but are less efficient at fine-tuning policies using gradient… ▽ More

    Submitted 20 September, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

    Comments: accepted at ICML 2018, 14 pages