Showing 1–5 of 5 results for author: Jacq, A

Searching in archive cs.
  1. arXiv:2211.03521  [pdf, other]

    cs.AI

    On the importance of data collection for training general goal-reaching policies

    Authors: Alexis Jacq, Manu Orsini, Gabriel Dulac-Arnold, Olivier Pietquin, Matthieu Geist, Olivier Bachem

    Abstract: Recent advances in ML suggest that the quantity of data available to a model is one of the primary bottlenecks to high performance. Although for language-based tasks there exist almost unlimited amounts of reasonably coherent data to train from, this is generally not the case for Reinforcement Learning, especially when dealing with a novel environment. In effect, even a relatively trivial continuo…

    Submitted 20 February, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

  2. arXiv:2203.08542  [pdf, other]

    cs.LG cs.AI

    Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act

    Authors: Alexis Jacq, Johan Ferret, Olivier Pietquin, Matthieu Geist

    Abstract: Traditionally, Reinforcement Learning (RL) aims at deciding how to act optimally for an artificial agent. We argue that deciding when to act is equally important. As humans, we drift from default, instinctive or memorized behaviors to focused, thought-out behaviors when required by the situation. To enhance RL agents with this aptitude, we propose to augment the standard Markov Decision Process an…

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: AAMAS 2022 (14 pages extended version, added Sec. 7.4 and appendix K)

    Journal ref: Autonomous Agents and Multi-Agent Systems (2022)

  3. arXiv:2006.00979  [pdf, other]

    cs.LG cs.AI

    Acme: A Research Framework for Distributed Reinforcement Learning

    Authors: Matthew W. Hoffman, Bobak Shahriari, John Aslanides, Gabriel Barth-Maron, Nikola Momchev, Danila Sinopalnikov, Piotr Stańczyk, Sabela Ramos, Anton Raichuk, Damien Vincent, Léonard Hussenot, Robert Dadashi, Gabriel Dulac-Arnold, Manu Orsini, Alexis Jacq, Johan Ferret, Nino Vieillard, Seyed Kamyar Seyed Ghasemipour, Sertan Girgin, Olivier Pietquin, Feryal Behbahani, Tamara Norman, Abbas Abdolmaleki, Albin Cassirer, Fan Yang, et al. (14 additional authors not shown)

    Abstract: Deep reinforcement learning (RL) has led to many recent and groundbreaking advances. However, these advances have often come at the cost of both increased scale in the underlying architectures being trained as well as increased complexity of the RL algorithms used to train them. These increases have in turn made it more difficult for researchers to rapidly prototype new ideas or reproduce publishe…

    Submitted 20 September, 2022; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: This work presents a second version of the paper which coincides with an increase in modularity, additional emphasis on offline, imitation and learning from demonstrations algorithms, as well as various new agents implemented as part of Acme

  4. arXiv:1906.09831  [pdf, other]

    cs.GT cs.AI cs.LG

    Foolproof Cooperative Learning

    Authors: Alexis Jacq, Julien Perolat, Matthieu Geist, Olivier Pietquin

    Abstract: This paper extends the notion of learning equilibrium in game theory from matrix games to stochastic games. We introduce Foolproof Cooperative Learning (FCL), an algorithm that converges to a Tit-for-Tat behavior. It allows cooperative strategies when played against itself while being not exploitable by selfish players. We prove that in repeated symmetric games, this algorithm is a learning equili…

    Submitted 15 October, 2020; v1 submitted 24 June, 2019; originally announced June 2019.

    Journal ref: Proceedings of The 12th Asian Conference on Machine Learning, PMLR 129:401-416, 2020

  5. arXiv:1602.06703  [pdf, other]

    cs.RO cs.CY

    Cognitive Architecture for Mutual Modelling

    Authors: Alexis Jacq, Wafa Johal, Pierre Dillenbourg, Ana Paiva

    Abstract: In social robotics, robots need to be able to be understood by humans, especially in collaborative tasks where they have to share mutual knowledge. For instance, in an educative scenario, learners share their knowledge and they must adapt their behaviour in order to make sure they are understood by others. Learners display behaviours in order to show their understanding and teachers adapt in orde…

    Submitted 22 February, 2016; originally announced February 2016.

    Comments: Presented at "2nd Workshop on Cognitive Architectures for Social Human-Robot Interaction 2016" (arXiv:1602.01868)

    Report number: CogArch4sHRI/2016/07