Zum Hauptinhalt springen

Showing 1–19 of 19 results for author: Pierrot, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14150  [pdf, other

    cs.LG

    Multi-modal Transfer Learning between Biological Foundation Models

    Authors: Juan Jose Garau-Luis, Patrick Bordes, Liam Gonzalez, Masa Roller, Bernardo P. de Almeida, Lorenz Hexemer, Christopher Blum, Stefan Laurent, Jan Grzegorzewski, Maren Lang, Thomas Pierrot, Guillaume Richard

    Abstract: Biological sequences encode fundamental instructions for the building blocks of life, in the form of DNA, RNA, and proteins. Modeling these sequences is key to understand disease mechanisms and is an active research area in computational biology. Recently, Large Language Models have shown great promise in solving certain biological tasks but current approaches are limited to a single sequence moda… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    MSC Class: 68T07 (Primary)

  2. arXiv:2405.15840  [pdf, other

    q-bio.QM cs.LG

    Learning the Language of Protein Structure

    Authors: Benoit Gaujac, Jérémie Donà, Liviu Copoiu, Timothy Atkinson, Thomas Pierrot, Thomas D. Barrett

    Abstract: Representation learning and \emph{de novo} generation of proteins are pivotal computational biology tasks. Whilst natural language processing (NLP) techniques have proven highly effective for protein sequence modelling, structure modelling presents a complex challenge, primarily due to its continuous and three-dimensional nature. Motivated by this discrepancy, we introduce an approach using a vect… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:2403.17164  [pdf, other

    cs.NE cs.AI cs.LG

    Multi-Objective Quality-Diversity for Crystal Structure Prediction

    Authors: Hannah Janmohamed, Marta Wolinska, Shikha Surana, Thomas Pierrot, Aron Walsh, Antoine Cully

    Abstract: Crystal structures are indispensable across various domains, from batteries to solar cells, and extensive research has been dedicated to predicting their properties based on their atomic configurations. However, prevailing Crystal Structure Prediction methods focus on identifying the most stable solutions that lie at the global minimum of the energy function. This approach overlooks other potentia… ▽ More

    Submitted 21 June, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted GECCO 2024

  4. arXiv:2308.03665  [pdf, other

    cs.AI cs.NE

    QDax: A Library for Quality-Diversity and Population-based Algorithms with Hardware Acceleration

    Authors: Felix Chalumeau, Bryan Lim, Raphael Boige, Maxime Allard, Luca Grillotti, Manon Flageat, Valentin Macé, Arthur Flajolet, Thomas Pierrot, Antoine Cully

    Abstract: QDax is an open-source library with a streamlined and modular API for Quality-Diversity (QD) optimization algorithms in Jax. The library serves as a versatile tool for optimization purposes, ranging from black-box optimization to continuous control. QDax offers implementations of popular QD, Neuroevolution, and Reinforcement Learning (RL) algorithms, supported by various examples. All the implemen… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  5. arXiv:2307.10936  [pdf, other

    cs.AI cs.LG

    PASTA: Pretrained Action-State Transformer Agents

    Authors: Raphael Boige, Yannis Flet-Berliac, Arthur Flajolet, Guillaume Richard, Thomas Pierrot

    Abstract: Self-supervised learning has brought about a revolutionary paradigm shift in various computing domains, including NLP, vision, and biology. Recent approaches involve pre-training transformer models on vast amounts of unlabeled data, serving as a starting point for efficiently solving downstream tasks. In reinforcement learning, researchers have recently adapted these approaches, developing models… ▽ More

    Submitted 4 December, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

  6. Gradient-Informed Quality Diversity for the Illumination of Discrete Spaces

    Authors: Raphael Boige, Guillaume Richard, Jérémie Dona, Thomas Pierrot, Antoine Cully

    Abstract: Quality Diversity (QD) algorithms have been proposed to search for a large collection of both diverse and high-performing solutions instead of a single set of local optima. While early QD algorithms view the objective and descriptor functions as black-box functions, novel tools have been introduced to use gradient information to accelerate the search and improve overall performance of those algori… ▽ More

    Submitted 13 September, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Journal ref: GECCO 2023 Proceedings of the Genetic and Evolutionary Computation Conference; Pages 119-128

  7. The Quality-Diversity Transformer: Generating Behavior-Conditioned Trajectories with Decision Transformers

    Authors: Valentin Macé, Raphaël Boige, Felix Chalumeau, Thomas Pierrot, Guillaume Richard, Nicolas Perrin-Gilbert

    Abstract: In the context of neuroevolution, Quality-Diversity algorithms have proven effective in generating repertoires of diverse and efficient policies by relying on the definition of a behavior space. A natural goal induced by the creation of such a repertoire is trying to achieve behaviors on demand, which can be done by running the corresponding policy from the repertoire. However, in uncertain enviro… ▽ More

    Submitted 13 September, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: 10+7 pages

  8. arXiv:2303.12803  [pdf, other

    cs.NE cs.AI

    Evolving Populations of Diverse RL Agents with MAP-Elites

    Authors: Thomas Pierrot, Arthur Flajolet

    Abstract: Quality Diversity (QD) has emerged as a powerful alternative optimization paradigm that aims at generating large and diverse collections of solutions, notably with its flagship algorithm MAP-ELITES (ME) which evolves solutions through mutations and crossovers. While very effective for some unstructured problems, early ME implementations relied exclusively on random search to evolve the population… ▽ More

    Submitted 15 June, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

  9. arXiv:2302.12668  [pdf, other

    cs.NE cs.AI cs.LG cs.RO

    Improving the Data Efficiency of Multi-Objective Quality-Diversity through Gradient Assistance and Crowding Exploration

    Authors: Hannah Janmohamed, Thomas Pierrot, Antoine Cully

    Abstract: Quality-Diversity (QD) algorithms have recently gained traction as optimisation methods due to their effectiveness at escaping local optima and capability of generating wide-ranging and high-performing solutions. Recently, Multi-Objective MAP-Elites (MOME) extended the QD paradigm to the multi-objective setting by maintaining a Pareto front in each cell of a map-elites grid. MOME achieved a global… ▽ More

    Submitted 16 May, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: Accepted GECCO 2023

  10. arXiv:2211.13742  [pdf, other

    cs.NE cs.AI

    Assessing Quality-Diversity Neuro-Evolution Algorithms Performance in Hard Exploration Problems

    Authors: Felix Chalumeau, Thomas Pierrot, Valentin Macé, Arthur Flajolet, Karim Beguir, Antoine Cully, Nicolas Perrin-Gilbert

    Abstract: A fascinating aspect of nature lies in its ability to produce a collection of organisms that are all high-performing in their niche. Quality-Diversity (QD) methods are evolutionary algorithms inspired by this observation, that obtained great results in many applications, from wing design to robot adaptation. Recently, several works demonstrated that these methods could be applied to perform neuro-… ▽ More

    Submitted 8 September, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: GECCO 2022 Workshop on Quality Diversity Algorithm Benchmarks

  11. arXiv:2210.03516  [pdf, other

    cs.NE cs.AI cs.LG

    Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery

    Authors: Felix Chalumeau, Raphael Boige, Bryan Lim, Valentin Macé, Maxime Allard, Arthur Flajolet, Antoine Cully, Thomas Pierrot

    Abstract: Deep Reinforcement Learning (RL) has emerged as a powerful paradigm for training neural policies to solve complex control tasks. However, these policies tend to be overfit to the exact specifications of the task and environment they were trained on, and thus do not perform well when conditions deviate slightly or when composed hierarchically to solve even more complex tasks. Recent work has shown… ▽ More

    Submitted 8 September, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: Camera ready version for ICLR2023 (spotlight)

  12. arXiv:2206.08888  [pdf, other

    cs.LG cs.AI cs.NE

    Fast Population-Based Reinforcement Learning on a Single Machine

    Authors: Arthur Flajolet, Claire Bizon Monroc, Karim Beguir, Thomas Pierrot

    Abstract: Training populations of agents has demonstrated great promise in Reinforcement Learning for stabilizing training, improving exploration and asymptotic performance, and generating a diverse set of solutions. However, population-based training is often not considered by practitioners as it is perceived to be either prohibitively slow (when implemented sequentially), or computationally expensive (if… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

  13. Multi-Objective Quality Diversity Optimization

    Authors: Thomas Pierrot, Guillaume Richard, Karim Beguir, Antoine Cully

    Abstract: In this work, we consider the problem of Quality-Diversity (QD) optimization with multiple objectives. QD algorithms have been proposed to search for a large collection of both diverse and high-performing solutions instead of a single set of local optima. Thriving for diversity was shown to be useful in many industrial and robotics applications. On the other hand, most real-life problems exhibit s… ▽ More

    Submitted 31 May, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

  14. arXiv:2012.01736  [pdf, other

    q-bio.BM cs.LG

    Designing a Prospective COVID-19 Therapeutic with Reinforcement Learning

    Authors: Marcin J. Skwark, Nicolás López Carranza, Thomas Pierrot, Joe Phillips, Slim Said, Alexandre Laterre, Amine Kerkeni, Uğur Şahin, Karim Beguir

    Abstract: The SARS-CoV-2 pandemic has created a global race for a cure. One approach focuses on designing a novel variant of the human angiotensin-converting enzyme 2 (ACE2) that binds more tightly to the SARS-CoV-2 spike protein and diverts it from human cells. Here we formulate a novel protein design framework as a reinforcement learning problem. We generate new designs efficiently through the combination… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

  15. arXiv:2011.14379  [pdf, other

    cs.LG

    Offline Reinforcement Learning Hands-On

    Authors: Louis Monier, Jakub Kmec, Alexandre Laterre, Thomas Pierrot, Valentin Courgeau, Olivier Sigaud, Karim Beguir

    Abstract: Offline Reinforcement Learning (RL) aims to turn large datasets into powerful decision-making engines without any online interactions with the environment. This great promise has motivated a large amount of research that hopes to replicate the success RL has experienced in simulation settings. This work ambitions to reflect upon these efforts from a practitioner viewpoint. We start by discussing t… ▽ More

    Submitted 29 November, 2020; originally announced November 2020.

    Comments: Accepted at NeurIPS 2020 Offline Reinforcement Learning Workshop. First two authors contributed equally. Authors three and four advised equally

  16. arXiv:2007.13363  [pdf, other

    cs.AI

    Learning Compositional Neural Programs for Continuous Control

    Authors: Thomas Pierrot, Nicolas Perrin, Feryal Behbahani, Alexandre Laterre, Olivier Sigaud, Karim Beguir, Nando de Freitas

    Abstract: We propose a novel solution to challenging sparse-reward, continuous control problems that require hierarchical planning at multiple levels of abstraction. Our solution, dubbed AlphaNPI-X, involves three separate stages of learning. First, we use off-policy reinforcement learning algorithms with experience replay to learn a set of atomic goal-conditioned policies, which can be easily repurposed fo… ▽ More

    Submitted 13 April, 2021; v1 submitted 27 July, 2020; originally announced July 2020.

  17. Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization

    Authors: Thomas Pierrot, Valentin Macé, Félix Chalumeau, Arthur Flajolet, Geoffrey Cideron, Karim Beguir, Antoine Cully, Olivier Sigaud, Nicolas Perrin-Gilbert

    Abstract: A fascinating aspect of nature lies in its ability to produce a large and diverse collection of organisms that are all high-performing in their niche. By contrast, most AI algorithms focus on finding a single efficient solution to a given problem. Aiming for diversity in addition to performance is a convenient way to deal with the exploration-exploitation trade-off that plays a central role in lea… ▽ More

    Submitted 31 May, 2022; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: Add several baselines (Policy Gradient assisted MAP Elites, DIAYN, AGAC) Change writing to take the point of view of the evo community Change style, writing, explanation, figures

  18. arXiv:1905.12941  [pdf, other

    cs.AI

    Learning Compositional Neural Programs with Recursive Tree Search and Planning

    Authors: Thomas Pierrot, Guillaume Ligner, Scott Reed, Olivier Sigaud, Nicolas Perrin, Alexandre Laterre, David Kas, Karim Beguir, Nando de Freitas

    Abstract: We propose a novel reinforcement learning algorithm, AlphaNPI, that incorporates the strengths of Neural Programmer-Interpreters (NPI) and AlphaZero. NPI contributes structural biases in the form of modularity, hierarchy and recursion, which are helpful to reduce sample complexity, improve generalization and increase interpretability. AlphaZero contributes powerful neural network guided search alg… ▽ More

    Submitted 13 April, 2021; v1 submitted 30 May, 2019; originally announced May 2019.

  19. arXiv:1810.08102  [pdf, other

    cs.LG stat.ML

    First-order and second-order variants of the gradient descent in a unified framework

    Authors: Thomas Pierrot, Nicolas Perrin, Olivier Sigaud

    Abstract: In this paper, we provide an overview of first-order and second-order variants of the gradient descent method that are commonly used in machine learning. We propose a general framework in which 6 of these variants can be interpreted as different instances of the same approach. They are the vanilla gradient descent, the classical and generalized Gauss-Newton methods, the natural gradient descent me… ▽ More

    Submitted 14 August, 2021; v1 submitted 18 October, 2018; originally announced October 2018.

    Comments: 13 pages