
Showing 1–8 of 8 results for author: Noukhovitch, M

  1. arXiv:2403.17031

    cs.LG

    The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

    Authors: Shengyi Huang, Michael Noukhovitch, Arian Hosseini, Kashif Rasul, Weixun Wang, Lewis Tunstall

    Abstract: This work is the first to openly reproduce the Reinforcement Learning from Human Feedback (RLHF) scaling behaviors reported in OpenAI's seminal TL;DR summarization work. We create an RLHF pipeline from scratch, enumerate over 20 key implementation details, and share key insights during the reproduction. Our RLHF-trained Pythia models demonstrate significant gains in response quality that scale wit…

    Submitted 23 March, 2024; originally announced March 2024.

  2. arXiv:2312.07551

    cs.CL

    Language Model Alignment with Elastic Reset

    Authors: Michael Noukhovitch, Samuel Lavoie, Florian Strub, Aaron Courville

    Abstract: Finetuning language models with reinforcement learning (RL), e.g. from human feedback (HF), is a prominent method for alignment. But optimizing against a reward model can improve on reward while degrading performance in other areas, a phenomenon known as reward hacking, alignment tax, or language drift. First, we argue that commonly-used test metrics are insufficient and instead measure how differ…

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Published at NeurIPS 2023

  3. arXiv:2307.01403

    cs.AI cs.LG

    Learning Multi-Agent Communication with Contrastive Learning

    Authors: Yat Long Lo, Biswa Sengupta, Jakob Foerster, Michael Noukhovitch

    Abstract: Communication is a powerful tool for coordination in multi-agent RL. But inducing an effective, common language is a difficult challenge, particularly in the decentralized setting. In this work, we introduce an alternative perspective where communicative messages sent between agents are considered as different incomplete views of the environment state. By examining the relationship between message…

    Submitted 1 February, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: The 12th International Conference on Learning Representations (ICLR)

  4. arXiv:2204.00616

    cs.LG cs.CV

    Simplicial Embeddings in Self-Supervised Learning and Downstream Classification

    Authors: Samuel Lavoie, Christos Tsirigotis, Max Schwarzer, Ankit Vani, Michael Noukhovitch, Kenji Kawaguchi, Aaron Courville

    Abstract: Simplicial Embeddings (SEM) are representations learned through self-supervised learning (SSL), wherein a representation is projected into $L$ simplices of $V$ dimensions each using a softmax operation. This procedure conditions the representation onto a constrained space during pretraining and imparts an inductive bias for group sparsity. For downstream classification, we formally prove that the…

    Submitted 30 September, 2022; v1 submitted 1 April, 2022; originally announced April 2022.

    Comments: 30 pages, 8 figures, Preprint

  5. arXiv:2106.04799

    cs.LG

    Pretraining Representations for Data-Efficient Reinforcement Learning

    Authors: Max Schwarzer, Nitarshan Rajkumar, Michael Noukhovitch, Ankesh Anand, Laurent Charlin, Devon Hjelm, Philip Bachman, Aaron Courville

    Abstract: Data efficiency is a key challenge for deep reinforcement learning. We address this problem by using unlabeled data to pretrain an encoder which is then finetuned on a small amount of task-specific data. To encourage learning representations which capture diverse aspects of the underlying MDP, we employ a combination of latent dynamics modelling and unsupervised goal-conditioned RL. When limited t…

    Submitted 9 June, 2021; originally announced June 2021.

  6. arXiv:2101.10276

    cs.LG cs.AI cs.MA

    Emergent Communication under Competition

    Authors: Michael Noukhovitch, Travis LaCroix, Angeliki Lazaridou, Aaron Courville

    Abstract: The literature in modern machine learning has only negative results for learning to communicate between competitive agents using standard RL. We introduce a modified sender-receiver game to study the spectrum of partially-competitive scenarios and show communication can indeed emerge in a competitive setting. We empirically demonstrate three key takeaways for future research. First, we show that c…

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: To be presented at AAMAS 2021

  7. arXiv:1811.12889

    cs.CL cs.AI

    Systematic Generalization: What Is Required and Can It Be Learned?

    Authors: Dzmitry Bahdanau, Shikhar Murty, Michael Noukhovitch, Thien Huu Nguyen, Harm de Vries, Aaron Courville

    Abstract: Numerous models for grounded language understanding have been recently proposed, including (i) generic models that can be easily adapted to any given task and (ii) intuitively appealing modular models that require background knowledge to be instantiated. We compare both types of models in how much they lend themselves to a particular form of systematic generalization. Using a synthetic VQA test, w…

    Submitted 21 April, 2019; v1 submitted 30 November, 2018; originally announced November 2018.

    Comments: Published as a conference paper at ICLR 2019

  8. arXiv:1804.09259

    cs.CL

    Commonsense mining as knowledge base completion? A study on the impact of novelty

    Authors: Stanisław Jastrzębski, Dzmitry Bahdanau, Seyedarian Hosseini, Michael Noukhovitch, Yoshua Bengio, Jackie Chi Kit Cheung

    Abstract: Commonsense knowledge bases such as ConceptNet represent knowledge in the form of relational triples. Inspired by the recent work by Li et al., we analyse if knowledge base completion models can be used to mine commonsense knowledge from raw text. We propose novelty of predicted triples with respect to the training set as an important factor in interpreting results. We critically analyse the diffi…

    Submitted 24 April, 2018; originally announced April 2018.

    Comments: Published in Workshop on New Forms of Generalization in Deep Learning and Natural Language Processing (NAACL 2018)