Zum Hauptinhalt springen

Showing 1–13 of 13 results for author: Asri, L E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2204.01171  [pdf, other

    cs.CL cs.AI cs.LG

    Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language Generation

    Authors: Kushal Arora, Layla El Asri, Hareesh Bahuleyan, Jackie Chi Kit Cheung

    Abstract: Current language generation models suffer from issues such as repetition, incoherence, and hallucinations. An often-repeated hypothesis is that this brittleness of generation models is caused by the training and the generation procedure mismatch, also referred to as exposure bias. In this paper, we verify this hypothesis by analyzing exposure bias from an imitation learning perspective. We show th… ▽ More

    Submitted 9 January, 2023; v1 submitted 3 April, 2022; originally announced April 2022.

    Comments: Accepted in Findings of ACL 2022. v2: Equation 7 updated, typo fixes

  2. arXiv:2102.05034  [pdf, other

    cs.LG cs.AI

    SLAPS: Self-Supervision Improves Structure Learning for Graph Neural Networks

    Authors: Bahare Fatemi, Layla El Asri, Seyed Mehran Kazemi

    Abstract: Graph neural networks (GNNs) work well when the graph structure is provided. However, this structure may not always be available in real-world applications. One solution to this problem is to infer a task-specific latent structure and then apply a GNN to the inferred graph. Unfortunately, the space of possible graph structures grows super-exponentially with the number of nodes and so the task-spec… ▽ More

    Submitted 31 October, 2021; v1 submitted 9 February, 2021; originally announced February 2021.

    Comments: Accepted at NeurIPS 2021

  3. arXiv:2010.07665  [pdf, other

    cs.CL

    Diverse Keyphrase Generation with Neural Unlikelihood Training

    Authors: Hareesh Bahuleyan, Layla El Asri

    Abstract: In this paper, we study sequence-to-sequence (S2S) keyphrase generation models from the perspective of diversity. Recent advances in neural natural language generation have made possible remarkable progress on the task of keyphrase generation, demonstrated through improvements on quality metrics such as F1-score. However, the importance of diversity in keyphrase generation has been largely ignored… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: Accepted to COLING 2020

  4. arXiv:1906.09310  [pdf, other

    cs.LG cs.AI cs.CL

    A Study of State Aliasing in Structured Prediction with RNNs

    Authors: Layla El Asri, Adam Trischler

    Abstract: End-to-end reinforcement learning agents learn a state representation and a policy at the same time. Recurrent neural networks (RNNs) have been trained successfully as reinforcement learning agents in settings like dialogue that require structured prediction. In this paper, we investigate the representations learned by RNN-based agents when trained with both policy gradient and value-based methods… ▽ More

    Submitted 21 June, 2019; originally announced June 2019.

    Comments: Deep Reinforcement Learning Meets Structured Prediction workshop at ICLR 2019 and Representation Learning for NLP workshop at ACL 2019

  5. arXiv:1812.07023  [pdf, other

    cs.CL cs.CV

    From FiLM to Video: Multi-turn Question Answering with Multi-modal Context

    Authors: Dat Tien Nguyen, Shikhar Sharma, Hannes Schulz, Layla El Asri

    Abstract: Understanding audio-visual content and the ability to have an informative conversation about it have both been challenging areas for intelligent systems. The Audio Visual Scene-aware Dialog (AVSD) challenge, organized as a track of the Dialog System Technology Challenge 7 (DSTC7), proposes a combined task, where a system has to answer questions pertaining to a video given a dialogue with previous… ▽ More

    Submitted 17 December, 2018; originally announced December 2018.

    Comments: Accepted for an Oral presentation at the DSTC7 workshop at AAAI 2019

  6. arXiv:1812.00855  [pdf, other

    cs.LG cs.CL stat.ML

    Towards Solving Text-based Games by Producing Adaptive Action Spaces

    Authors: Ruo Yu Tao, Marc-Alexandre Côté, Xingdi Yuan, Layla El Asri

    Abstract: To solve a text-based game, an agent needs to formulate valid text commands for a given context and find the ones that lead to success. Recent attempts at solving text-based games with deep reinforcement learning have focused on the latter, i.e., learning to act optimally when valid actions are known in advance. In this work, we propose to tackle the first task and train a model that generates the… ▽ More

    Submitted 3 December, 2018; originally announced December 2018.

  7. arXiv:1811.09845  [pdf, other

    cs.CV

    Tell, Draw, and Repeat: Generating and Modifying Images Based on Continual Linguistic Instruction

    Authors: Alaaeldin El-Nouby, Shikhar Sharma, Hannes Schulz, Devon Hjelm, Layla El Asri, Samira Ebrahimi Kahou, Yoshua Bengio, Graham W. Taylor

    Abstract: Conditional text-to-image generation is an active area of research, with many possible applications. Existing research has primarily focused on generating a single image from available conditioning information in one step. One practical extension beyond one-step generation is a system that generates an image iteratively, conditioned on ongoing linguistic input or feedback. This is significantly mo… ▽ More

    Submitted 23 September, 2019; v1 submitted 24 November, 2018; originally announced November 2018.

    Comments: Accepted at ICCV 2019

    Journal ref: Proceedings of the 2019 IEEE International Conference on Computer Vision (ICCV)

  8. arXiv:1806.11532  [pdf, other

    cs.LG cs.CL stat.ML

    TextWorld: A Learning Environment for Text-based Games

    Authors: Marc-Alexandre Côté, Ákos Kádár, Xingdi Yuan, Ben Kybartas, Tavian Barnes, Emery Fine, James Moore, Ruo Yu Tao, Matthew Hausknecht, Layla El Asri, Mahmoud Adada, Wendy Tay, Adam Trischler

    Abstract: We introduce TextWorld, a sandbox learning environment for the training and evaluation of RL agents on text-based games. TextWorld is a Python library that handles interactive play-through of text games, as well as backend functions like state tracking and reward assignment. It comes with a curated list of games whose features and challenges we have analyzed. More significantly, it enables users t… ▽ More

    Submitted 8 November, 2019; v1 submitted 29 June, 2018; originally announced June 2018.

    Comments: Presented at the Computer Games Workshop at IJCAI 2018, Stockholm

  9. arXiv:1706.09799  [pdf, other

    cs.CL

    Relevance of Unsupervised Metrics in Task-Oriented Dialogue for Evaluating Natural Language Generation

    Authors: Shikhar Sharma, Layla El Asri, Hannes Schulz, Jeremie Zumer

    Abstract: Automated metrics such as BLEU are widely used in the machine translation literature. They have also been used recently in the dialogue community for evaluating dialogue response generation. However, previous work in dialogue response generation has shown that these metrics do not correlate strongly with human judgment in the non task-oriented dialogue setting. Task-oriented dialogue responses are… ▽ More

    Submitted 29 June, 2017; originally announced June 2017.

  10. arXiv:1706.01690  [pdf, other

    cs.CL

    A Frame Tracking Model for Memory-Enhanced Dialogue Systems

    Authors: Hannes Schulz, Jeremie Zumer, Layla El Asri, Shikhar Sharma

    Abstract: Recently, resources and tasks were proposed to go beyond state tracking in dialogue systems. An example is the frame tracking task, which requires recording multiple frames, one for each user goal set during the dialogue. This allows a user, for instance, to compare items corresponding to different goals. This paper proposes a model which takes as input the list of frames created so far during the… ▽ More

    Submitted 6 June, 2017; originally announced June 2017.

  11. arXiv:1704.00057  [pdf, other

    cs.CL

    Frames: A Corpus for Adding Memory to Goal-Oriented Dialogue Systems

    Authors: Layla El Asri, Hannes Schulz, Shikhar Sharma, Jeremie Zumer, Justin Harris, Emery Fine, Rahul Mehrotra, Kaheer Suleman

    Abstract: This paper presents the Frames dataset (Frames is available at http://datasets.maluuba.com/Frames), a corpus of 1369 human-human dialogues with an average of 15 turns per dialogue. We developed this dataset to study the role of memory in goal-oriented dialogue systems. Based on Frames, we introduce a task called frame tracking, which extends state tracking to a setting where several states are tra… ▽ More

    Submitted 13 April, 2017; v1 submitted 31 March, 2017; originally announced April 2017.

  12. arXiv:1607.00070  [pdf, other

    cs.CL

    A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue Systems

    Authors: Layla El Asri, Jing He, Kaheer Suleman

    Abstract: User simulation is essential for generating enough data to train a statistical spoken dialogue system. Previous models for user simulation suffer from several drawbacks, such as the inability to take dialogue history into account, the need of rigid structure to ensure coherent user behaviour, heavy dependence on a specific domain, the inability to output several user intentions during one dialogue… ▽ More

    Submitted 30 June, 2016; originally announced July 2016.

    Comments: Accepted for publication at Interspeech 2016

  13. arXiv:1606.03152  [pdf, other

    cs.CL cs.AI

    Policy Networks with Two-Stage Training for Dialogue Systems

    Authors: Mehdi Fatemi, Layla El Asri, Hannes Schulz, Jing He, Kaheer Suleman

    Abstract: In this paper, we propose to use deep policy networks which are trained with an advantage actor-critic method for statistically optimised dialogue systems. First, we show that, on summary state and action spaces, deep Reinforcement Learning (RL) outperforms Gaussian Processes methods. Summary state and action spaces lead to good performance but require pre-engineering effort, RL knowledge, and dom… ▽ More

    Submitted 12 September, 2016; v1 submitted 9 June, 2016; originally announced June 2016.

    Comments: SIGDial 2016 (Submitted: May 2016; Accepted: Jun 30, 2016)

    Journal ref: Proceedings of the SIGDIAL 2016 Conference, pages 101--110, Los Angeles, USA, 13-15 September 2016. Association for Computational Linguistics