Skip to main content

Showing 1–3 of 3 results for author: Gordon-Hall, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.07769  [pdf, other

    cs.LG

    $α$VIL: Learning to Leverage Auxiliary Tasks for Multitask Learning

    Authors: Rafael Kourdis, Gabriel Gordon-Hall, Philip John Gorinski

    Abstract: Multitask Learning is a Machine Learning paradigm that aims to train a range of (usually related) tasks with the help of a shared model. While the goal is often to improve the joint performance of all training tasks, another approach is to focus on the performance of a specific target task, while treating the remaining ones as auxiliary data from which to possibly leverage positive transfer toward… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 11 pages, 1 algorithm, 4 figures, 2 tables

  2. arXiv:2004.11054  [pdf, other

    cs.CL cs.LG cs.NE

    Learning Dialog Policies from Weak Demonstrations

    Authors: Gabriel Gordon-Hall, Philip John Gorinski, Shay B. Cohen

    Abstract: Deep reinforcement learning is a promising approach to training a dialog manager, but current methods struggle with the large state and action spaces of multi-domain dialog systems. Building upon Deep Q-learning from Demonstrations (DQfD), an algorithm that scores highly in difficult Atari games, we leverage dialog data to guide the agent to successfully respond to a user's requests. We make progr… ▽ More

    Submitted 13 August, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: 9 pages + 2 pages references + 1 page appendices, 6 figures, 2 tables, 1 algorithm, accepted as long paper at ACL2020

  3. arXiv:2004.08114  [pdf, other

    cs.CL cs.LG cs.NE

    Show Us the Way: Learning to Manage Dialog from Demonstrations

    Authors: Gabriel Gordon-Hall, Philip John Gorinski, Gerasimos Lampouras, Ignacio Iacobacci

    Abstract: We present our submission to the End-to-End Multi-Domain Dialog Challenge Track of the Eighth Dialog System Technology Challenge. Our proposed dialog system adopts a pipeline architecture, with distinct components for Natural Language Understanding, Dialog State Tracking, Dialog Management and Natural Language Generation. At the core of our system is a reinforcement learning algorithm which uses D… ▽ More

    Submitted 17 April, 2020; originally announced April 2020.

    Comments: 8 pages + 2 pages references, 4 figures, 4 tables, accepted to DSTC8 Workshop at AAAI2020