Zum Hauptinhalt springen

Showing 1–6 of 6 results for author: Mott, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2010.02418  [pdf, other

    cs.LG cs.AI cs.CV

    The Effectiveness of Memory Replay in Large Scale Continual Learning

    Authors: Yogesh Balaji, Mehrdad Farajtabar, Dong Yin, Alex Mott, Ang Li

    Abstract: We study continual learning in the large scale setting where tasks in the input sequence are not limited to classification, and the outputs can be of high dimension. Among multiple state-of-the-art methods, we found vanilla experience replay (ER) still very competitive in terms of both performance and scalability, despite its simplicity. However, a degraded performance is observed for ER with smal… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: 15 pages

  2. arXiv:2006.10974  [pdf, ps, other

    cs.LG stat.ML

    Optimization and Generalization of Regularization-Based Continual Learning: a Loss Approximation Viewpoint

    Authors: Dong Yin, Mehrdad Farajtabar, Ang Li, Nir Levine, Alex Mott

    Abstract: Neural networks have achieved remarkable success in many cognitive tasks. However, when they are trained sequentially on multiple tasks without access to old data, their performance on early tasks tend to drop significantly. This problem is often referred to as catastrophic forgetting, a key challenge in continual learning of neural networks. The regularization-based approach is one of the primary… ▽ More

    Submitted 8 February, 2021; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: Preliminary version with a different title presented at ICML Workshop on Continual Learning, 2020 (spotlight)

  3. arXiv:1912.02184  [pdf, other

    cs.CV

    Towards Robust Image Classification Using Sequential Attention Models

    Authors: Daniel Zoran, Mike Chrzanowski, Po-Sen Huang, Sven Gowal, Alex Mott, Pushmeet Kohl

    Abstract: In this paper we propose to augment a modern neural-network architecture with an attention model inspired by human perception. Specifically, we adversarially train and analyze a neural model incorporating a human inspired, visual attention component that is guided by a recurrent top-down sequential process. Our experimental evaluation uncovers several notable findings about the robustness and beha… ▽ More

    Submitted 4 December, 2019; originally announced December 2019.

  4. arXiv:1910.07104  [pdf, other

    cs.LG stat.ML

    Orthogonal Gradient Descent for Continual Learning

    Authors: Mehrdad Farajtabar, Navid Azizan, Alex Mott, Ang Li

    Abstract: Neural networks are achieving state of the art and sometimes super-human performance on learning tasks across a variety of domains. Whenever these problems require learning in a continual or sequential manner, however, neural networks suffer from the problem of catastrophic forgetting; they forget how to solve previous tasks after being trained on a new task, despite having the essential capacity… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

  5. arXiv:1908.04480  [pdf, other

    quant-ph cs.LG hep-ph

    Quantum adiabatic machine learning with zooming

    Authors: Alexander Zlokapa, Alex Mott, Joshua Job, Jean-Roch Vlimant, Daniel Lidar, Maria Spiropulu

    Abstract: Recent work has shown that quantum annealing for machine learning, referred to as QAML, can perform comparably to state-of-the-art machine learning methods with a specific application to Higgs boson classification. We propose QAML-Z, a novel algorithm that iteratively zooms in on a region of the energy surface by mapping the problem to a continuous space and sequentially applying quantum annealing… ▽ More

    Submitted 23 October, 2020; v1 submitted 13 August, 2019; originally announced August 2019.

    Comments: 9 pages, 5 figures

    Journal ref: Phys. Rev. A 102, 062405 (2020)

  6. arXiv:1906.02500  [pdf, other

    cs.LG stat.ML

    Towards Interpretable Reinforcement Learning Using Attention Augmented Agents

    Authors: Alex Mott, Daniel Zoran, Mike Chrzanowski, Daan Wierstra, Danilo J. Rezende

    Abstract: Inspired by recent work in attention models for image captioning and question answering, we present a soft attention model for the reinforcement learning domain. This model uses a soft, top-down attention mechanism to create a bottleneck in the agent, forcing it to focus on task-relevant information by sequentially querying its view of the environment. The output of the attention mechanism allows… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.