Zum Hauptinhalt springen

Showing 1–38 of 38 results for author: Ke, N R

.
  1. arXiv:2407.21009  [pdf, other

    cs.AI cs.LG

    AI-Assisted Generation of Difficult Math Questions

    Authors: Vedant Shah, Dingli Yu, Kaifeng Lyu, Simon Park, Nan Rosemary Ke, Michael Mozer, Yoshua Bengio, Sanjeev Arora, Anirudh Goyal

    Abstract: Current LLM training positions mathematical reasoning as a core capability. With publicly available sources fully tapped, there is unmet demand for diverse and challenging math questions. Relying solely on human experts is both time-consuming and costly, while LLM-generated questions often lack the requisite diversity and difficulty. We present a design framework that combines the strengths of LLM… ▽ More

    Submitted 2 September, 2024; v1 submitted 30 July, 2024; originally announced July 2024.

  2. arXiv:2405.15485  [pdf, other

    cs.AI cs.CL cs.LG

    Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs

    Authors: Siyuan Guo, Aniket Didolkar, Nan Rosemary Ke, Anirudh Goyal, Ferenc Huszár, Bernhard Schölkopf

    Abstract: We are beginning to see progress in language model assisted scientific discovery. Motivated by the use of LLMs as a general scientific assistant, this paper assesses the domain knowledge of LLMs through its understanding of different mathematical skills required to solve problems. In particular, we look at not just what the pre-trained model already knows, but how it learned to learn from informat… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:2405.12205  [pdf, other

    cs.AI cs.LG

    Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving

    Authors: Aniket Didolkar, Anirudh Goyal, Nan Rosemary Ke, Siyuan Guo, Michal Valko, Timothy Lillicrap, Danilo Rezende, Yoshua Bengio, Michael Mozer, Sanjeev Arora

    Abstract: Metacognitive knowledge refers to humans' intuitive knowledge of their own thinking and reasoning processes. Today's best LLMs clearly possess some reasoning processes. The paper gives evidence that they also have metacognitive knowledge, including ability to name skills and procedures to apply given a task. We explore this primarily in context of math reasoning, developing a prompt-guided interac… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Preprint. Under review

  4. arXiv:2304.05823  [pdf, other

    q-bio.MN cs.LG q-bio.GN

    DiscoGen: Learning to Discover Gene Regulatory Networks

    Authors: Nan Rosemary Ke, Sara-Jane Dunn, Jorg Bornschein, Silvia Chiappa, Melanie Rey, Jean-Baptiste Lespiau, Albin Cassirer, Jane Wang, Theophane Weber, David Barrett, Matthew Botvinick, Anirudh Goyal, Mike Mozer, Danilo Rezende

    Abstract: Accurately inferring Gene Regulatory Networks (GRNs) is a critical and challenging task in biology. GRNs model the activatory and inhibitory interactions between genes and are inherently causal in nature. To accurately identify GRNs, perturbational data is required. However, most GRN discovery methods only operate on observational data. Recent advances in neural network-based causal discovery meth… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

  5. arXiv:2302.04250  [pdf, other

    cs.LG stat.ML

    Learning How to Infer Partial MDPs for In-Context Adaptation and Exploration

    Authors: Chentian Jiang, Nan Rosemary Ke, Hado van Hasselt

    Abstract: To generalize across tasks, an agent should acquire knowledge from past tasks that facilitate adaptation and exploration in future tasks. We focus on the problem of in-context adaptation and exploration, where an agent only relies on context, i.e., history of states, actions and/or rewards, rather than gradient-based updates. Posterior sampling (extension of Thompson sampling) is a promising appro… ▽ More

    Submitted 4 May, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: In proceedings of the Reincarnating Reinforcement Learning (RRL) Workshop at ICLR 2023 and the Neuro-Symbolic AI for Agent and Multi-Agent Systems (NeSyMAS) Workshop at AAMAS 2023

  6. arXiv:2210.13583  [pdf, other

    cs.LG cs.AI stat.ME

    Learning Latent Structural Causal Models

    Authors: Jithendaraa Subramanian, Yashas Annadani, Ivaxi Sheth, Nan Rosemary Ke, Tristan Deleu, Stefan Bauer, Derek Nowrouzezahrai, Samira Ebrahimi Kahou

    Abstract: Causal learning has long concerned itself with the accurate recovery of underlying causal mechanisms. Such causal modelling enables better explanations of out-of-distribution data. Prior works on causal learning assume that the high-level causal variables are given. However, in machine learning tasks, one often operates on low-level data like image pixels or high-dimensional vectors. In such setti… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 21 pages, 19 figures

  7. arXiv:2206.08353  [pdf, other

    cs.LG stat.ML

    Towards Understanding How Machines Can Learn Causal Overhypotheses

    Authors: Eliza Kosoy, David M. Chan, Adrian Liu, Jasmine Collins, Bryanna Kaufmann, Sandy Han Huang, Jessica B. Hamrick, John Canny, Nan Rosemary Ke, Alison Gopnik

    Abstract: Recent work in machine learning and cognitive science has suggested that understanding causal information is essential to the development of intelligence. The extensive literature in cognitive science using the ``blicket detector'' environment shows that children are adept at many kinds of causal inference and learning. We propose to adapt that environment for machine learning agents. One of the k… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  8. arXiv:2206.04620  [pdf, other

    cs.LG cs.AI stat.ML

    On the Generalization and Adaption Performance of Causal Models

    Authors: Nino Scherrer, Anirudh Goyal, Stefan Bauer, Yoshua Bengio, Nan Rosemary Ke

    Abstract: Learning models that offer robust out-of-distribution generalization and fast adaptation is a key challenge in modern machine learning. Modelling causal structure into neural networks holds the promise to accomplish robust zero and few-shot adaptation. Recent advances in differentiable causal discovery have proposed to factorize the data generating process into a set of modules, i.e. one module fo… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

  9. arXiv:2205.14794  [pdf, other

    cs.LG cs.AI

    Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning

    Authors: Aniket Didolkar, Kshitij Gupta, Anirudh Goyal, Nitesh B. Gundavarapu, Alex Lamb, Nan Rosemary Ke, Yoshua Bengio

    Abstract: Recurrent neural networks have a strong inductive bias towards learning temporally compressed representations, as the entire history of a sequence is represented by a single vector. By contrast, Transformers have little inductive bias towards learning temporally compressed representations, as they allow for attention over all previously computed elements in a sequence. Having a more compressed rep… ▽ More

    Submitted 25 October, 2022; v1 submitted 29 May, 2022; originally announced May 2022.

  10. arXiv:2204.04875  [pdf, other

    stat.ML cs.LG

    Learning to Induce Causal Structure

    Authors: Nan Rosemary Ke, Silvia Chiappa, Jane Wang, Anirudh Goyal, Jorg Bornschein, Melanie Rey, Theophane Weber, Matthew Botvinic, Michael Mozer, Danilo Jimenez Rezende

    Abstract: The fundamental challenge in causal induction is to infer the underlying graph structure given observational and/or interventional data. Most existing causal induction algorithms operate by generating candidate graphs and evaluating them using either score-based methods (including continuous optimization) or independence tests. In our work, we instead treat the inference process as a black box and… ▽ More

    Submitted 7 October, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

  11. arXiv:2202.10430  [pdf, other

    cs.LG cs.AI cs.NE

    Learning Causal Overhypotheses through Exploration in Children and Computational Models

    Authors: Eliza Kosoy, Adrian Liu, Jasmine Collins, David M Chan, Jessica B Hamrick, Nan Rosemary Ke, Sandy H Huang, Bryanna Kaufmann, John Canny, Alison Gopnik

    Abstract: Despite recent progress in reinforcement learning (RL), RL algorithms for exploration still remain an active area of research. Existing methods often focus on state-based metrics, which do not consider the underlying causal structures of the environment, and while recent research has begun to explore RL environments for causal learning, these environments primarily leverage causal information thro… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

  12. arXiv:2202.08417  [pdf, other

    cs.LG

    Retrieval-Augmented Reinforcement Learning

    Authors: Anirudh Goyal, Abram L. Friesen, Andrea Banino, Theophane Weber, Nan Rosemary Ke, Adria Puigdomenech Badia, Arthur Guez, Mehdi Mirza, Peter C. Humphreys, Ksenia Konyushkova, Laurent Sifre, Michal Valko, Simon Osindero, Timothy Lillicrap, Nicolas Heess, Charles Blundell

    Abstract: Most deep reinforcement learning (RL) algorithms distill experience into parametric behavior policies or value functions via gradient updates. While effective, this approach has several disadvantages: (1) it is computationally expensive, (2) it can take many updates to integrate experiences into the parametric model, (3) experiences that are not fully integrated do not appropriately influence the… ▽ More

    Submitted 24 May, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

  13. arXiv:2109.02429  [pdf, other

    stat.ML cs.LG

    Learning Neural Causal Models with Active Interventions

    Authors: Nino Scherrer, Olexa Bilaniuk, Yashas Annadani, Anirudh Goyal, Patrick Schwab, Bernhard Schölkopf, Michael C. Mozer, Yoshua Bengio, Stefan Bauer, Nan Rosemary Ke

    Abstract: Discovering causal structures from data is a challenging inference problem of fundamental importance in all areas of science. The appealing properties of neural networks have recently led to a surge of interest in differentiable neural network-based methods for learning causal structures from data. So far, differentiable causal discovery has focused on static datasets of observational or fixed int… ▽ More

    Submitted 5 March, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

  14. arXiv:2107.00848  [pdf, other

    stat.ML cs.LG

    Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning

    Authors: Nan Rosemary Ke, Aniket Didolkar, Sarthak Mittal, Anirudh Goyal, Guillaume Lajoie, Stefan Bauer, Danilo Rezende, Yoshua Bengio, Michael Mozer, Christopher Pal

    Abstract: Inducing causal relationships from observations is a classic problem in machine learning. Most work in causality starts from the premise that the causal variables themselves are observed. However, for AI agents such as robots trying to make sense of their environment, the only observables are low-level variables like pixels in images. To generalize well, an agent must induce high-level variables,… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

  15. arXiv:2105.08710  [pdf, other

    cs.LG cs.AI

    Fast and Slow Learning of Recurrent Independent Mechanisms

    Authors: Kanika Madan, Nan Rosemary Ke, Anirudh Goyal, Bernhard Schölkopf, Yoshua Bengio

    Abstract: Decomposing knowledge into interchangeable pieces promises a generalization advantage when there are changes in distribution. A learning agent interacting with its environment is likely to be faced with situations requiring novel combinations of existing pieces of knowledge. We hypothesize that such a decomposition of knowledge is particularly relevant for being able to generalize in a systematic… ▽ More

    Submitted 18 May, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

    Comments: Accepted at ICLR'21

  16. arXiv:2103.01937  [pdf, other

    cs.AI cs.LG stat.ML

    Neural Production Systems: Learning Rule-Governed Visual Dynamics

    Authors: Anirudh Goyal, Aniket Didolkar, Nan Rosemary Ke, Charles Blundell, Philippe Beaudoin, Nicolas Heess, Michael Mozer, Yoshua Bengio

    Abstract: Visual environments are structured, consisting of distinct objects or entities. These entities have properties -- both visible and latent -- that determine the manner in which they interact with one another. To partition images into entities, deep-learning researchers have proposed structural inductive biases such as slot-based architectures. To model interactions among entities, equivariant graph… ▽ More

    Submitted 23 March, 2022; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: NeurIPS'21

  17. arXiv:2103.01197  [pdf, other

    cs.LG cs.AI stat.ML

    Coordination Among Neural Modules Through a Shared Global Workspace

    Authors: Anirudh Goyal, Aniket Didolkar, Alex Lamb, Kartikeya Badola, Nan Rosemary Ke, Nasim Rahaman, Jonathan Binas, Charles Blundell, Michael Mozer, Yoshua Bengio

    Abstract: Deep learning has seen a movement away from representing examples with a monolithic hidden state towards a richly structured state. For example, Transformers segment by position, and object-centric architectures decompose images into entities. In all these architectures, interactions between different elements are modeled via pairwise interactions: Transformers make use of self-attention to incorp… ▽ More

    Submitted 22 March, 2022; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: ICLR'22 accepted paper

  18. arXiv:2102.11107  [pdf, other

    cs.LG cs.AI

    Towards Causal Representation Learning

    Authors: Bernhard Schölkopf, Francesco Locatello, Stefan Bauer, Nan Rosemary Ke, Nal Kalchbrenner, Anirudh Goyal, Yoshua Bengio

    Abstract: The two fields of machine learning and graphical causality arose and developed separately. However, there is now cross-pollination and increasing interest in both fields to benefit from the advances of the other. In the present paper, we review fundamental concepts of causal inference and relate them to crucial open problems of machine learning, including transfer and generalization, thereby assay… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: Special Issue of Proceedings of the IEEE - Advances in Machine Learning and Deep Neural Networks

  19. arXiv:2011.11150  [pdf, other

    cs.LG stat.ML

    On the Convergence of Continuous Constrained Optimization for Structure Learning

    Authors: Ignavier Ng, Sébastien Lachapelle, Nan Rosemary Ke, Simon Lacoste-Julien, Kun Zhang

    Abstract: Recently, structure learning of directed acyclic graphs (DAGs) has been formulated as a continuous optimization problem by leveraging an algebraic characterization of acyclicity. The constrained problem is solved using the augmented Lagrangian method (ALM) which is often preferred to the quadratic penalty method (QPM) by virtue of its standard convergence result that does not require the penalty c… ▽ More

    Submitted 10 April, 2022; v1 submitted 22 November, 2020; originally announced November 2020.

    Comments: AISTATS 2022. A preliminary version of this paper was presented at the NeurIPS 2020 Workshop on Causal Discovery and Causality-Inspired Machine Learning. The code is available at https://github.com/ignavierng/notears-convergence

  20. arXiv:2008.09301  [pdf, other

    stat.ML cs.LG

    Amortized learning of neural causal representations

    Authors: Nan Rosemary Ke, Jane. X. Wang, Jovana Mitrovic, Martin Szummer, Danilo J. Rezende

    Abstract: Causal models can compactly and efficiently encode the data-generating process under all interventions and hence may generalize better under changes in distribution. These models are often represented as Bayesian networks and learning them scales poorly with the number of variables. Moreover, these approaches cannot leverage previously learned knowledge to help with learning new causal models. In… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: ICLR 2020 causal learning for decision making workshop

  21. arXiv:2002.02836  [pdf, other

    cs.LG cs.AI stat.ML

    Causally Correct Partial Models for Reinforcement Learning

    Authors: Danilo J. Rezende, Ivo Danihelka, George Papamakarios, Nan Rosemary Ke, Ray Jiang, Theophane Weber, Karol Gregor, Hamza Merzic, Fabio Viola, Jane Wang, Jovana Mitrovic, Frederic Besse, Ioannis Antonoglou, Lars Buesing

    Abstract: In reinforcement learning, we can learn a model of future observations and rewards, and use it to plan the agent's next actions. However, jointly modeling future observations can be computationally expensive or even intractable if the observations are high-dimensional (e.g. images). For this reason, previous works have considered partial models, which model only part of the observation. In this pa… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

  22. arXiv:1910.01075  [pdf, other

    stat.ML cs.AI cs.LG

    Learning Neural Causal Models from Unknown Interventions

    Authors: Nan Rosemary Ke, Olexa Bilaniuk, Anirudh Goyal, Stefan Bauer, Hugo Larochelle, Bernhard Schölkopf, Michael C. Mozer, Chris Pal, Yoshua Bengio

    Abstract: Promising results have driven a recent surge of interest in continuous optimization methods for Bayesian network structure learning from observational data. However, there are theoretical limitations on the identifiability of underlying structures obtained from observational data alone. Interventional data provides much richer information about the underlying data-generating process. However, the… ▽ More

    Submitted 23 August, 2020; v1 submitted 2 October, 2019; originally announced October 2019.

  23. arXiv:1903.01599  [pdf, other

    stat.ML cs.LG

    Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future

    Authors: Nan Rosemary Ke, Amanpreet Singh, Ahmed Touati, Anirudh Goyal, Yoshua Bengio, Devi Parikh, Dhruv Batra

    Abstract: In model-based reinforcement learning, the agent interleaves between model learning and planning. These two components are inextricably intertwined. If the model is not able to provide sensible long-term prediction, the executed planner would exploit model flaws, which can yield catastrophic failures. This paper focuses on building a model that reasons about the long-term future and demonstrates h… ▽ More

    Submitted 16 March, 2019; v1 submitted 4 March, 2019; originally announced March 2019.

    Comments: To appear at ICLR 2019

  24. arXiv:1810.03023  [pdf, other

    stat.ML cs.LG

    h-detach: Modifying the LSTM Gradient Towards Better Optimization

    Authors: Devansh Arpit, Bhargav Kanuparthi, Giancarlo Kerg, Nan Rosemary Ke, Ioannis Mitliagkas, Yoshua Bengio

    Abstract: Recurrent neural networks are known for their notorious exploding and vanishing gradient problem (EVGP). This problem becomes more evident in tasks where the information needed to correctly solve them exist over long time scales, because EVGP prevents important gradient components from being back-propagated adequately over a large number of steps. We introduce a simple stochastic algorithm (\texti… ▽ More

    Submitted 9 January, 2019; v1 submitted 6 October, 2018; originally announced October 2018.

    Comments: First two authors contributed equally. Published in ICLR 2019

  25. arXiv:1809.03702  [pdf, other

    cs.LG stat.ML

    Sparse Attentive Backtracking: Temporal CreditAssignment Through Reminding

    Authors: Nan Rosemary Ke, Anirudh Goyal, Olexa Bilaniuk, Jonathan Binas, Michael C. Mozer, Chris Pal, Yoshua Bengio

    Abstract: Learning long-term dependencies in extended temporal sequences requires credit assignment to events far back in the past. The most common method for training recurrent neural networks, back-propagation through time (BPTT), requires credit information to be propagated backwards through every single step of the forward computation, potentially over thousands or millions of time steps. This becomes c… ▽ More

    Submitted 11 September, 2018; originally announced September 2018.

    Comments: To appear as a Spotlight presentation at NIPS 2018

  26. arXiv:1806.04342  [pdf, other

    stat.ML cs.LG

    Focused Hierarchical RNNs for Conditional Sequence Processing

    Authors: Nan Rosemary Ke, Konrad Zolna, Alessandro Sordoni, Zhouhan Lin, Adam Trischler, Yoshua Bengio, Joelle Pineau, Laurent Charlin, Chris Pal

    Abstract: Recurrent Neural Networks (RNNs) with attention mechanisms have obtained state-of-the-art results for many sequence processing tasks. Most of these models use a simple form of encoder with attention that looks over the entire sequence and assigns a weight to each token independently. We present a mechanism for focusing RNN encoders for sequence modelling tasks which allows them to attend to key pa… ▽ More

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: To appear at ICML 2018

  27. arXiv:1801.06700  [pdf, other

    cs.CL cs.AI cs.LG cs.NE stat.ML

    A Deep Reinforcement Learning Chatbot (Short Version)

    Authors: Iulian V. Serban, Chinnadhurai Sankar, Mathieu Germain, Saizheng Zhang, Zhouhan Lin, Sandeep Subramanian, Taesup Kim, Michael Pieper, Sarath Chandar, Nan Rosemary Ke, Sai Rajeswar, Alexandre de Brebisson, Jose M. R. Sotelo, Dendi Suhubdy, Vincent Michalski, Alexandre Nguyen, Joelle Pineau, Yoshua Bengio

    Abstract: We present MILABOT: a deep reinforcement learning chatbot developed by the Montreal Institute for Learning Algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is capable of conversing with humans on popular small talk topics through both speech and text. The system consists of an ensemble of natural language generation and retrieval models, including neural network and template-based… ▽ More

    Submitted 20 January, 2018; originally announced January 2018.

    Comments: 9 pages, 1 figure, 2 tables; presented at NIPS 2017, Conversational AI: "Today's Practice and Tomorrow's Potential" Workshop

    ACM Class: I.5.1; I.2.7

  28. arXiv:1711.09050  [pdf, other

    cs.CL

    Ethical Challenges in Data-Driven Dialogue Systems

    Authors: Peter Henderson, Koustuv Sinha, Nicolas Angelard-Gontier, Nan Rosemary Ke, Genevieve Fried, Ryan Lowe, Joelle Pineau

    Abstract: The use of dialogue systems as a medium for human-machine interaction is an increasingly prevalent paradigm. A growing number of dialogue systems use conversation strategies that are learned from large datasets. There are well documented instances where interactions with these system have resulted in biased or even offensive conversations due to the data-driven training process. Here, we highlight… ▽ More

    Submitted 24 November, 2017; originally announced November 2017.

    Comments: In Submission to the AAAI/ACM conference on Artificial Intelligence, Ethics, and Society

  29. arXiv:1711.05411  [pdf, other

    stat.ML cs.LG

    Z-Forcing: Training Stochastic Recurrent Networks

    Authors: Anirudh Goyal, Alessandro Sordoni, Marc-Alexandre Côté, Nan Rosemary Ke, Yoshua Bengio

    Abstract: Many efforts have been devoted to training generative latent variable models with autoregressive decoders, such as recurrent neural networks (RNN). Stochastic recurrent models have been successful in capturing the variability observed in natural sequential data such as speech. We unify successful ideas from recently proposed architectures into a stochastic recurrent model: each step in the sequenc… ▽ More

    Submitted 16 November, 2017; v1 submitted 15 November, 2017; originally announced November 2017.

    Comments: To appear in NIPS'17

  30. arXiv:1711.04755  [pdf, other

    stat.ML cs.LG

    ACtuAL: Actor-Critic Under Adversarial Learning

    Authors: Anirudh Goyal, Nan Rosemary Ke, Alex Lamb, R Devon Hjelm, Chris Pal, Joelle Pineau, Yoshua Bengio

    Abstract: Generative Adversarial Networks (GANs) are a powerful framework for deep generative modeling. Posed as a two-player minimax problem, GANs are typically trained end-to-end on real-valued data and can be used to train a generator of high-dimensional and realistic images. However, a major limitation of GANs is that training relies on passing gradients from the discriminator through the generator via… ▽ More

    Submitted 13 November, 2017; originally announced November 2017.

  31. arXiv:1711.02326  [pdf, other

    cs.AI cs.LG cs.NE stat.ML

    Sparse Attentive Backtracking: Long-Range Credit Assignment in Recurrent Networks

    Authors: Nan Rosemary Ke, Anirudh Goyal, Olexa Bilaniuk, Jonathan Binas, Laurent Charlin, Chris Pal, Yoshua Bengio

    Abstract: A major drawback of backpropagation through time (BPTT) is the difficulty of learning long-term dependencies, coming from having to propagate credit information backwards through every single step of the forward computation. This makes BPTT both computationally impractical and biologically implausible. For this reason, full backpropagation through time is rarely used on long sequences, and truncat… ▽ More

    Submitted 7 November, 2017; originally announced November 2017.

  32. arXiv:1711.02282  [pdf, other

    stat.ML cs.LG cs.NE

    Variational Walkback: Learning a Transition Operator as a Stochastic Recurrent Net

    Authors: Anirudh Goyal, Nan Rosemary Ke, Surya Ganguli, Yoshua Bengio

    Abstract: We propose a novel method to directly learn a stochastic transition operator whose repeated application provides generated samples. Traditional undirected graphical models approach this problem indirectly by learning a Markov chain model whose stationary distribution obeys detailed balance with respect to a parameterized energy function. The energy function is then modified so the model and data d… ▽ More

    Submitted 6 November, 2017; originally announced November 2017.

    Comments: To appear at NIPS 2017

  33. arXiv:1709.02349  [pdf, other

    cs.CL cs.AI cs.LG cs.NE stat.ML

    A Deep Reinforcement Learning Chatbot

    Authors: Iulian V. Serban, Chinnadhurai Sankar, Mathieu Germain, Saizheng Zhang, Zhouhan Lin, Sandeep Subramanian, Taesup Kim, Michael Pieper, Sarath Chandar, Nan Rosemary Ke, Sai Rajeshwar, Alexandre de Brebisson, Jose M. R. Sotelo, Dendi Suhubdy, Vincent Michalski, Alexandre Nguyen, Joelle Pineau, Yoshua Bengio

    Abstract: We present MILABOT: a deep reinforcement learning chatbot developed by the Montreal Institute for Learning Algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is capable of conversing with humans on popular small talk topics through both speech and text. The system consists of an ensemble of natural language generation and retrieval models, including template-based models, bag-of-wor… ▽ More

    Submitted 5 November, 2017; v1 submitted 7 September, 2017; originally announced September 2017.

    Comments: 40 pages, 9 figures, 11 tables

    ACM Class: I.5.1; I.2.7

  34. arXiv:1708.06742  [pdf, other

    cs.LG stat.ML

    Twin Networks: Matching the Future for Sequence Generation

    Authors: Dmitriy Serdyuk, Nan Rosemary Ke, Alessandro Sordoni, Adam Trischler, Chris Pal, Yoshua Bengio

    Abstract: We propose a simple technique for encouraging generative RNNs to plan ahead. We train a "backward" recurrent network to generate a given sequence in reverse order, and we encourage states of the forward model to predict cotemporal states of the backward model. The backward network is used only during training, and plays no role during sampling or inference. We hypothesize that our approach eases m… ▽ More

    Submitted 23 February, 2018; v1 submitted 22 August, 2017; originally announced August 2017.

    Comments: 12 pages, 3 figures, published at ICLR 2018

  35. arXiv:1606.01305  [pdf, other

    cs.NE cs.CL cs.LG

    Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations

    Authors: David Krueger, Tegan Maharaj, János Kramár, Mohammad Pezeshki, Nicolas Ballas, Nan Rosemary Ke, Anirudh Goyal, Yoshua Bengio, Aaron Courville, Chris Pal

    Abstract: We propose zoneout, a novel method for regularizing RNNs. At each timestep, zoneout stochastically forces some hidden units to maintain their previous values. Like dropout, zoneout uses random noise to train a pseudo-ensemble, improving generalization. But by preserving instead of dropping hidden units, gradient information and state information are more readily propagated through time, as in feed… ▽ More

    Submitted 22 September, 2017; v1 submitted 3 June, 2016; originally announced June 2016.

    Comments: David Krueger and Tegan Maharaj contributed equally to this work

  36. arXiv:1603.05359  [pdf, ps, other

    cs.LG stat.ML

    Cascading Bandits for Large-Scale Recommendation Problems

    Authors: Shi Zong, Hao Ni, Kenny Sung, Nan Rosemary Ke, Zheng Wen, Branislav Kveton

    Abstract: Most recommender systems recommend a list of items. The user examines the list, from the first item to the last, and often chooses the first attractive item and does not examine the rest. This type of user behavior can be modeled by the cascade model. In this work, we study cascading bandits, an online learning variant of the cascade model where the goal is to recommend $K$ most attractive items f… ▽ More

    Submitted 30 June, 2016; v1 submitted 17 March, 2016; originally announced March 2016.

    Comments: Accepted to UAI 2016

  37. arXiv:1511.06456  [pdf, other

    cs.LG

    Task Loss Estimation for Sequence Prediction

    Authors: Dzmitry Bahdanau, Dmitriy Serdyuk, Philémon Brakel, Nan Rosemary Ke, Jan Chorowski, Aaron Courville, Yoshua Bengio

    Abstract: Often, the performance on a supervised machine learning task is evaluated with a emph{task loss} function that cannot be optimized directly. Examples of such loss functions include the classification error, the edit distance and the BLEU score. A common workaround for this problem is to instead optimize a emph{surrogate loss} function, such as for instance cross-entropy or hinge loss. In order for… ▽ More

    Submitted 19 January, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: Submitted to ICLR 2016

  38. arXiv:1504.01483  [pdf, other

    cs.LG cs.CL cs.NE stat.ML

    Transferring Knowledge from a RNN to a DNN

    Authors: William Chan, Nan Rosemary Ke, Ian Lane

    Abstract: Deep Neural Network (DNN) acoustic models have yielded many state-of-the-art results in Automatic Speech Recognition (ASR) tasks. More recently, Recurrent Neural Network (RNN) models have been shown to outperform DNNs counterparts. However, state-of-the-art DNN and RNN models tend to be impractical to deploy on embedded systems with limited computational capacity. Traditionally, the approach for e… ▽ More

    Submitted 7 April, 2015; originally announced April 2015.