
Showing 1–4 of 4 results for author: Salvatier, J

  1. arXiv:2011.06709  [pdf, other]

    cs.LG cs.AI stat.ML

    Active Reinforcement Learning: Observing Rewards at a Cost

    Authors: David Krueger, Jan Leike, Owain Evans, John Salvatier

    Abstract: Active reinforcement learning (ARL) is a variant on reinforcement learning where the agent does not observe the reward unless it chooses to pay a query cost c > 0. The central question of ARL is how to quantify the long-term value of reward information. Even in multi-armed bandits, computing the value of this information is intractable and we have to rely on heuristics. We propose and evaluate sev…

    Submitted 24 November, 2020; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: Originally appeared at the NeurIPS 2016 "Future of Interactive Learning Machines (FILM)" workshop

  2. arXiv:1705.08807  [pdf, other]

    cs.AI cs.CY

    When Will AI Exceed Human Performance? Evidence from AI Experts

    Authors: Katja Grace, John Salvatier, Allan Dafoe, Baobao Zhang, Owain Evans

    Abstract: Advances in artificial intelligence (AI) will transform modern life by reshaping transportation, health, science, finance, and the military. To adapt public policy, we need to better anticipate these advances. Here we report the results from a large survey of machine learning researchers on their beliefs about progress in AI. Researchers predict AI will outperform humans in many activities in the…

    Submitted 3 May, 2018; v1 submitted 24 May, 2017; originally announced May 2017.

    Comments: Accepted by Journal of Artificial Intelligence Research (AI and Society Track). Minor update to refer to related work (page 5)

  3. arXiv:1701.04079  [pdf, other]

    cs.LG cs.AI

    Agent-Agnostic Human-in-the-Loop Reinforcement Learning

    Authors: David Abel, John Salvatier, Andreas Stuhlmüller, Owain Evans

    Abstract: Providing Reinforcement Learning agents with expert advice can dramatically improve various aspects of learning. Prior work has developed teaching protocols that enable agents to learn efficiently in complex environments; many of these methods tailor the teacher's guidance to agents with a particular representation or underlying learning scheme, offering effective but specialized teaching procedur…

    Submitted 15 January, 2017; originally announced January 2017.

    Comments: Presented at the NIPS Workshop on the Future of Interactive Learning Machines, 2016

  4. arXiv:1605.02688  [pdf, other]

    cs.SC cs.LG cs.MS

    Theano: A Python framework for fast computation of mathematical expressions

    Authors: The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano, et al. (88 additional authors not shown)

    Abstract: Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, mu…

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 19 pages, 5 figures