Skip to main content

Showing 1–13 of 13 results for author: Friesen, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.03870  [pdf, other

    cs.LG

    Knowledge Transfer from Teachers to Learners in Growing-Batch Reinforcement Learning

    Authors: Patrick Emedom-Nnamdi, Abram L. Friesen, Bobak Shahriari, Nando de Freitas, Matt W. Hoffman

    Abstract: Standard approaches to sequential decision-making exploit an agent's ability to continually interact with its environment and improve its control policy. However, due to safety, ethical, and practicality constraints, this type of trial-and-error experimentation is often infeasible in many real-world domains such as healthcare and robotics. Instead, control policies in these domains are typically t… ▽ More

    Submitted 9 May, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: Reincarnating Reinforcement Learning Workshop at ICLR 2023

  2. arXiv:2210.04971  [pdf, other

    cs.LG cs.AI

    Multi-step Planning for Automated Hyperparameter Optimization with OptFormer

    Authors: Lucio M. Dery, Abram L. Friesen, Nando De Freitas, Marc'Aurelio Ranzato, Yutian Chen

    Abstract: As machine learning permeates more industries and models become more expensive and time consuming to train, the need for efficient automated hyperparameter optimization (HPO) has never been more pressing. Multi-step planning based approaches to hyperparameter optimization promise improved efficiency over myopic alternatives by more effectively balancing out exploration and exploitation. However, t… ▽ More

    Submitted 16 November, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 8 pages, 7 figures

  3. arXiv:2204.10256  [pdf, other

    cs.LG cs.AI

    Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach

    Authors: Bobak Shahriari, Abbas Abdolmaleki, Arunkumar Byravan, Abe Friesen, Siqi Liu, Jost Tobias Springenberg, Nicolas Heess, Matt Hoffman, Martin Riedmiller

    Abstract: Actor-critic algorithms that make use of distributional policy evaluation have frequently been shown to outperform their non-distributional counterparts on many challenging control tasks. Examples of this behavior include the D4PG and DMPO algorithms as compared to DDPG and MPO, respectively [Barth-Maron et al., 2018; Hoffman et al., 2020]. However, both agents rely on the C51 critic for value est… ▽ More

    Submitted 22 April, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

  4. arXiv:2202.08417  [pdf, other

    cs.LG

    Retrieval-Augmented Reinforcement Learning

    Authors: Anirudh Goyal, Abram L. Friesen, Andrea Banino, Theophane Weber, Nan Rosemary Ke, Adria Puigdomenech Badia, Arthur Guez, Mehdi Mirza, Peter C. Humphreys, Ksenia Konyushkova, Laurent Sifre, Michal Valko, Simon Osindero, Timothy Lillicrap, Nicolas Heess, Charles Blundell

    Abstract: Most deep reinforcement learning (RL) algorithms distill experience into parametric behavior policies or value functions via gradient updates. While effective, this approach has several disadvantages: (1) it is computationally expensive, (2) it can take many updates to integrate experiences into the parametric model, (3) experiences that are not fully integrated do not appropriately influence the… ▽ More

    Submitted 24 May, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

  5. arXiv:2112.04153  [pdf, other

    cs.LG cs.AI

    Model-Value Inconsistency as a Signal for Epistemic Uncertainty

    Authors: Angelos Filos, Eszter Vértes, Zita Marinho, Gregory Farquhar, Diana Borsa, Abram Friesen, Feryal Behbahani, Tom Schaul, André Barreto, Simon Osindero

    Abstract: Using a model of the environment and a value function, an agent can construct many estimates of a state's value, by unrolling the model for different lengths and bootstrapping with its value function. Our key insight is that one can treat this set of value estimates as a type of ensemble, which we call an \emph{implicit value ensemble} (IVE). Consequently, the discrepancy between these estimates c… ▽ More

    Submitted 29 June, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: The first three authors contributed equally. Accepted at ICML 2022

  6. arXiv:2109.11397  [pdf, other

    cs.PL

    Position Paper: Goals of the Luau Type System

    Authors: Lily Brown, Andy Friesen, Alan Jeffrey

    Abstract: Luau is the scripting language that powers user-generated experiences on the Roblox platform. It is a statically-typed language, based on the dynamically-typed Lua language, with type inference. These types are used for providing editor assistance in Roblox Studio, the IDE for authoring Roblox experiences. Due to Roblox's uniquely heterogeneous developer community, Luau must operate in a somewhat… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: In HATRA '21: Human Aspects of Types and Reasoning Assistants 2021

  7. arXiv:2109.10781  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Introducing Symmetries to Black Box Meta Reinforcement Learning

    Authors: Louis Kirsch, Sebastian Flennerhag, Hado van Hasselt, Abram Friesen, Junhyuk Oh, Yutian Chen

    Abstract: Meta reinforcement learning (RL) attempts to discover new RL algorithms automatically from environment interaction. In so-called black-box approaches, the policy and the learning algorithm are jointly represented by a single neural network. These methods are very flexible, but they tend to underperform in terms of generalisation to new, unseen environments. In this paper, we explore the role of sy… ▽ More

    Submitted 5 June, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

    Comments: AAAI 2022

  8. arXiv:2011.04021  [pdf, other

    cs.AI cs.LG

    On the role of planning in model-based deep reinforcement learning

    Authors: Jessica B. Hamrick, Abram L. Friesen, Feryal Behbahani, Arthur Guez, Fabio Viola, Sims Witherspoon, Thomas Anthony, Lars Buesing, Petar Veličković, Théophane Weber

    Abstract: Model-based planning is often thought to be necessary for deep, careful reasoning and generalization in artificial agents. While recent successes of model-based reinforcement learning (MBRL) with deep function approximation have strengthened this hypothesis, the resulting diversity of model-based methods has also made it difficult to track which components drive success and why. In this paper, we… ▽ More

    Submitted 17 March, 2021; v1 submitted 8 November, 2020; originally announced November 2020.

    Comments: Published at ICLR 2021

  9. arXiv:2006.00979  [pdf, other

    cs.LG cs.AI

    Acme: A Research Framework for Distributed Reinforcement Learning

    Authors: Matthew W. Hoffman, Bobak Shahriari, John Aslanides, Gabriel Barth-Maron, Nikola Momchev, Danila Sinopalnikov, Piotr Stańczyk, Sabela Ramos, Anton Raichuk, Damien Vincent, Léonard Hussenot, Robert Dadashi, Gabriel Dulac-Arnold, Manu Orsini, Alexis Jacq, Johan Ferret, Nino Vieillard, Seyed Kamyar Seyed Ghasemipour, Sertan Girgin, Olivier Pietquin, Feryal Behbahani, Tamara Norman, Abbas Abdolmaleki, Albin Cassirer, Fan Yang , et al. (14 additional authors not shown)

    Abstract: Deep reinforcement learning (RL) has led to many recent and groundbreaking advances. However, these advances have often come at the cost of both increased scale in the underlying architectures being trained as well as increased complexity of the RL algorithms used to train them. These increases have in turn made it more difficult for researchers to rapidly prototype new ideas or reproduce publishe… ▽ More

    Submitted 20 September, 2022; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: This work presents a second version of the paper which coincides with an increase in modularity, additional emphasis on offline, imitation and learning from demonstrations algorithms, as well as various new agents implemented as part of Acme

  10. arXiv:1909.05557  [pdf, other

    cs.LG cs.AI stat.ML

    Modular Meta-Learning with Shrinkage

    Authors: Yutian Chen, Abram L. Friesen, Feryal Behbahani, Arnaud Doucet, David Budden, Matthew W. Hoffman, Nando de Freitas

    Abstract: Many real-world problems, including multi-speaker text-to-speech synthesis, can greatly benefit from the ability to meta-learn large models with only a few task-specific components. Updating only these task-specific modules then allows the model to be adapted to low-data tasks for as many steps as necessary without risking overfitting. Unfortunately, existing meta-learning methods either do not sc… ▽ More

    Submitted 22 October, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

    Comments: Accepted by NeurIPS 2020

  11. arXiv:1710.11573  [pdf, other

    cs.LG cs.CV cs.NE

    Deep Learning as a Mixed Convex-Combinatorial Optimization Problem

    Authors: Abram L. Friesen, Pedro Domingos

    Abstract: As neural networks grow deeper and wider, learning networks with hard-threshold activations is becoming increasingly important, both for network quantization, which can drastically reduce time and energy requirements, and for creating large integrated systems of deep networks, which may have non-differentiable components and must avoid vanishing and exploding gradients for effective learning. Howe… ▽ More

    Submitted 16 April, 2018; v1 submitted 31 October, 2017; originally announced October 2017.

    Comments: 14 pages (9 body, 5 pages of references and appendices)

    Journal ref: In Proceedings of the International Conference on Learning Representations (ICLR) 2018

  12. arXiv:1611.03553  [pdf, other

    cs.LG cs.AI

    The Sum-Product Theorem: A Foundation for Learning Tractable Models

    Authors: Abram L. Friesen, Pedro Domingos

    Abstract: Inference in expressive probabilistic models is generally intractable, which makes them difficult to learn and limits their applicability. Sum-product networks are a class of deep models where, surprisingly, inference remains tractable even when an arbitrary number of hidden layers are present. In this paper, we generalize this result to a much broader set of learning problems: all those where inf… ▽ More

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: 15 pages (10 body, 5 pages of appendices)

    Journal ref: Proceedings of the 33rd International Conference on Machine Learning, pp. 1909-1918, 2016

  13. arXiv:1611.02755  [pdf, other

    cs.AI cs.LG stat.ML

    Recursive Decomposition for Nonconvex Optimization

    Authors: Abram L. Friesen, Pedro Domingos

    Abstract: Continuous optimization is an important problem in many areas of AI, including vision, robotics, probabilistic inference, and machine learning. Unfortunately, most real-world optimization problems are nonconvex, causing standard convex techniques to find only local optima, even with extensions like random restarts and simulated annealing. We observe that, in many cases, the local modes of the obje… ▽ More

    Submitted 8 November, 2016; originally announced November 2016.

    Comments: 11 pages, 7 figures, pdflatex

    Journal ref: Proceedings of the 24th International Joint Conference on Artificial Intelligence (2015), pp. 253-259