Zum Hauptinhalt springen

Showing 1–7 of 7 results for author: Totaro, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2108.05828  [pdf, other

    cs.LG cs.AI stat.ML

    A general class of surrogate functions for stable and efficient reinforcement learning

    Authors: Sharan Vaswani, Olivier Bachem, Simone Totaro, Robert Mueller, Shivam Garg, Matthieu Geist, Marlos C. Machado, Pablo Samuel Castro, Nicolas Le Roux

    Abstract: Common policy gradient methods rely on the maximization of a sequence of surrogate functions. In recent years, many such surrogate functions have been proposed, most without strong theoretical guarantees, leading to algorithms such as TRPO, PPO or MPO. Rather than design yet another surrogate function, we instead propose a general framework (FMA-PG) based on functional mirror ascent that gives ris… ▽ More

    Submitted 30 October, 2023; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: Fixed minor typos

  2. arXiv:2106.01655  [pdf, other

    cs.LG cs.AI

    Hierarchical Representation Learning for Markov Decision Processes

    Authors: Lorenzo Steccanella, Simone Totaro, Anders Jonsson

    Abstract: In this paper we present a novel method for learning hierarchical representations of Markov decision processes. Our method works by partitioning the state space into subsets, and defines subtasks for performing transitions between the partitions. We formulate the problem of partitioning the state space as an optimization problem that can be solved using gradient descent given a set of sampled traj… ▽ More

    Submitted 19 December, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

  3. arXiv:2011.06335  [pdf, other

    cs.LG cs.AI

    Hierarchical reinforcement learning for efficient exploration and transfer

    Authors: Lorenzo Steccanella, Simone Totaro, Damien Allonsius, Anders Jonsson

    Abstract: Sparse-reward domains are challenging for reinforcement learning algorithms since significant exploration is needed before encountering reward for the first time. Hierarchical reinforcement learning can facilitate exploration by reducing the number of decisions necessary before obtaining a reward. In this paper, we present a novel hierarchical reinforcement learning framework based on the compress… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

  4. arXiv:2005.08006  [pdf, other

    eess.SY cs.AI cs.LG

    Lifelong Control of Off-grid Microgrid with Model Based Reinforcement Learning

    Authors: Simone Totaro, Ioannis Boukas, Anders Jonsson, Bertrand Cornélusse

    Abstract: The lifelong control problem of an off-grid microgrid is composed of two tasks, namely estimation of the condition of the microgrid devices and operational planning accounting for the uncertainties by forecasting the future consumption and the renewable production. The main challenge for the effective control arises from the various changes that take place over time. In this paper, we present an o… ▽ More

    Submitted 16 May, 2020; originally announced May 2020.

  5. arXiv:2005.06364  [pdf, other

    eess.SY cs.LG

    Adaptive Smoothing Path Integral Control

    Authors: Dominik Thalmeier, Hilbert J. Kappen, Simone Totaro, Vicenç Gómez

    Abstract: In Path Integral control problems a representation of an optimally controlled dynamical system can be formally computed and serve as a guidepost to learn a parametrized policy. The Path Integral Cross-Entropy (PICE) method tries to exploit this, but is hampered by poor sample efficiency. We propose a model-free algorithm called ASPIC (Adaptive Smoothing of Path Integral Control) that applies an in… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

    Comments: 23 pages, 5 figures, NeurIPS 2019 Optimization Foundations of Reinforcement Learning Workshop (OptRL 2019)

  6. arXiv:1807.04065  [pdf, other

    cs.NE cs.LG stat.ML

    Recurrent Neural Networks with Flexible Gates using Kernel Activation Functions

    Authors: Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Simone Totaro, Aurelio Uncini

    Abstract: Gated recurrent neural networks have achieved remarkable results in the analysis of sequential data. Inside these networks, gates are used to control the flow of information, allowing to model even very long-term dependencies in the data. In this paper, we investigate whether the original gate equation (a linear projection followed by an element-wise sigmoid) can be improved. In particular, we des… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: Accepted for presentation at 2018 IEEE International Workshop on Machine Learning for Signal Processing (MLSP)

  7. arXiv:1707.04035  [pdf, other

    stat.ML cs.AI cs.LG cs.NE

    Kafnets: kernel-based non-parametric activation functions for neural networks

    Authors: Simone Scardapane, Steven Van Vaerenbergh, Simone Totaro, Aurelio Uncini

    Abstract: Neural networks are generally built by interleaving (adaptable) linear layers with (fixed) nonlinear activation functions. To increase their flexibility, several authors have proposed methods for adapting the activation functions themselves, endowing them with varying degrees of flexibility. None of these approaches, however, have gained wide acceptance in practice, and research in this topic rema… ▽ More

    Submitted 23 November, 2017; v1 submitted 13 July, 2017; originally announced July 2017.

    Comments: Preprint submitted to Neural Networks (Elsevier)