Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Borsa, D L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.00654  [pdf, other

    cs.LG cs.AI

    Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition

    Authors: Yash Chandak, Shantanu Thakoor, Zhaohan Daniel Guo, Yunhao Tang, Remi Munos, Will Dabney, Diana L Borsa

    Abstract: Representation learning and exploration are among the key challenges for any deep reinforcement learning agent. In this work, we provide a singular value decomposition based method that can be used to obtain representations that preserve the underlying transition structure in the domain. Perhaps interestingly, we show that these representations also capture the relative frequency of state visitati… ▽ More

    Submitted 2 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: Accepted at the 40th International Conference on Machine Learning (ICML 2023)

  2. arXiv:2010.02255  [pdf, other

    cs.AI cs.LG stat.ML

    Temporal Difference Uncertainties as a Signal for Exploration

    Authors: Sebastian Flennerhag, Jane X. Wang, Pablo Sprechmann, Francesco Visin, Alexandre Galashov, Steven Kapturowski, Diana L. Borsa, Nicolas Heess, Andre Barreto, Razvan Pascanu

    Abstract: An effective approach to exploration in reinforcement learning is to rely on an agent's uncertainty over the optimal policy, which can yield near-optimal exploration strategies in tabular settings. However, in non-tabular settings that involve function approximators, obtaining accurate uncertainty estimates is almost as challenging a problem. In this paper, we highlight that value estimates are ea… ▽ More

    Submitted 1 July, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: 9 pages, 11 figures, 5 tables