Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Akimov, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2211.11096  [pdf, other

    cs.LG cs.AI cs.NE

    Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows

    Authors: Dmitriy Akimov, Vladislav Kurenkov, Alexander Nikulin, Denis Tarasov, Sergey Kolesnikov

    Abstract: Offline reinforcement learning aims to train a policy on a pre-recorded and fixed dataset without any additional environment interactions. There are two major challenges in this setting: (1) extrapolation error caused by approximating the value of state-action pairs not well-covered by the training data and (2) distributional shift between behavior and inference policies. One way to tackle these p… ▽ More

    Submitted 30 January, 2023; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: Accepted at 3rd Offline Reinforcement Learning Workshop at Neural Information Processing Systems, 2022. Source code: https://github.com/tinkoff-ai/cnf

  2. arXiv:2211.11092  [pdf, other

    cs.LG cs.AI cs.NE

    Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size

    Authors: Alexander Nikulin, Vladislav Kurenkov, Denis Tarasov, Dmitry Akimov, Sergey Kolesnikov

    Abstract: Training large neural networks is known to be time-consuming, with the learning duration taking days or even weeks. To address this problem, large-batch optimization was introduced. This approach demonstrated that scaling mini-batch sizes with appropriate learning rate adjustments can speed up the training process by orders of magnitude. While long training time was not typically a major issue for… ▽ More

    Submitted 30 January, 2023; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: Accepted at 3rd Offline Reinforcement Learning Workshop at Neural Information Processing Systems, 2022. Source code: https://github.com/tinkoff-ai/lb-sac

  3. arXiv:2210.07105  [pdf, other

    cs.LG cs.AI

    CORL: Research-oriented Deep Offline Reinforcement Learning Library

    Authors: Denis Tarasov, Alexander Nikulin, Dmitry Akimov, Vladislav Kurenkov, Sergey Kolesnikov

    Abstract: CORL is an open-source library that provides thoroughly benchmarked single-file implementations of both deep offline and offline-to-online reinforcement learning algorithms. It emphasizes a simple developing experience with a straightforward codebase and a modern analysis tracking tool. In CORL, we isolate methods implementation into separate single files, making performance-relevant details easie… ▽ More

    Submitted 26 October, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Conference on Neural Information Processing Systems (NeurIPS 2023) Track on Datasets and Benchmarks. Source code at https://github.com/corl-team/CORL

  4. arXiv:1911.13056  [pdf, other

    cs.AI cs.LG

    Distributed Soft Actor-Critic with Multivariate Reward Representation and Knowledge Distillation

    Authors: Dmitry Akimov

    Abstract: In this paper, we describe NeurIPS 2019 Learning to Move - Walk Around challenge physics-based environment and present our solution to this competition which scored 1303.727 mean reward points and took 3rd place. Our method combines recent advances from both continuous- and discrete-action space reinforcement learning, such as Soft Actor-Critic and Recurrent Experience Replay in Distributed Reinfo… ▽ More

    Submitted 10 April, 2020; v1 submitted 29 November, 2019; originally announced November 2019.

    Comments: 9 pages, 5 figures