Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Kuderov, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.09287  [pdf, other

    cs.AI

    Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments

    Authors: Zoya Volovikova, Alexey Skrynnik, Petr Kuderov, Aleksandr I. Panov

    Abstract: In this study, we address the issue of enabling an artificial intelligence agent to execute complex language instructions within virtual environments. In our framework, we assume that these instructions involve intricate linguistic structures and multiple interdependent tasks that must be navigated successfully to achieve the desired outcomes. To effectively manage these complexities, we propose a… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  2. arXiv:2310.13391  [pdf, other

    cs.LG cs.AI cs.NE

    Learning Successor Features with Distributed Hebbian Temporal Memory

    Authors: Evgenii Dzhivelikian, Petr Kuderov, Aleksandr I. Panov

    Abstract: This paper presents a novel approach to address the challenge of online temporal memory learning for decision-making under uncertainty in non-stationary, partially observable environments. The proposed algorithm, Distributed Hebbian Temporal Memory (DHTM), is based on factor graph formalism and a multicomponent neuron model. DHTM aims to capture sequential data relationships and make cumulative pr… ▽ More

    Submitted 19 March, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: 20 pages, 9 figures