Skip to main content

Showing 1–7 of 7 results for author: Malysheva, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.15378  [pdf, other

    cs.AI cs.GT cs.MA

    Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

    Authors: Julien Perolat, Bart de Vylder, Daniel Hennes, Eugene Tarassov, Florian Strub, Vincent de Boer, Paul Muller, Jerome T. Connor, Neil Burch, Thomas Anthony, Stephen McAleer, Romuald Elie, Sarah H. Cen, Zhe Wang, Audrunas Gruslys, Aleksandra Malysheva, Mina Khan, Sherjil Ozair, Finbarr Timbers, Toby Pohlen, Tom Eccles, Mark Rowland, Marc Lanctot, Jean-Baptiste Lespiau, Bilal Piot , et al. (9 additional authors not shown)

    Abstract: We introduce DeepNash, an autonomous agent capable of learning to play the imperfect information game Stratego from scratch, up to a human expert level. Stratego is one of the few iconic board games that Artificial Intelligence (AI) has not yet mastered. This popular game has an enormous game tree on the order of $10^{535}$ nodes, i.e., $10^{175}$ times larger than that of Go. It has the additiona… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

  2. DYPLODOC: Dynamic Plots for Document Classification

    Authors: Anastasia Malysheva, Alexey Tikhonov, Ivan P. Yamshchikov

    Abstract: Narrative generation and analysis are still on the fringe of modern natural language processing yet are crucial in a variety of applications. This paper proposes a feature extraction method for plot dynamics. We present a dataset that consists of the plot descriptions for thirteen thousand TV shows alongside meta-information on their genres and dynamic plots extracted from them. We validate the pr… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

    ACM Class: I.2.7; I.2.6

    Journal ref: in Modern Management based on Big Data II and Machine Learning and Intelligent Systems III 2021 (pp. 511-519). IOS Press

  3. End-to-end Deep Object Tracking with Circular Loss Function for Rotated Bounding Box

    Authors: Vladislav Belyaev, Aleksandra Malysheva, Aleksei Shpilman

    Abstract: The task object tracking is vital in numerous applications such as autonomous driving, intelligent surveillance, robotics, etc. This task entails the assigning of a bounding box to an object in a video stream, given only the bounding box for that object on the first frame. In 2015, a new type of video object tracking (VOT) dataset was created that introduced rotated bounding boxes as an extension… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

  4. MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning

    Authors: Aleksandra Malysheva, Daniel Kudenko, Aleksei Shpilman

    Abstract: Over recent years, deep reinforcement learning has shown strong successes in complex single-agent tasks, and more recently this approach has also been applied to multi-agent domains. In this paper, we propose a novel approach, called MAGNet, to multi-agent reinforcement learning that utilizes a relevance graph representation of the environment obtained by a self-attention mechanism, and a message-… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1811.12557

  5. Learning to Run with Potential-Based Reward Shaping and Demonstrations from Video Data

    Authors: Aleksandra Malysheva, Daniel Kudenko, Aleksei Shpilman

    Abstract: Learning to produce efficient movement behaviour for humanoid robots from scratch is a hard problem, as has been illustrated by the "Learning to run" competition at NIPS 2017. The goal of this competition was to train a two-legged model of a humanoid body to run in a simulated race course with maximum speed. All submissions took a tabula rasa approach to reinforcement learning (RL) and were able t… ▽ More

    Submitted 16 December, 2020; originally announced December 2020.

  6. arXiv:1902.02441  [pdf, other

    cs.LG cs.RO stat.ML

    Artificial Intelligence for Prosthetics - challenge solutions

    Authors: Łukasz Kidziński, Carmichael Ong, Sharada Prasanna Mohanty, Jennifer Hicks, Sean F. Carroll, Bo Zhou, Hongsheng Zeng, Fan Wang, Rongzhong Lian, Hao Tian, Wojciech Jaśkowski, Garrett Andersen, Odd Rune Lykkebø, Nihat Engin Toklu, Pranav Shyam, Rupesh Kumar Srivastava, Sergey Kolesnikov, Oleksii Hrinchuk, Anton Pechenko, Mattias Ljungström, Zhen Wang, Xu Hu, Zehong Hu, Minghui Qiu, Jun Huang , et al. (25 additional authors not shown)

    Abstract: In the NeurIPS 2018 Artificial Intelligence for Prosthetics challenge, participants were tasked with building a controller for a musculoskeletal model with a goal of matching a given time-varying velocity vector. Top participants were invited to describe their algorithms. In this work, we describe the challenge and present thirteen solutions that used deep reinforcement learning approaches. Many s… ▽ More

    Submitted 6 February, 2019; originally announced February 2019.

  7. arXiv:1811.12557  [pdf, other

    cs.MA cs.LG

    Deep Multi-Agent Reinforcement Learning with Relevance Graphs

    Authors: Aleksandra Malysheva, Tegg Taekyong Sung, Chae-Bong Sohn, Daniel Kudenko, Aleksei Shpilman

    Abstract: Over recent years, deep reinforcement learning has shown strong successes in complex single-agent tasks, and more recently this approach has also been applied to multi-agent domains. In this paper, we propose a novel approach, called MAGnet, to multi-agent reinforcement learning (MARL) that utilizes a relevance graph representation of the environment obtained by a self-attention mechanism, and a m… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.

    Comments: The first two authors contributed equally. Author ordering determined by coin flip over a Google Hangout. Accepted at NIPS 2018 Deep RL Workshop