Zum Hauptinhalt springen

Showing 1–10 of 10 results for author: Vezhnevets, A S

.
  1. arXiv:2312.03664  [pdf, other

    cs.AI cs.CL

    Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia

    Authors: Alexander Sasha Vezhnevets, John P. Agapiou, Avia Aharon, Ron Ziv, Jayd Matyas, Edgar A. Duéñez-Guzmán, William A. Cunningham, Simon Osindero, Danny Karmon, Joel Z. Leibo

    Abstract: Agent-based modeling has been around for decades, and applied widely across the social and natural sciences. The scope of this research method is now poised to grow dramatically as it absorbs the new affordances provided by Large Language Models (LLM)s. Generative Agent-Based Models (GABM) are not just classic Agent-Based Models (ABM)s where the agents talk to one another. Rather, GABMs are constr… ▽ More

    Submitted 13 December, 2023; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: 32 pages, 5 figures

  2. arXiv:2302.01180  [pdf, other

    cs.AI cs.NE

    Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition

    Authors: Peter Sunehag, Alexander Sasha Vezhnevets, Edgar Duéñez-Guzmán, Igor Mordach, Joel Z. Leibo

    Abstract: Many environments contain numerous available niches of variable value, each associated with a different local optimum in the space of behaviors (policy space). In such situations it is often difficult to design a learning process capable of evading distraction by poor local optima long enough to stumble upon the best available niche. In this work we propose a generic reinforcement learning (RL) al… ▽ More

    Submitted 3 February, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: Full length paper accompanying short format appearing at AAMAS 2023

    ACM Class: I.2

  3. arXiv:2211.13746  [pdf, other

    cs.MA cs.AI cs.GT cs.NE

    Melting Pot 2.0

    Authors: John P. Agapiou, Alexander Sasha Vezhnevets, Edgar A. Duéñez-Guzmán, Jayd Matyas, Yiran Mao, Peter Sunehag, Raphael Köster, Udari Madhushani, Kavya Kopparapu, Ramona Comanescu, DJ Strouse, Michael B. Johanson, Sukhdeep Singh, Julia Haas, Igor Mordatch, Dean Mobbs, Joel Z. Leibo

    Abstract: Multi-agent artificial intelligence research promises a path to develop intelligent technologies that are more human-like and more human-compatible than those produced by "solipsistic" approaches, which do not consider interactions between agents. Melting Pot is a research tool developed to facilitate work on multi-agent artificial intelligence, and provides an evaluation protocol that measures ge… ▽ More

    Submitted 30 October, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: 69 pages, 54 figures. arXiv admin note: text overlap with arXiv:2107.06857

  4. arXiv:2201.01816  [pdf, other

    cs.AI cs.LG cs.MA

    Hidden Agenda: a Social Deduction Game with Diverse Learned Equilibria

    Authors: Kavya Kopparapu, Edgar A. Duéñez-Guzmán, Jayd Matyas, Alexander Sasha Vezhnevets, John P. Agapiou, Kevin R. McKee, Richard Everett, Janusz Marecki, Joel Z. Leibo, Thore Graepel

    Abstract: A key challenge in the study of multiagent cooperation is the need for individual agents not only to cooperate effectively, but to decide with whom to cooperate. This is particularly critical in situations when other agents have hidden, possibly misaligned motivations and goals. Social deduction games offer an avenue to study how individuals might learn to synthesize potentially unreliable informa… ▽ More

    Submitted 5 January, 2022; originally announced January 2022.

  5. arXiv:2110.11404  [pdf, other

    cs.LG cs.AI cs.GT cs.MA

    Statistical discrimination in learning agents

    Authors: Edgar A. Duéñez-Guzmán, Kevin R. McKee, Yiran Mao, Ben Coppin, Silvia Chiappa, Alexander Sasha Vezhnevets, Michiel A. Bakker, Yoram Bachrach, Suzanne Sadedin, William Isaac, Karl Tuyls, Joel Z. Leibo

    Abstract: Undesired bias afflicts both human and algorithmic decision making, and may be especially prevalent when information processing trade-offs incentivize the use of heuristics. One primary example is \textit{statistical discrimination} -- selecting social partners based not on their underlying attributes, but on readily perceptible characteristics that covary with their suitability for the task at ha… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: 29 pages, 10 figures

    MSC Class: 68T07 (Primary) 91A26; 91-10; 93A16 (Secondary) ACM Class: I.2.11; I.2.0

  6. arXiv:2107.06857  [pdf, other

    cs.MA cs.AI

    Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot

    Authors: Joel Z. Leibo, Edgar Duéñez-Guzmán, Alexander Sasha Vezhnevets, John P. Agapiou, Peter Sunehag, Raphael Koster, Jayd Matyas, Charles Beattie, Igor Mordatch, Thore Graepel

    Abstract: Existing evaluation suites for multi-agent reinforcement learning (MARL) do not assess generalization to novel situations as their primary objective (unlike supervised-learning benchmarks). Our contribution, Melting Pot, is a MARL evaluation suite that fills this gap, and uses reinforcement learning to reduce the human labor required to create novel test scenarios. This works because one agent's b… ▽ More

    Submitted 14 July, 2021; originally announced July 2021.

    Comments: Accepted to ICML 2021 and presented as a long talk; 33 pages; 9 figures

    Journal ref: In International Conference on Machine Learning 2021 (pp. 6187-6199). PMLR

  7. arXiv:2106.09012  [pdf, other

    cs.MA

    A learning agent that acquires social norms from public sanctions in decentralized multi-agent settings

    Authors: Eugene Vinitsky, Raphael Köster, John P. Agapiou, Edgar Duéñez-Guzmán, Alexander Sasha Vezhnevets, Joel Z. Leibo

    Abstract: Society is characterized by the presence of a variety of social norms: collective patterns of sanctioning that can prevent miscoordination and free-riding. Inspired by this, we aim to construct learning dynamics where potentially beneficial social norms can emerge. Since social norms are underpinned by sanctioning, we introduce a training regime where agents can access all sanctioning events but l… ▽ More

    Submitted 27 September, 2022; v1 submitted 16 June, 2021; originally announced June 2021.

  8. arXiv:1906.01470  [pdf, other

    cs.LG cs.AI cs.MA cs.NE stat.ML

    Options as responses: Grounding behavioural hierarchies in multi-agent RL

    Authors: Alexander Sasha Vezhnevets, Yuhuai Wu, Remi Leblond, Joel Z. Leibo

    Abstract: This paper investigates generalisation in multi-agent games, where the generality of the agent can be evaluated by playing against opponents it hasn't seen during training. We propose two new games with concealed information and complex, non-transitive reward structure (think rock/paper/scissors). It turns out that most current deep reinforcement learning methods fail to efficiently explore the st… ▽ More

    Submitted 10 July, 2020; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: First two authors contributed equally

    Journal ref: International Conference on Machine Learning 2020

  9. arXiv:1708.04782  [pdf, other

    cs.LG cs.AI

    StarCraft II: A New Challenge for Reinforcement Learning

    Authors: Oriol Vinyals, Timo Ewalds, Sergey Bartunov, Petko Georgiev, Alexander Sasha Vezhnevets, Michelle Yeo, Alireza Makhzani, Heinrich Küttler, John Agapiou, Julian Schrittwieser, John Quan, Stephen Gaffney, Stig Petersen, Karen Simonyan, Tom Schaul, Hado van Hasselt, David Silver, Timothy Lillicrap, Kevin Calderone, Paul Keet, Anthony Brunasso, David Lawrence, Anders Ekermo, Jacob Repp, Rodney Tsing

    Abstract: This paper introduces SC2LE (StarCraft II Learning Environment), a reinforcement learning environment based on the StarCraft II game. This domain poses a new grand challenge for reinforcement learning, representing a more difficult class of problems than considered in most prior work. It is a multi-agent problem with multiple players interacting; there is imperfect information due to a partially o… ▽ More

    Submitted 16 August, 2017; originally announced August 2017.

    Comments: Collaboration between DeepMind & Blizzard. 20 pages, 9 figures, 2 tables

  10. arXiv:1703.01161  [pdf, other

    cs.AI

    FeUdal Networks for Hierarchical Reinforcement Learning

    Authors: Alexander Sasha Vezhnevets, Simon Osindero, Tom Schaul, Nicolas Heess, Max Jaderberg, David Silver, Koray Kavukcuoglu

    Abstract: We introduce FeUdal Networks (FuNs): a novel architecture for hierarchical reinforcement learning. Our approach is inspired by the feudal reinforcement learning proposal of Dayan and Hinton, and gains power and efficacy by decoupling end-to-end learning across multiple levels -- allowing it to utilise different resolutions of time. Our framework employs a Manager module and a Worker module. The Ma… ▽ More

    Submitted 6 March, 2017; v1 submitted 3 March, 2017; originally announced March 2017.