Showing 1–11 of 11 results for author: Johanson, M

  1. arXiv:2211.13746  [pdf, other]

    cs.MA cs.AI cs.GT cs.NE

    Melting Pot 2.0

    Authors: John P. Agapiou, Alexander Sasha Vezhnevets, Edgar A. Duéñez-Guzmán, Jayd Matyas, Yiran Mao, Peter Sunehag, Raphael Köster, Udari Madhushani, Kavya Kopparapu, Ramona Comanescu, DJ Strouse, Michael B. Johanson, Sukhdeep Singh, Julia Haas, Igor Mordatch, Dean Mobbs, Joel Z. Leibo

    Abstract: Multi-agent artificial intelligence research promises a path to develop intelligent technologies that are more human-like and more human-compatible than those produced by "solipsistic" approaches, which do not consider interactions between agents. Melting Pot is a research tool developed to facilitate work on multi-agent artificial intelligence, and provides an evaluation protocol that measures ge…

    Submitted 30 October, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: 69 pages, 54 figures. arXiv admin note: text overlap with arXiv:2107.06857

  2. arXiv:2205.06760  [pdf, other]

    cs.AI cs.LG cs.MA

    Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

    Authors: Michael Bradley Johanson, Edward Hughes, Finbarr Timbers, Joel Z. Leibo

    Abstract: Advances in artificial intelligence often stem from the development of new environments that abstract real-world situations into a form where research can be done conveniently. This paper contributes such an environment based on ideas inspired by elementary Microeconomics. Agents learn to produce resources in a spatially complex world, trade them with one another, and consume those that they prefe…

    Submitted 13 May, 2022; originally announced May 2022.

  3. arXiv:2203.09498  [pdf, other]

    cs.AI cs.CL cs.LG cs.MA

    The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents

    Authors: Patrick M. Pilarski, Andrew Butcher, Elnaz Davoodi, Michael Bradley Johanson, Dylan J. A. Brenneis, Adam S. R. Parker, Leslie Acker, Matthew M. Botvinick, Joseph Modayil, Adam White

    Abstract: Learned communication between agents is a powerful tool when approaching decision-making problems that are hard to overcome by any single agent in isolation. However, continual coordination and communication learning between machine agents or human-machine partnerships remains a challenging open problem. As a stepping stone toward solving the continual communication learning problem, in this paper…

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: 54 pages, 29 figures, 4 tables

  4. arXiv:2201.03709  [pdf, other]

    cs.AI cs.LG cs.MA

    Pavlovian Signalling with General Value Functions in Agent-Agent Temporal Decision Making

    Authors: Andrew Butcher, Michael Bradley Johanson, Elnaz Davoodi, Dylan J. A. Brenneis, Leslie Acker, Adam S. R. Parker, Adam White, Joseph Modayil, Patrick M. Pilarski

    Abstract: In this paper, we contribute a multi-faceted study into Pavlovian signalling -- a process by which learned, temporally extended predictions made by one agent inform decision-making by another agent. Signalling is intimately connected to time and timing. In service of generating and receiving signals, humans and other animals are known to represent time, determine time since past events, predict th…

    Submitted 10 January, 2022; originally announced January 2022.

    Comments: 9 pages, 7 figures

  5. arXiv:2112.07774  [pdf, other]

    cs.AI cs.HC cs.MA

    Assessing Human Interaction in Virtual Reality With Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study

    Authors: Dylan J. A. Brenneis, Adam S. Parker, Michael Bradley Johanson, Andrew Butcher, Elnaz Davoodi, Leslie Acker, Matthew M. Botvinick, Joseph Modayil, Adam White, Patrick M. Pilarski

    Abstract: Artificial intelligence systems increasingly involve continual learning to enable flexibility in general situations that are not encountered during system training. Human interaction with autonomous systems is broadly studied, but research has hitherto under-explored interactions that occur while the system is actively learning, and can noticeably change its behaviour in minutes. In this pilot stu…

    Submitted 22 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

  6. arXiv:2010.10380  [pdf, other]

    cs.LG cs.AI cs.MA

    Negotiating Team Formation Using Deep Reinforcement Learning

    Authors: Yoram Bachrach, Richard Everett, Edward Hughes, Angeliki Lazaridou, Joel Z. Leibo, Marc Lanctot, Michael Johanson, Wojciech M. Czarnecki, Thore Graepel

    Abstract: When autonomous agents interact in the same environment, they must often cooperate to achieve their goals. One way for agents to cooperate effectively is to form a team, make a binding agreement on a joint plan, and execute it. However, when agents are self-interested, the gains from team formation must be allocated appropriately to incentivize agreement. Various approaches for multi-agent negotia…

    Submitted 20 October, 2020; originally announced October 2020.

    ACM Class: I.2.6

    Journal ref: Artificial Intelligence 288 (2020): 103356

  7. arXiv:1905.02691  [pdf, other]

    cs.AI cs.HC cs.LG

    Learned human-agent decision-making, communication and joint action in a virtual reality environment

    Authors: Patrick M. Pilarski, Andrew Butcher, Michael Johanson, Matthew M. Botvinick, Andrew Bolt, Adam S. R. Parker

    Abstract: Humans make decisions and act alongside other humans to pursue both short-term and long-term goals. As a result of ongoing progress in areas such as computing science and automation, humans now also interact with non-human agents of varying complexity as part of their day-to-day activities; substantial work is being done to integrate increasingly intelligent machine agents into human work and play…

    Submitted 7 May, 2019; originally announced May 2019.

    Comments: 5 pages, 3 figures. Accepted to The 4th Multidisciplinary Conference on Reinforcement Learning and Decision Making, July 7-10, 2019, McGill University, Montreal, Quebec, Canada

  8. DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker

    Authors: Matej Moravčík, Martin Schmid, Neil Burch, Viliam Lisý, Dustin Morrill, Nolan Bard, Trevor Davis, Kevin Waugh, Michael Johanson, Michael Bowling

    Abstract: Artificial intelligence has seen several breakthroughs in recent years, with games often serving as milestones. A common feature of these games is that players have perfect information. Poker is the quintessential game of imperfect information, and a longstanding challenge problem in artificial intelligence. We introduce DeepStack, an algorithm for imperfect information settings. It combines recur…

    Submitted 3 March, 2017; v1 submitted 6 January, 2017; originally announced January 2017.

  9. arXiv:1511.02590  [pdf]

    cs.HC

    The Turing Test for Telepresence

    Authors: Mathias Johanson

    Abstract: The quality of high-end videoconferencing systems has improved significantly over the last few years enabling a class of applications known as "telepresence" wherein the users engaged in a communication session experience a feeling of mutual presence in a shared virtual space. Telepresence systems have reached a maturity level that seriously challenges the old familiar truism that a face-to-face m…

    Submitted 9 November, 2015; originally announced November 2015.

    Comments: The International Journal of Multimedia and its Applications (IJMA), Vol.7, No.4/5, October 2015

  10. arXiv:1303.4441  [pdf, other]

    cs.GT

    Solving Imperfect Information Games Using Decomposition

    Authors: Neil Burch, Michael Johanson, Michael Bowling

    Abstract: Decomposition, i.e. independently analyzing possible subgames, has proven to be an essential principle for effective decision-making in perfect information games. However, in imperfect information games, decomposition has proven to be problematic. To date, all proposed techniques for decomposition in imperfect information games have abandoned theoretical guarantees. This work presents the first te…

    Submitted 21 April, 2014; v1 submitted 18 March, 2013; originally announced March 2013.

    Comments: 7 pages by 2 columns, 5 figures; April 21 2014 - expand explanations and theory

  11. arXiv:1302.7008  [pdf, ps, other]

    cs.GT

    Measuring the Size of Large No-Limit Poker Games

    Authors: Michael Johanson

    Abstract: In the field of computational game theory, games are often compared in terms of their size. This can be measured in several ways, including the number of unique game states, the number of decision points, and the total number of legal actions over all decision points. These numbers are either known or estimated for a wide range of classic games such as chess and checkers. In the stochastic and imp…

    Submitted 7 March, 2013; v1 submitted 27 February, 2013; originally announced February 2013.

    Report number: TR13-01