Zum Hauptinhalt springen

Showing 1–29 of 29 results for author: Gemp, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.16985  [pdf, other

    cs.GT cs.SE

    Visualizing 2x2 Normal-Form Games: twoxtwogame LaTeX Package

    Authors: Luke Marris, Ian Gemp, Siqi Liu, Joel Z. Leibo, Georgios Piliouras

    Abstract: Normal-form games with two players, each with two strategies, are the most studied class of games. These so-called 2x2 games are used to model a variety of strategic interactions. They appear in game theory, economics, and artificial intelligence research. However, there lacks tools for describing and visualizing such games. This work introduces a LaTeX package for visualizing 2x2 games. This work… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  2. arXiv:2402.08393  [pdf, other

    cs.GT

    NfgTransformer: Equivariant Representation Learning for Normal-form Games

    Authors: Siqi Liu, Luke Marris, Georgios Piliouras, Ian Gemp, Nicolas Heess

    Abstract: Normal-form games (NFGs) are the fundamental model of strategic interaction. We study their representation using neural networks. We describe the inherent equivariance of NFGs -- any permutation of strategies describes an equivalent game -- as well as the challenges this poses for representation learning. We then propose the NfgTransformer architecture that leverages this equivariance, leading to… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Published at ICLR 2024. Open-sourced at https://github.com/google-deepmind/nfg_transformer

  3. arXiv:2402.03928  [pdf, other

    cs.GT cs.MA

    Approximating the Core via Iterative Coalition Sampling

    Authors: Ian Gemp, Marc Lanctot, Luke Marris, Yiran Mao, Edgar Duéñez-Guzmán, Sarah Perrin, Andras Gyorgy, Romuald Elie, Georgios Piliouras, Michael Kaisers, Daniel Hennes, Kalesha Bullard, Kate Larson, Yoram Bachrach

    Abstract: The core is a central solution concept in cooperative game theory, defined as the set of feasible allocations or payments such that no subset of agents has incentive to break away and form their own subgroup or coalition. However, it has long been known that the core (and approximations, such as the least-core) are hard to compute. This limits our ability to analyze cooperative games in general, a… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Published in AAMAS 2024

  4. arXiv:2402.01704  [pdf, other

    cs.CL cs.AI cs.GT

    States as Strings as Strategies: Steering Language Models with Game-Theoretic Solvers

    Authors: Ian Gemp, Yoram Bachrach, Marc Lanctot, Roma Patel, Vibhavari Dasagi, Luke Marris, Georgios Piliouras, Siqi Liu, Karl Tuyls

    Abstract: Game theory is the study of mathematical models of strategic interactions among rational agents. Language is a key medium of interaction for humans, though it has historically proven difficult to model dialogue and its strategic motivations mathematically. A suitable model of the players, strategies, and payoffs associated with linguistic interactions (i.e., a binding to the conventional symbolic… ▽ More

    Submitted 6 February, 2024; v1 submitted 24 January, 2024; originally announced February 2024.

    Comments: 32 pages, 8 figures, code available @ https://github.com/google-deepmind/open_spiel/blob/master/open_spiel/python/games/chat_game.py

  5. arXiv:2310.06689  [pdf, other

    cs.GT cs.MA

    Approximating Nash Equilibria in Normal-Form Games via Stochastic Optimization

    Authors: Ian Gemp, Luke Marris, Georgios Piliouras

    Abstract: We propose the first loss function for approximate Nash equilibria of normal-form games that is amenable to unbiased Monte Carlo estimation. This construction allows us to deploy standard non-convex stochastic optimization techniques for approximating Nash equilibria, resulting in novel algorithms with provable guarantees. We complement our theoretical analysis with experiments demonstrating that… ▽ More

    Submitted 15 April, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Published at ICLR 2024

  6. arXiv:2304.09978  [pdf, other

    cs.GT cs.MA econ.TH math.OC

    Equilibrium-Invariant Embedding, Metric Space, and Fundamental Set of $2\times2$ Normal-Form Games

    Authors: Luke Marris, Ian Gemp, Georgios Piliouras

    Abstract: Equilibrium solution concepts of normal-form games, such as Nash equilibria, correlated equilibria, and coarse correlated equilibria, describe the joint strategy profiles from which no player has incentive to unilaterally deviate. They are widely studied in game theory, economics, and multiagent systems. Equilibrium concepts are invariant under certain transforms of the payoffs. We define an equil… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: 42 pages

  7. arXiv:2302.06607  [pdf, other

    cs.GT

    Generative Adversarial Equilibrium Solvers

    Authors: Denizalp Goktas, David C. Parkes, Ian Gemp, Luke Marris, Georgios Piliouras, Romuald Elie, Guy Lever, Andrea Tacchetti

    Abstract: We introduce the use of generative adversarial learning to compute equilibria in general game-theoretic settings, specifically the generalized Nash equilibrium (GNE) in pseudo-games, and its specific instantiation as the competitive equilibrium (CE) in Arrow-Debreu competitive economies. Pseudo-games are a generalization of games in which players' actions affect not only the payoffs of other playe… ▽ More

    Submitted 20 February, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: 41 pages, 13 figures

  8. arXiv:2302.04440  [pdf, other

    cs.LG cs.CV

    Feature Likelihood Divergence: Evaluating the Generalization of Generative Models Using Samples

    Authors: Marco Jiralerspong, Avishek Joey Bose, Ian Gemp, Chongli Qin, Yoram Bachrach, Gauthier Gidel

    Abstract: The past few years have seen impressive progress in the development of deep generative models capable of producing high-dimensional, complex, and photo-realistic data. However, current methods for evaluating such models remain incomplete: standard likelihood-based metrics do not always apply and rarely correlate with perceptual fidelity, while sample-based metrics, such as FID, are insensitive to… ▽ More

    Submitted 12 March, 2024; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: FLD code: https://github.com/marcojira/fld

  9. arXiv:2302.00797  [pdf, other

    cs.AI cs.GT cs.LG cs.MA

    Combining Tree-Search, Generative Models, and Nash Bargaining Concepts in Game-Theoretic Reinforcement Learning

    Authors: Zun Li, Marc Lanctot, Kevin R. McKee, Luke Marris, Ian Gemp, Daniel Hennes, Paul Muller, Kate Larson, Yoram Bachrach, Michael P. Wellman

    Abstract: Multiagent reinforcement learning (MARL) has benefited significantly from population-based and game-theoretic training regimes. One approach, Policy-Space Response Oracles (PSRO), employs standard reinforcement learning to compute response policies via approximate best responses and combines them via meta-strategy selection. We augment PSRO by adding a novel search procedure with generative sampli… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  10. arXiv:2211.09622  [pdf, other

    cs.AI

    AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process

    Authors: Kevin Du, Ian Gemp, Yi Wu, Yingying Wu

    Abstract: Reinforcement learning has recently been used to approach well-known NP-hard combinatorial problems in graph theory. Among these problems, Hamiltonian cycle problems are exceptionally difficult to analyze, even when restricted to individual instances of structurally complex graphs. In this paper, we use Monte Carlo Tree Search (MCTS), the search algorithm behind many state-of-the-art reinforcement… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

  11. arXiv:2210.09257  [pdf, other

    cs.LG cs.AI cs.GT cs.MA

    Turbocharging Solution Concepts: Solving NEs, CEs and CCEs with Neural Equilibrium Solvers

    Authors: Luke Marris, Ian Gemp, Thomas Anthony, Andrea Tacchetti, Siqi Liu, Karl Tuyls

    Abstract: Solution concepts such as Nash Equilibria, Correlated Equilibria, and Coarse Correlated Equilibria are useful components for many multiagent machine learning algorithms. Unfortunately, solving a normal-form game could take prohibitive or non-deterministic time to converge, and could fail. We introduce the Neural Equilibrium Solver which utilizes a special equivariant neural network architecture to… ▽ More

    Submitted 15 April, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  12. arXiv:2210.02205  [pdf, other

    cs.GT cs.LG cs.MA

    Game Theoretic Rating in N-player general-sum games with Equilibria

    Authors: Luke Marris, Marc Lanctot, Ian Gemp, Shayegan Omidshafiei, Stephen McAleer, Jerome Connor, Karl Tuyls, Thore Graepel

    Abstract: Rating strategies in a game is an important area of research in game theory and artificial intelligence, and can be applied to any real-world competitive or cooperative setting. Traditionally, only transitive dependencies between strategies have been used to rate strategies (e.g. Elo), however recent work has expanded ratings to utilize game theoretic solutions to better rate strategies in non-tra… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

  13. arXiv:2209.10958  [pdf, ps, other

    cs.MA cs.AI

    Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments

    Authors: Ian Gemp, Thomas Anthony, Yoram Bachrach, Avishkar Bhoopchand, Kalesha Bullard, Jerome Connor, Vibhavari Dasagi, Bart De Vylder, Edgar Duenez-Guzman, Romuald Elie, Richard Everett, Daniel Hennes, Edward Hughes, Mina Khan, Marc Lanctot, Kate Larson, Guy Lever, Siqi Liu, Luke Marris, Kevin R. McKee, Paul Muller, Julien Perolat, Florian Strub, Andrea Tacchetti, Eugene Tarassov , et al. (2 additional authors not shown)

    Abstract: The Game Theory & Multi-Agent team at DeepMind studies several aspects of multi-agent learning ranging from computing approximations to fundamental concepts in game theory to simulating social dilemmas in rich spatial environments and training 3-d humanoids in difficult team coordination tasks. A signature aim of our group is to use the resources and expertise made available to us at DeepMind in d… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: Published in AI Communications 2022

  14. arXiv:2207.14589  [pdf, other

    stat.ML cs.LG

    Stochastic Parallelizable Eigengap Dilation for Large Graph Clustering

    Authors: Elise van der Pol, Ian Gemp, Yoram Bachrach, Richard Everett

    Abstract: Large graphs commonly appear in social networks, knowledge graphs, recommender systems, life sciences, and decision making problems. Summarizing large graphs by their high level properties is helpful in solving problems in these settings. In spectral clustering, we aim to identify clusters of nodes where most edges fall within clusters and only few edges fall between clusters. This task is importa… ▽ More

    Submitted 29 July, 2022; originally announced July 2022.

    Comments: Presented at the ICML 2022 Workshop on Topology, Algebra, andGeometry in Machine Learning

  15. arXiv:2206.04993  [pdf, other

    cs.LG cs.AI cs.GT stat.ML

    The Symmetric Generalized Eigenvalue Problem as a Nash Equilibrium

    Authors: Ian Gemp, Charlie Chen, Brian McWilliams

    Abstract: The symmetric generalized eigenvalue problem (SGEP) is a fundamental concept in numerical linear algebra. It captures the solution of many classical machine learning problems such as canonical correlation analysis, independent components analysis, partial least squares, linear discriminant analysis, principal components and others. Despite this, most general solvers are prohibitively expensive whe… ▽ More

    Submitted 25 April, 2023; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: Published in ICLR 2023 (JAX code available as part of github.com/deepmind/eigengame)

  16. arXiv:2106.01285  [pdf, other

    cs.GT

    Sample-based Approximation of Nash in Large Many-Player Games via Gradient Descent

    Authors: Ian Gemp, Rahul Savani, Marc Lanctot, Yoram Bachrach, Thomas Anthony, Richard Everett, Andrea Tacchetti, Tom Eccles, János Kramár

    Abstract: Nash equilibrium is a central concept in game theory. Several Nash solvers exist, yet none scale to normal-form games with many actions and many players, especially those with payoff tensors too big to be stored in memory. In this work, we propose an approach that iteratively improves an approximation to a Nash equilibrium through joint play. It accomplishes this by tracing a previously establishe… ▽ More

    Submitted 4 February, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: Published in AAMAS 2022 (code available as part of open_spiel on github -- search ADIDAS in repo)

  17. arXiv:2102.04152  [pdf, other

    stat.ML cs.AI cs.LG

    EigenGame Unloaded: When playing games is better than optimizing

    Authors: Ian Gemp, Brian McWilliams, Claire Vernade, Thore Graepel

    Abstract: We build on the recently proposed EigenGame that views eigendecomposition as a competitive game. EigenGame's updates are biased if computed using minibatches of data, which hinders convergence and more sophisticated parallelism in the stochastic setting. In this work, we propose an unbiased stochastic update that is asymptotically equivalent to EigenGame, enjoys greater parallelism allowing comput… ▽ More

    Submitted 22 March, 2022; v1 submitted 8 February, 2021; originally announced February 2021.

    Comments: Published in ICLR '22

  18. arXiv:2010.00575  [pdf, other

    cs.MA cs.GT

    D3C: Reducing the Price of Anarchy in Multi-Agent Learning

    Authors: Ian Gemp, Kevin R. McKee, Richard Everett, Edgar A. Duéñez-Guzmán, Yoram Bachrach, David Balduzzi, Andrea Tacchetti

    Abstract: In multiagent systems, the complex interaction of fixed incentives can lead agents to outcomes that are poor (inefficient) not only for the group, but also for each individual. Price of anarchy is a technical, game-theoretic definition that quantifies the inefficiency arising in these scenarios -- it compares the welfare that can be achieved through perfect coordination against that achieved by se… ▽ More

    Submitted 20 February, 2022; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: Published in AAMAS 2022

  19. arXiv:2010.00554  [pdf, other

    cs.LG stat.ML

    EigenGame: PCA as a Nash Equilibrium

    Authors: Ian Gemp, Brian McWilliams, Claire Vernade, Thore Graepel

    Abstract: We present a novel view on principal component analysis (PCA) as a competitive game in which each approximate eigenvector is controlled by a player whose goal is to maximize their own utility function. We analyze the properties of this PCA game and the behavior of its gradient based updates. The resulting algorithm -- which combines elements from Oja's rule with a generalized Gram-Schmidt orthogon… ▽ More

    Submitted 16 March, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: Published as a conference paper at International Conference on Learning Representations (ICLR) 2021

  20. arXiv:2006.04635  [pdf, other

    cs.LG cs.AI cs.GT cs.MA stat.ML

    Learning to Play No-Press Diplomacy with Best Response Policy Iteration

    Authors: Thomas Anthony, Tom Eccles, Andrea Tacchetti, János Kramár, Ian Gemp, Thomas C. Hudson, Nicolas Porcel, Marc Lanctot, Julien Pérolat, Richard Everett, Roman Werpachowski, Satinder Singh, Thore Graepel, Yoram Bachrach

    Abstract: Recent advances in deep reinforcement learning (RL) have led to considerable progress in many 2-player zero-sum games, such as Go, Poker and Starcraft. The purely adversarial nature of such games allows for conceptually simple and principled application of RL methods. However real-world settings are many-agent, and agent interactions are complex mixtures of common-interest and competitive aspects.… ▽ More

    Submitted 4 January, 2022; v1 submitted 8 June, 2020; originally announced June 2020.

  21. arXiv:2006.03976  [pdf, other

    cs.LG stat.ML

    Proximal Gradient Temporal Difference Learning: Stable Reinforcement Learning with Polynomial Sample Complexity

    Authors: Bo Liu, Ian Gemp, Mohammad Ghavamzadeh, Ji Liu, Sridhar Mahadevan, Marek Petrik

    Abstract: In this paper, we introduce proximal gradient temporal difference learning, which provides a principled way of designing and analyzing true stochastic gradient temporal difference learning algorithms. We show how gradient TD (GTD) reinforcement learning methods can be formally derived, not by starting from their original objective functions, as previously attempted, but rather from a primal-dual s… ▽ More

    Submitted 6 June, 2020; originally announced June 2020.

    Comments: Journal of Artificial Intelligence (JAIR)

  22. arXiv:2002.02325  [pdf, other

    cs.MA cs.AI

    Social diversity and social preferences in mixed-motive reinforcement learning

    Authors: Kevin R. McKee, Ian Gemp, Brian McWilliams, Edgar A. Duéñez-Guzmán, Edward Hughes, Joel Z. Leibo

    Abstract: Recent research on reinforcement learning in pure-conflict and pure-common interest games has emphasized the importance of population heterogeneity. In contrast, studies of reinforcement learning in mixed-motive games have primarily leveraged homogeneous approaches. Given the defining characteristic of mixed-motive games--the imperfect correlation of incentives between group members--we study the… ▽ More

    Submitted 12 February, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2020)

  23. arXiv:2001.04678  [pdf, other

    cs.LG cs.AI cs.GT cs.MA stat.ML

    Smooth markets: A basic mechanism for organizing gradient-based learners

    Authors: David Balduzzi, Wojciech M Czarnecki, Thomas W Anthony, Ian M Gemp, Edward Hughes, Joel Z Leibo, Georgios Piliouras, Thore Graepel

    Abstract: With the success of modern machine learning, it is becoming increasingly important to understand and control how learning algorithms interact. Unfortunately, negative results from game theory show there is little hope of understanding or controlling general n-player games. We therefore introduce smooth markets (SM-games), a class of n-player games with pairwise zero sum interactions. SM-games codi… ▽ More

    Submitted 18 January, 2020; v1 submitted 14 January, 2020; originally announced January 2020.

    Comments: 18 pages, 3 figures

    Journal ref: ICLR 2020

  24. arXiv:1808.01531  [pdf, other

    cs.LG stat.ML

    Global Convergence to the Equilibrium of GANs using Variational Inequalities

    Authors: Ian Gemp, Sridhar Mahadevan

    Abstract: In optimization, the negative gradient of a function denotes the direction of steepest descent. Furthermore, traveling in any direction orthogonal to the gradient maintains the value of the function. In this work, we show that these orthogonal directions that are ignored by gradient descent can be critical in equilibrium problems. Equilibrium problems have drawn heightened attention in machine lea… ▽ More

    Submitted 20 May, 2019; v1 submitted 4 August, 2018; originally announced August 2018.

  25. arXiv:1710.07328  [pdf, other

    cs.GT cs.LG math.OC

    Online Monotone Games

    Authors: Ian Gemp, Sridhar Mahadevan

    Abstract: Algorithmic game theory (AGT) focuses on the design and analysis of algorithms for interacting agents, with interactions rigorously formalized within the framework of games. Results from AGT find applications in domains such as online bidding auctions for web advertisements and network routing protocols. Monotone games are games where agent strategies naturally converge to an equilibrium state. Pr… ▽ More

    Submitted 19 October, 2017; originally announced October 2017.

  26. arXiv:1611.01673  [pdf, other

    cs.LG cs.MA cs.NE

    Generative Multi-Adversarial Networks

    Authors: Ishan Durugkar, Ian Gemp, Sridhar Mahadevan

    Abstract: Generative adversarial networks (GANs) are a framework for producing a generative model by way of a two-player minimax game. In this paper, we propose the \emph{Generative Multi-Adversarial Network} (GMAN), a framework that extends GANs to multiple discriminators. In previous work, the successful training of GANs requires modifying the minimax objective to accelerate training early on. In contrast… ▽ More

    Submitted 2 March, 2017; v1 submitted 5 November, 2016; originally announced November 2016.

    Comments: Accepted as a conference paper (poster) at ICLR 2017

  27. arXiv:1608.07888  [pdf, other

    cs.LG math.OC

    Online Monotone Optimization

    Authors: Ian Gemp, Sridhar Mahadevan

    Abstract: This paper presents a new framework for analyzing and designing no-regret algorithms for dynamic (possibly adversarial) systems. The proposed framework generalizes the popular online convex optimization framework and extends it to its natural limit allowing it to capture a notion of regret that is intuitive for more general problems such as those encountered in game theory and variational inequali… ▽ More

    Submitted 28 August, 2016; originally announced August 2016.

    Comments: 23 pages, 6 figures

  28. arXiv:1608.05983  [pdf, other

    cs.LG stat.ML

    Inverting Variational Autoencoders for Improved Generative Accuracy

    Authors: Ian Gemp, Ishan Durugkar, Mario Parente, M. Darby Dyar, Sridhar Mahadevan

    Abstract: Recent advances in semi-supervised learning with deep generative models have shown promise in generalizing from small labeled datasets ($\mathbf{x},\mathbf{y}$) to large unlabeled ones ($\mathbf{x}$). In the case where the codomain has known structure, a large unfeatured dataset ($\mathbf{y}$) is potentially available. We develop a parameter-efficient, deep semi-supervised generative model for the… ▽ More

    Submitted 24 August, 2017; v1 submitted 21 August, 2016; originally announced August 2016.

  29. arXiv:1405.6757  [pdf, other

    cs.LG

    Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces

    Authors: Sridhar Mahadevan, Bo Liu, Philip Thomas, Will Dabney, Steve Giguere, Nicholas Jacek, Ian Gemp, Ji Liu

    Abstract: In this paper, we set forth a new vision of reinforcement learning developed by us over the past few years, one that yields mathematically rigorous solutions to longstanding important questions that have remained unresolved: (i) how to design reliable, convergent, and robust reinforcement learning algorithms (ii) how to guarantee that reinforcement learning satisfies pre-specified "safety" guarant… ▽ More

    Submitted 26 May, 2014; originally announced May 2014.

    Comments: 121 pages