On the Convergence of Model Free Learning in Mean Field Games

Elie, Romuald; Pérolat, Julien; Laurière, Mathieu; Geist, Matthieu; Pietquin, Olivier

Mathematics > Optimization and Control

arXiv:1907.02633 (math)

[Submitted on 4 Jul 2019 (v1), last revised 21 Feb 2020 (this version, v3)]

Title:On the Convergence of Model Free Learning in Mean Field Games

Authors:Romuald Elie, Julien Pérolat, Mathieu Laurière, Matthieu Geist, Olivier Pietquin

View PDF

Abstract:Learning by experience in Multi-Agent Systems (MAS) is a difficult and exciting task, due to the lack of stationarity of the environment, whose dynamics evolves as the population learns. In order to design scalable algorithms for systems with a large population of interacting agents (e.g. swarms), this paper focuses on Mean Field MAS, where the number of agents is asymptotically infinite. Recently, a very active burgeoning field studies the effects of diverse reinforcement learning algorithms for agents with no prior information on a stationary Mean Field Game (MFG) and learn their policy through repeated experience. We adopt a high perspective on this problem and analyze in full generality the convergence of a fictitious iterative scheme using any single agent learning algorithm at each step. We quantify the quality of the computed approximate Nash equilibrium, in terms of the accumulated errors arising at each learning iteration step. Notably, we show for the first time convergence of model free learning algorithms towards non-stationary MFG equilibria, relying only on classical assumptions on the MFG dynamics. We illustrate our theoretical results with a numerical experiment in a continuous action-space environment, where the approximate best response of the iterative fictitious play scheme is computed with a deep RL algorithm.

Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1907.02633 [math.OC]
	(or arXiv:1907.02633v3 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.1907.02633
Journal reference:	AAAI 2020 conference proceedings

Submission history

From: Romuald Elie [view email]
[v1] Thu, 4 Jul 2019 11:54:09 UTC (110 KB)
[v2] Fri, 20 Dec 2019 16:01:32 UTC (93 KB)
[v3] Fri, 21 Feb 2020 00:13:13 UTC (107 KB)

Mathematics > Optimization and Control

Title:On the Convergence of Model Free Learning in Mean Field Games

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:On the Convergence of Model Free Learning in Mean Field Games

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators