Modeling the formation of social conventions from embodied real-time interactions

Ismael T Freire; Clement Moulin-Frier; Marti Sanchez-Fibla; Xerxes D Arsiwalla; Paul F M J Verschure

doi:10.1371/journal.pone.0234434

Modeling the formation of social conventions from embodied real-time interactions

PLoS One. 2020 Jun 22;15(6):e0234434. doi: 10.1371/journal.pone.0234434. eCollection 2020.

Authors

Ismael T Freire¹, Clement Moulin-Frier², Marti Sanchez-Fibla³, Xerxes D Arsiwalla¹, Paul F M J Verschure^{1

4

5}

Affiliations

¹ SPECS Lab, Institute for Bioengineering of Catalonia, Barcelona, Spain.
² Flowers Team, Inria and Ensta ParisTech, Bordeaux, France.
³ AI-ML Group, ETIC, Universitat Pompeu Fabra, Barcelona, Spain.
⁴ Barcelona Institute of Science and Technology (BIST), Barcelona, Spain.
⁵ Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain.

Abstract

What is the role of real-time control and learning in the formation of social conventions? To answer this question, we propose a computational model that matches human behavioral data in a social decision-making game that was analyzed both in discrete-time and continuous-time setups. Furthermore, unlike previous approaches, our model takes into account the role of sensorimotor control loops in embodied decision-making scenarios. For this purpose, we introduce the Control-based Reinforcement Learning (CRL) model. CRL is grounded in the Distributed Adaptive Control (DAC) theory of mind and brain, where low-level sensorimotor control is modulated through perceptual and behavioral learning in a layered structure. CRL follows these principles by implementing a feedback control loop handling the agent's reactive behaviors (pre-wired reflexes), along with an Adaptive Layer that uses reinforcement learning to maximize long-term reward. We test our model in a multi-agent game-theoretic task in which coordination must be achieved to find an optimal solution. We show that CRL is able to reach human-level performance on standard game-theoretic metrics such as efficiency in acquiring rewards and fairness in reward distribution.

Publication types

Research Support, Non-U.S. Gov't
Video-Audio Media

MeSH terms

Computer Simulation
Decision Making / physiology*
Game Theory
Humans
Models, Psychological*
Reinforcement, Social*
Sensorimotor Cortex / physiology
Social Behavior*
Social Norms*

Grants and funding

PFMJV. This project has received funding from the European Union’s Horizon 2020 research and innovation programme, ID:820742 and ID:641321. MSF and CMF. This project has been supported by INSOCO-DPI2016-80116-P. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.