Showing 1–3 of 3 results for author: Santos, P P

Searching in archive cs.
  1. arXiv:2210.06274

    cs.LG

    Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning

    Authors: Pedro P. Santos, Diogo S. Carvalho, Miguel Vasco, Alberto Sardinha, Pedro A. Santos, Ana Paiva, Francisco S. Melo

    Abstract: We introduce hybrid execution in multi-agent reinforcement learning (MARL), a new paradigm in which agents aim to successfully complete cooperative tasks with arbitrary communication levels at execution time by taking advantage of information-sharing among the agents. Under hybrid execution, the communication level can range from a setting in which no communication is allowed between agents (fully…

    Submitted 5 June, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

  2. arXiv:2111.11758

    cs.LG

    The Impact of Data Distribution on Q-learning with Function Approximation

    Authors: Pedro P. Santos, Diogo S. Carvalho, Alberto Sardinha, Francisco S. Melo

    Abstract: We study the interplay between the data distribution and Q-learning-based algorithms with function approximation. We provide a unified theoretical and empirical analysis as to how different properties of the data distribution influence the performance of Q-learning-based algorithms. We connect different lines of research, as well as validate and extend previous results. We start by reviewing theor…

    Submitted 10 February, 2023; v1 submitted 23 November, 2021; originally announced November 2021.

  3. arXiv:2101.09614

    eess.SY cs.LG

    A Methodology for the Development of RL-Based Adaptive Traffic Signal Controllers

    Authors: Guilherme S. Varela, Pedro P. Santos, Alberto Sardinha, Francisco S. Melo

    Abstract: This article proposes a methodology for the development of adaptive traffic signal controllers using reinforcement learning. Our methodology addresses the lack of standardization in the literature that renders the comparison of approaches in different works meaningless, due to differences in metrics, environments, and even experimental design and methodology. The proposed methodology thus comprise…

    Submitted 23 January, 2021; originally announced January 2021.