Zum Hauptinhalt springen

Showing 1–8 of 8 results for author: Baier, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.16312  [pdf, other

    cs.MA cs.AI cs.GT

    MOMAland: A Set of Benchmarks for Multi-Objective Multi-Agent Reinforcement Learning

    Authors: Florian Felten, Umut Ucak, Hicham Azmani, Gao Peng, Willem Röpke, Hendrik Baier, Patrick Mannion, Diederik M. Roijers, Jordan K. Terry, El-Ghazali Talbi, Grégoire Danoy, Ann Nowé, Roxana Rădulescu

    Abstract: Many challenging tasks such as managing traffic systems, electricity grids, or supply chains involve complex decision-making processes that must balance multiple conflicting objectives and coordinate the actions of various independent decision-makers (DMs). One perspective for formalising and addressing such tasks is multi-objective multi-agent reinforcement learning (MOMARL). MOMARL broadens rein… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  2. arXiv:2407.10820  [pdf, other

    cs.AI

    Enabling MCTS Explainability for Sequential Planning Through Computation Tree Logic

    Authors: Ziyan An, Hendrik Baier, Abhishek Dubey, Ayan Mukhopadhyay, Meiyi Ma

    Abstract: Monte Carlo tree search (MCTS) is one of the most capable online search algorithms for sequential planning tasks, with significant applications in areas such as resource allocation and transit planning. Despite its strong performance in real-world deployment, the inherent complexity of MCTS makes it challenging to understand for users without technical background. This paper considers the use of M… ▽ More

    Submitted 16 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: Accepted by the Proceedings of the 27th European Conference on Artificial Intelligence (ECAI)

  3. arXiv:2401.03197  [pdf, other

    cs.AI cs.LG

    Decision Making in Non-Stationary Environments with Policy-Augmented Search

    Authors: Ava Pettet, Yunuo Zhang, Baiting Luo, Kyle Wray, Hendrik Baier, Aron Laszka, Abhishek Dubey, Ayan Mukhopadhyay

    Abstract: Sequential decision-making under uncertainty is present in many important problems. Two popular approaches for tackling such problems are reinforcement learning and online search (e.g., Monte Carlo tree search). While the former learns a policy by interacting with the environment (typically done before execution), the latter uses a generative model of the environment to sample promising action tra… ▽ More

    Submitted 20 January, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: Extended Abstract accepted for presentation at AAMAS 2024

  4. arXiv:2208.11367  [pdf, other

    cs.CR cs.LG

    Combining AI and AM - Improving Approximate Matching through Transformer Networks

    Authors: Frieder Uhlig, Lukas Struppek, Dominik Hintersdorf, Thomas Göbel, Harald Baier, Kristian Kersting

    Abstract: Approximate matching (AM) is a concept in digital forensics to determine the similarity between digital artifacts. An important use case of AM is the reliable and efficient detection of case-relevant data structures on a blacklist, if only fragments of the original are available. For instance, if only a cluster of indexed malware is still present during the digital forensic investigation, the AM a… ▽ More

    Submitted 27 April, 2023; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: Published at DFRWS USA 2023 as a conference paper

  5. arXiv:2206.00113  [pdf, other

    cs.AI cs.GT

    BRExIt: On Opponent Modelling in Expert Iteration

    Authors: Daniel Hernandez, Hendrik Baier, Michael Kaisers

    Abstract: Finding a best response policy is a central objective in game theory and multi-agent learning, with modern population-based training approaches employing reinforcement learning algorithms as best-response oracles to improve play against candidate opponents (typically previously learnt policies). We propose Best Response Expert Iteration (BRExIt), which accelerates learning in games by incorporatin… ▽ More

    Submitted 25 April, 2023; v1 submitted 31 May, 2022; originally announced June 2022.

  6. arXiv:2201.11404  [pdf, other

    cs.AI

    Online Planning in POMDPs with Self-Improving Simulators

    Authors: Jinke He, Miguel Suau, Hendrik Baier, Michael Kaisers, Frans A. Oliehoek

    Abstract: How can we plan efficiently in a large and complex environment when the time budget is limited? Given the original simulator of the environment, which may be computationally very demanding, we propose to learn online an approximate but much faster simulator that improves over time. To plan reliably and efficiently while the approximate simulator is learning, we develop a method that adaptively dec… ▽ More

    Submitted 12 December, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: presented at IJCAI 2022

  7. arXiv:1808.01262  [pdf, ps, other

    cs.AI

    The Text-Based Adventure AI Competition

    Authors: Timothy Atkinson, Hendrik Baier, Tara Copplestone, Sam Devlin, Jerry Swan

    Abstract: In 2016, 2017, and 2018 at the IEEE Conference on Computational Intelligence in Games, the authors of this paper ran a competition for agents that can play classic text-based adventure games. This competition fills a gap in existing game AI competitions that have typically focussed on traditional card/board games or modern video games with graphical interfaces. By providing a platform for evaluati… ▽ More

    Submitted 24 January, 2019; v1 submitted 3 August, 2018; originally announced August 2018.

    Comments: updated to journal version

    MSC Class: 68T50

  8. arXiv:0911.2174  [pdf, other

    hep-lat cs.AR

    QPACE -- a QCD parallel computer based on Cell processors

    Authors: H. Baier, H. Boettiger, M. Drochner, N. Eicker, U. Fischer, Z. Fodor, A. Frommer, C. Gomez, G. Goldrian, S. Heybrock, D. Hierl, M. Hüsken, T. Huth, B. Krill, J. Lauritsen, T. Lippert, T. Maurer, B. Mendl, N. Meyer, A. Nobile, I. Ouda, M. Pivanti, D. Pleiter, M. Ries, A. Schäfer , et al. (10 additional authors not shown)

    Abstract: QPACE is a novel parallel computer which has been developed to be primarily used for lattice QCD simulations. The compute power is provided by the IBM PowerXCell 8i processor, an enhanced version of the Cell processor that is used in the Playstation 3. The QPACE nodes are interconnected by a custom, application optimized 3-dimensional torus network implemented on an FPGA. To achieve the very hig… ▽ More

    Submitted 23 December, 2009; v1 submitted 11 November, 2009; originally announced November 2009.

    Comments: 21 pages. Poster by T. Maurer and plenary talk by D. Pleiter presented at the "XXVII International Symposium on Lattice Field Theory", July 26-31 2009, Peking University, Beijing, China. Information on recent Green500 ranking added and list of authors extended

    Journal ref: PoS LAT2009:001,2009