Zum Hauptinhalt springen

Showing 1–7 of 7 results for author: Ohmer, X

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.14649  [pdf, other

    cs.AI

    Emergent Language in Open-Ended Environments

    Authors: Cornelius Wolff, Julius Mayer, Elia Bruni, Xenia Ohmer

    Abstract: Emergent language research has made significant progress in recent years, but still largely fails to explore how communication emerges in more complex and situated multi-agent systems. Existing setups often employ a reference game, which limits the range of language emergence phenomena that can be studied, as the game consists of a single, purely language-based interaction between the agents. In t… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: 10 pages, 4 figures, 4 tables, preprint

  2. arXiv:2404.12145  [pdf, other

    cs.CL cs.AI

    From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency

    Authors: Xenia Ohmer, Elia Bruni, Dieuwke Hupkes

    Abstract: The staggering pace with which the capabilities of large language models (LLMs) are increasing, as measured by a range of commonly used natural language understanding (NLU) benchmarks, raises many questions regarding what "understanding" means for a language model and how it compares to human understanding. This is especially true since many LLMs are exclusively trained on text, casting doubt on w… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  3. arXiv:2311.09048  [pdf, other

    cs.CL

    GRASP: A novel benchmark for evaluating language GRounding And Situated Physics understanding in multimodal language models

    Authors: Serwan Jassim, Mario Holubar, Annika Richter, Cornelius Wolff, Xenia Ohmer, Elia Bruni

    Abstract: This paper presents GRASP, a novel benchmark to evaluate the language grounding and physical understanding capabilities of video-based multimodal large language models (LLMs). This evaluation is accomplished via a two-tier approach leveraging Unity simulations. The first level tests for language grounding by assessing a model's ability to relate simple textual descriptions with visual information.… ▽ More

    Submitted 6 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  4. arXiv:2309.12263  [pdf, other

    cs.CL

    On the Relationship between Skill Neurons and Robustness in Prompt Tuning

    Authors: Leon Ackermann, Xenia Ohmer

    Abstract: Prompt Tuning is a popular parameter-efficient finetuning method for pre-trained large language models (PLMs). Based on experiments with RoBERTa, it has been suggested that Prompt Tuning activates specific neurons in the transformer's feed-forward networks, that are highly predictive and selective for the given task. In this paper, we study the robustness of Prompt Tuning in relation to these "ski… ▽ More

    Submitted 25 March, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

  5. arXiv:2305.11662  [pdf, other

    cs.CL cs.AI

    Separating form and meaning: Using self-consistency to quantify task understanding across multiple senses

    Authors: Xenia Ohmer, Elia Bruni, Dieuwke Hupkes

    Abstract: At the staggering pace with which the capabilities of large language models (LLMs) are increasing, creating future-proof evaluation sets to assess their understanding becomes more and more challenging. In this paper, we propose a novel paradigm for evaluating LLMs which leverages the idea that correct world understanding should be consistent across different (Fregean) senses of the same meaning. A… ▽ More

    Submitted 20 December, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

  6. arXiv:2203.13176  [pdf, other

    cs.AI cs.CL

    Emergence of hierarchical reference systems in multi-agent communication

    Authors: Xenia Ohmer, Marko Duda, Elia Bruni

    Abstract: In natural language, referencing objects at different levels of specificity is a fundamental pragmatic mechanism for efficient communication in context. We develop a novel communication game, the hierarchical reference game, to study the emergence of such reference systems in artificial agents. We consider a simplified world, in which concepts are abstractions over a set of primitive attributes (e… ▽ More

    Submitted 15 September, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

  7. Mutual influence between language and perception in multi-agent communication games

    Authors: Xenia Ohmer, Michael Marino, Michael Franke, Peter König

    Abstract: Language interfaces with many other cognitive domains. This paper explores how interactions at these interfaces can be studied with deep learning methods, focusing on the relation between language emergence and visual perception. To model the emergence of language, a sender and a receiver agent are trained on a reference game. The agents are implemented as deep neural networks, with dedicated visi… ▽ More

    Submitted 17 October, 2022; v1 submitted 29 December, 2021; originally announced December 2021.