Zum Hauptinhalt springen

Showing 1–17 of 17 results for author: Litschko, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.17125  [pdf, other

    cs.CL cs.LG

    Behavioral Testing: Can Large Language Models Implicitly Resolve Ambiguous Entities?

    Authors: Anastasiia Sedova, Robert Litschko, Diego Frassinelli, Benjamin Roth, Barbara Plank

    Abstract: One of the major aspects contributing to the striking performance of large language models (LLMs) is the vast amount of factual knowledge accumulated during pre-training. Yet, many LLMs suffer from self-inconsistency, which raises doubts about their trustworthiness and reliability. In this paper, we focus on entity type ambiguity and analyze current state-of-the-art LLMs for their proficiency and… ▽ More

    Submitted 25 July, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

  2. arXiv:2407.01137  [pdf, other

    cs.CL cs.AI

    An Empirical Comparison of Generative Approaches for Product Attribute-Value Identification

    Authors: Kassem Sabeh, Robert Litschko, Mouna Kacimi, Barbara Plank, Johann Gamper

    Abstract: Product attributes are crucial for e-commerce platforms, supporting applications like search, recommendation, and question answering. The task of Product Attribute and Value Identification (PAVI) involves identifying both attributes and their values from product information. In this paper, we formulate PAVI as a generation task and provide, to the best of our knowledge, the most comprehensive eval… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2406.17600  [pdf, other

    cs.CL

    "Seeing the Big through the Small": Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?

    Authors: Beiduo Chen, Xinpeng Wang, Siyao Peng, Robert Litschko, Anna Korhonen, Barbara Plank

    Abstract: Human label variation (HLV) is a valuable source of information that arises when multiple human annotators provide different labels for valid reasons. In Natural Language Inference (NLI) earlier approaches to capturing HLV involve either collecting annotations from many crowd workers to represent human judgment distribution (HJD) or use expert linguists to provide detailed explanations for their c… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 22 pages, 9 figures

  4. arXiv:2404.02570  [pdf, other

    cs.CL

    MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness

    Authors: Shijia Zhou, Huangyan Shan, Barbara Plank, Robert Litschko

    Abstract: This paper presents our system developed for the SemEval-2024 Task 1: Semantic Textual Relatedness (STR), on Track C: Cross-lingual. The task aims to detect semantic relatedness of two sentences in a given target language without access to direct supervision (i.e. zero-shot cross-lingual transfer). To this end, we focus on different source language selection strategies on two different pre-trained… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  5. arXiv:2311.02025  [pdf, other

    cs.CL

    Vicinal Risk Minimization for Few-Shot Cross-lingual Transfer in Abusive Language Detection

    Authors: Gretel Liz De la Peña Sarracén, Paolo Rosso, Robert Litschko, Goran Glavaš, Simone Paolo Ponzetto

    Abstract: Cross-lingual transfer learning from high-resource to medium and low-resource languages has shown encouraging results. However, the scarcity of resources in target languages remains a challenge. In this work, we resort to data augmentation and continual pre-training for domain adaptation to improve cross-lingual abusive language detection. For data augmentation, we analyze two existing techniques… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP 2023 (Main Conference)

  6. arXiv:2310.05442  [pdf, other

    cs.CL

    Establishing Trustworthiness: Rethinking Tasks and Model Evaluation

    Authors: Robert Litschko, Max Müller-Eberstein, Rob van der Goot, Leon Weber, Barbara Plank

    Abstract: Language understanding is a multi-faceted cognitive capability, which the Natural Language Processing (NLP) community has striven to model computationally for decades. Traditionally, facets of linguistic intelligence have been compartmentalized into tasks with specialized model architectures and corresponding evaluation protocols. With the advent of large language models (LLMs) the community has w… ▽ More

    Submitted 23 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 (Main Conference), camera-ready

  7. arXiv:2309.01669  [pdf, other

    cs.CL

    Donkii: Can Annotation Error Detection Methods Find Errors in Instruction-Tuning Datasets?

    Authors: Leon Weber-Genzel, Robert Litschko, Ekaterina Artemova, Barbara Plank

    Abstract: Instruction tuning has become an integral part of training pipelines for Large Language Models (LLMs) and has been shown to yield strong performance gains. In an orthogonal line of research, Annotation Error Detection (AED) has emerged as a tool for detecting quality problems in gold standard labels. So far, however, the application of AED methods has been limited to classification tasks. It is an… ▽ More

    Submitted 22 February, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: Camera ready version for LAW-XVIII

  8. arXiv:2305.07016  [pdf, other

    cs.CL

    A General-Purpose Multilingual Document Encoder

    Authors: Onur Galoğlu, Robert Litschko, Goran Glavaš

    Abstract: Massively multilingual pretrained transformers (MMTs) have tremendously pushed the state of the art on multilingual NLP and cross-lingual transfer of NLP models in particular. While a large body of work leveraged MMTs to mine parallel data and induce bilingual document embeddings, much less effort has been devoted to training general-purpose (massively) multilingual document encoder that can be us… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

  9. arXiv:2305.05295  [pdf, other

    cs.CL cs.IR

    Boosting Zero-shot Cross-lingual Retrieval by Training on Artificially Code-Switched Data

    Authors: Robert Litschko, Ekaterina Artemova, Barbara Plank

    Abstract: Transferring information retrieval (IR) models from a high-resource language (typically English) to other languages in a zero-shot fashion has become a widely adopted approach. In this work, we show that the effectiveness of zero-shot rankers diminishes when queries and documents are present in different languages. Motivated by this, we propose to train ranking models on artificially code-switched… ▽ More

    Submitted 26 May, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings of ACL 2023

  10. arXiv:2205.14981  [pdf, other

    cs.CL

    ZusammenQA: Data Augmentation with Specialized Models for Cross-lingual Open-retrieval Question Answering System

    Authors: Chia-Chien Hung, Tommaso Green, Robert Litschko, Tornike Tsereteli, Sotaro Takeshita, Marco Bombieri, Goran Glavaš, Simone Paolo Ponzetto

    Abstract: This paper introduces our proposed system for the MIA Shared Task on Cross-lingual Open-retrieval Question Answering (COQA). In this challenging scenario, given an input question the system has to gather evidence documents from a multilingual pool and generate from them an answer in the language of the question. We devised several approaches combining different model variants for three main compon… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

  11. arXiv:2204.02292  [pdf, other

    cs.CL cs.IR

    Parameter-Efficient Neural Reranking for Cross-Lingual and Multilingual Retrieval

    Authors: Robert Litschko, Ivan Vulić, Goran Glavaš

    Abstract: State-of-the-art neural (re)rankers are notoriously data-hungry which -- given the lack of large-scale training data in languages other than English -- makes them rarely used in multilingual and cross-lingual retrieval settings. Current approaches therefore commonly transfer rankers trained on English data to other languages and cross-lingual setups by means of multilingual encoders: they fine-tun… ▽ More

    Submitted 16 September, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: COLING 2022

    ACM Class: H.3.3; I.2.7

  12. arXiv:2112.11031  [pdf, other

    cs.CL cs.IR

    On Cross-Lingual Retrieval with Multilingual Text Encoders

    Authors: Robert Litschko, Ivan Vulić, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: In this work we present a systematic empirical study focused on the suitability of the state-of-the-art multilingual encoders for cross-lingual document and sentence retrieval tasks across a number of diverse language pairs. We first treat these models as multilingual text encoders and benchmark their performance in unsupervised ad-hoc sentence- and document-level CLIR. In contrast to supervised l… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: to appear in IRJ ECIR 2021 Special Issue. arXiv admin note: substantial text overlap with arXiv:2101.08370

    ACM Class: H.3.3; I.2.7

  13. arXiv:2101.08370  [pdf, other

    cs.CL cs.IR

    Evaluating Multilingual Text Encoders for Unsupervised Cross-Lingual Retrieval

    Authors: Robert Litschko, Ivan Vulić, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Pretrained multilingual text encoders based on neural Transformer architectures, such as multilingual BERT (mBERT) and XLM, have achieved strong performance on a myriad of language understanding tasks. Consequently, they have been adopted as a go-to paradigm for multilingual and cross-lingual representation learning and transfer, rendering cross-lingual word embeddings (CLWEs) effectively obsolete… ▽ More

    Submitted 20 January, 2021; originally announced January 2021.

    Comments: accepted at ECIR'21 (preprint)

    ACM Class: H.3.3; I.2.7

  14. arXiv:2010.05731  [pdf, other

    cs.CL

    Probing Pretrained Language Models for Lexical Semantics

    Authors: Ivan Vulić, Edoardo Maria Ponti, Robert Litschko, Goran Glavaš, Anna Korhonen

    Abstract: The success of large pretrained language models (LMs) such as BERT and RoBERTa has sparked interest in probing their representations, in order to unveil what types of knowledge they implicitly capture. While prior research focused on morphosyntactic, semantic, and world knowledge, it remains unclear to which extent LMs also derive lexical type-level knowledge from words in context. In this work, w… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020: Long paper

  15. arXiv:2004.07642  [pdf, other

    cs.CL

    Towards Instance-Level Parser Selection for Cross-Lingual Transfer of Dependency Parsers

    Authors: Robert Litschko, Ivan Vulić, Željko Agić, Goran Glavaš

    Abstract: Current methods of cross-lingual parser transfer focus on predicting the best parser for a low-resource target language globally, that is, "at treebank level". In this work, we propose and argue for a novel cross-lingual transfer paradigm: instance-level parser selection (ILPS), and present a proof-of-concept study focused on instance-level selection in the framework of delexicalized parser transf… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

  16. arXiv:1902.00508  [pdf, other

    cs.CL

    How to (Properly) Evaluate Cross-Lingual Word Embeddings: On Strong Baselines, Comparative Analyses, and Some Misconceptions

    Authors: Goran Glavas, Robert Litschko, Sebastian Ruder, Ivan Vulic

    Abstract: Cross-lingual word embeddings (CLEs) enable multilingual modeling of meaning and facilitate cross-lingual transfer of NLP models. Despite their ubiquitous usage in downstream tasks, recent increasingly popular projection-based CLE models are almost exclusively evaluated on a single task only: bilingual lexicon induction (BLI). Even BLI evaluations vary greatly, hindering our ability to correctly i… ▽ More

    Submitted 1 February, 2019; originally announced February 2019.

    Journal ref: ACL 2019

  17. arXiv:1805.00879  [pdf, ps, other

    cs.CL

    Unsupervised Cross-Lingual Information Retrieval using Monolingual Data Only

    Authors: Robert Litschko, Goran Glavaš, Simone Paolo Ponzetto, Ivan Vulić

    Abstract: We propose a fully unsupervised framework for ad-hoc cross-lingual information retrieval (CLIR) which requires no bilingual data at all. The framework leverages shared cross-lingual word embedding spaces in which terms, queries, and documents can be represented, irrespective of their actual language. The shared embedding spaces are induced solely on the basis of monolingual corpora in two language… ▽ More

    Submitted 2 May, 2018; originally announced May 2018.

    Comments: accepted at SIGIR'18 (preprint)