Zum Hauptinhalt springen

Showing 1–15 of 15 results for author: Choenni, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.16482  [pdf, other

    cs.CL

    Self-Alignment: Improving Alignment of Cultural Values in LLMs via In-Context Learning

    Authors: Rochelle Choenni, Ekaterina Shutova

    Abstract: Improving the alignment of Large Language Models (LLMs) with respect to the cultural values that they encode has become an increasingly important topic. In this work, we study whether we can exploit existing knowledge about cultural values at inference time to adjust model responses to cultural value probes. We present a simple and inexpensive method that uses a combination of in-context learning… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

  2. arXiv:2406.14267  [pdf, other

    cs.CL cs.AI

    On the Evaluation Practices in Multilingual NLP: Can Machine Translation Offer an Alternative to Human Translations?

    Authors: Rochelle Choenni, Sara Rajaee, Christof Monz, Ekaterina Shutova

    Abstract: While multilingual language models (MLMs) have been trained on 100+ languages, they are typically only evaluated across a handful of them due to a lack of available test data in most languages. This is particularly problematic when assessing MLM's potential for low-resource and unseen languages. In this paper, we present an analysis of existing evaluation frameworks in multilingual NLP, discuss th… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  3. arXiv:2405.12744  [pdf, other

    cs.CL

    The Echoes of Multilinguality: Tracing Cultural Value Shifts during LM Fine-tuning

    Authors: Rochelle Choenni, Anne Lauscher, Ekaterina Shutova

    Abstract: Texts written in different languages reflect different culturally-dependent beliefs of their writers. Thus, we expect multilingual LMs (MLMs), that are jointly trained on a concatenation of text in multiple languages, to encode different cultural values for each language. Yet, as the 'multilinguality' of these LMs is driven by cross-lingual sharing, we also have reason to belief that cultural valu… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  4. arXiv:2403.11810  [pdf, other

    cs.CL

    Metaphor Understanding Challenge Dataset for LLMs

    Authors: Xiaoyu Tong, Rochelle Choenni, Martha Lewis, Ekaterina Shutova

    Abstract: Metaphors in natural language are a reflection of fundamental cognitive processes such as analogical reasoning and categorisation, and are deeply rooted in everyday communication. Metaphor understanding is therefore an essential task for large language models (LLMs). We release the Metaphor Understanding Challenge Dataset (MUNCH), designed to evaluate the metaphor understanding capabilities of LLM… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  5. arXiv:2311.08273  [pdf, other

    cs.CL

    Examining Modularity in Multilingual LMs via Language-Specialized Subnetworks

    Authors: Rochelle Choenni, Ekaterina Shutova, Dan Garrette

    Abstract: Recent work has proposed explicitly inducing language-wise modularity in multilingual LMs via sparse fine-tuning (SFT) on per-language subnetworks as a means of better guiding cross-lingual sharing. In this work, we investigate (1) the degree to which language-wise modularity naturally arises within models with no special modularity interventions, and (2) how cross-lingual sharing and interference… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  6. arXiv:2310.20384  [pdf, other

    cs.CL cs.AI

    Do large language models solve verbal analogies like children do?

    Authors: Claire E. Stevenson, Mathilde ter Veen, Rochelle Choenni, Han L. J. van der Maas, Ekaterina Shutova

    Abstract: Analogy-making lies at the heart of human cognition. Adults solve analogies such as \textit{Horse belongs to stable like chicken belongs to ...?} by mapping relations (\textit{kept in}) and answering \textit{chicken coop}. In contrast, children often use association, e.g., answering \textit{egg}. This paper investigates whether large language models (LLMs) solve verbal analogies in A:B::C:? form u… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  7. arXiv:2310.18696  [pdf, other

    cs.CL cs.AI cs.LG

    Probing LLMs for Joint Encoding of Linguistic Categories

    Authors: Giulio Starace, Konstantinos Papakostas, Rochelle Choenni, Apostolos Panagiotopoulos, Matteo Rosati, Alina Leidinger, Ekaterina Shutova

    Abstract: Large Language Models (LLMs) exhibit impressive performance on a range of NLP tasks, due to the general-purpose linguistic knowledge acquired during pretraining. Existing model interpretability research (Tenney et al., 2019) suggests that a linguistic hierarchy emerges in the LLM layers, with lower layers better suited to solving syntactic tasks and higher layers employed for semantic processing.… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted in EMNLP Findings 2023

  8. arXiv:2305.13286  [pdf, other

    cs.CL

    How do languages influence each other? Studying cross-lingual data sharing during LM fine-tuning

    Authors: Rochelle Choenni, Dan Garrette, Ekaterina Shutova

    Abstract: Multilingual large language models (MLLMs) are jointly trained on data from many different languages such that representation of individual languages can benefit from other languages' data. Impressive performance on zero-shot cross-lingual transfer shows that these models are capable of exploiting data from other languages. Yet, it remains unclear to what extent, and under which conditions, langua… ▽ More

    Submitted 21 May, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

  9. arXiv:2211.00106  [pdf, other

    cs.CL

    Data-Efficient Cross-Lingual Transfer with Language-Specific Subnetworks

    Authors: Rochelle Choenni, Dan Garrette, Ekaterina Shutova

    Abstract: Large multilingual language models typically share their parameters across all languages, which enables cross-lingual task transfer, but learning can also be hindered when training updates from different languages are in conflict. In this paper, we propose novel methods for using language-specific subnetworks, which control cross-lingual parameter sharing, to reduce conflicts and increase positive… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

  10. arXiv:2109.10052  [pdf, other

    cs.CL

    Stepmothers are mean and academics are pretentious: What do pretrained language models learn about you?

    Authors: Rochelle Choenni, Ekaterina Shutova, Robert van Rooij

    Abstract: In this paper, we investigate what types of stereotypical information are captured by pretrained language models. We present the first dataset comprising stereotypical attributes of a range of social groups and propose a method to elicit stereotypes encoded by pretrained language models in an unsupervised fashion. Moreover, we link the emergent stereotypes to their manifestation as basic emotions… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

  11. arXiv:2010.12825  [pdf, other

    cs.CL

    Cross-neutralising: Probing for joint encoding of linguistic information in multilingual models

    Authors: Rochelle Choenni, Ekaterina Shutova

    Abstract: Multilingual sentence encoders are widely used to transfer NLP models across languages. The success of this transfer is, however, dependent on the model's ability to encode the patterns of cross-lingual similarity and variation. Yet, little is known as to how these models are able to do this. We propose a simple method to study how relationships between languages are encoded in two state-of-the-ar… ▽ More

    Submitted 13 March, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

  12. arXiv:2009.12862  [pdf, other

    cs.CL

    What does it mean to be language-agnostic? Probing multilingual sentence encoders for typological properties

    Authors: Rochelle Choenni, Ekaterina Shutova

    Abstract: Multilingual sentence encoders have seen much success in cross-lingual model transfer for downstream NLP tasks. Yet, we know relatively little about the properties of individual languages or the general patterns of linguistic variation that they encode. We propose methods for probing sentence representations from state-of-the-art multilingual encoders (LASER, M-BERT, XLM and XLM-R) with respect to… ▽ More

    Submitted 27 September, 2020; originally announced September 2020.

  13. arXiv:1906.01539  [pdf, other

    cs.AI cs.CL q-bio.NC

    Blackbox meets blackbox: Representational Similarity and Stability Analysis of Neural Language Models and Brains

    Authors: Samira Abnar, Lisa Beinborn, Rochelle Choenni, Willem Zuidema

    Abstract: In this paper, we define and apply representational stability analysis (ReStA), an intuitive way of analyzing neural language models. ReStA is a variant of the popular representational similarity analysis (RSA) in cognitive neuroscience. While RSA can be used to compare representations in models, model components, and human brains, ReStA compares instances of the same model, while systematically v… ▽ More

    Submitted 5 June, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Journal ref: 2nd BlackBoxNLP workshop @ACL2019

  14. arXiv:1904.10820  [pdf, other

    cs.CL cs.AI

    Semantic Drift in Multilingual Representations

    Authors: Lisa Beinborn, Rochelle Choenni

    Abstract: Multilingual representations have mostly been evaluated based on their performance on specific tasks. In this article, we look beyond engineering goals and analyze the relations between languages in computational representations. We introduce a methodology for comparing languages based on their organization of semantic concepts. We propose to conduct an adapted version of representational similari… ▽ More

    Submitted 16 November, 2020; v1 submitted 24 April, 2019; originally announced April 2019.

    Comments: Almost final version. Paper will appear in the Computational Linguistics Journal, Volume 46, Issue 3

  15. arXiv:1904.02547  [pdf, other

    cs.CL cs.AI cs.LG

    Robust Evaluation of Language-Brain Encoding Experiments

    Authors: Lisa Beinborn, Samira Abnar, Rochelle Choenni

    Abstract: Language-brain encoding experiments evaluate the ability of language models to predict brain responses elicited by language stimuli. The evaluation scenarios for this task have not yet been standardized which makes it difficult to compare and interpret results. We perform a series of evaluation experiments with a consistent encoding setup and compute the results for multiple fMRI datasets. In addi… ▽ More

    Submitted 4 April, 2019; originally announced April 2019.