Zum Hauptinhalt springen

Showing 1–11 of 11 results for author: Goldfarb-Tarrant, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18682  [pdf, other

    cs.CL cs.AI cs.LG

    The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm

    Authors: Aakanksha, Arash Ahmadian, Beyza Ermis, Seraphina Goldfarb-Tarrant, Julia Kreutzer, Marzieh Fadaee, Sara Hooker

    Abstract: A key concern with the concept of "alignment" is the implicit question of "alignment to what?". AI systems are increasingly used across the world, yet safety alignment is often focused on homogeneous monolingual settings. Additionally, preference training and safety measures often overfit to harms common in Western-centric datasets. Here, we explore the viability of different alignment approaches… ▽ More

    Submitted 8 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2406.15352  [pdf, other

    cs.CL

    A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick

    Authors: Nishant Balepur, Matthew Shu, Alexander Hoyle, Alison Robey, Shi Feng, Seraphina Goldfarb-Tarrant, Jordan Boyd-Graber

    Abstract: Keyword mnemonics are memorable explanations that link new terms to simpler keywords. Prior works generate mnemonics for students, but they do not guide models toward mnemonics students prefer and aid learning. We build SMART, a mnemonic generator trained on feedback from real students learning new terms. To train SMART, we first fine-tune LLaMA-2 on a curated set of user-written mnemonics. We the… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: In-Progress Preprint

  3. arXiv:2402.15925  [pdf, other

    cs.CL cs.AI cs.IR

    MultiContrievers: Analysis of Dense Retrieval Representations

    Authors: Seraphina Goldfarb-Tarrant, Pedro Rodriguez, Jane Dwivedi-Yu, Patrick Lewis

    Abstract: Dense retrievers compress source documents into (possibly lossy) vector representations, yet there is little analysis of what information is lost versus preserved, and how it affects downstream tasks. We conduct the first analysis of the information captured by dense retrievers compared to the language models they are based on (e.g., BERT versus Contriever). We use 25 MultiBert checkpoints as rand… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  4. arXiv:2305.12757  [pdf, other

    cs.CL

    This Prompt is Measuring <MASK>: Evaluating Bias Evaluation in Language Models

    Authors: Seraphina Goldfarb-Tarrant, Eddie Ungless, Esma Balkir, Su Lin Blodgett

    Abstract: Bias research in NLP seeks to analyse models for social biases, thus helping NLP practitioners uncover, measure, and mitigate social harms. We analyse the body of work that uses prompts and templates to assess bias in language models. We draw on a measurement modelling framework to create a taxonomy of attributes that capture what a bias test aims to measure and how that measurement is carried out… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL Findings 2023

  5. arXiv:2305.12709  [pdf, other

    cs.CL

    Cross-lingual Transfer Can Worsen Bias in Sentiment Analysis

    Authors: Seraphina Goldfarb-Tarrant, Björn Ross, Adam Lopez

    Abstract: Sentiment analysis (SA) systems are widely deployed in many of the world's languages, and there is well-documented evidence of demographic bias in these systems. In languages beyond English, scarcer training data is often supplemented with transfer learning using pre-trained models, including multilingual models trained on other languages. In some cases, even supervision data comes from other lang… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 8 pages, preprint

  6. arXiv:2305.11673  [pdf, other

    cs.CL

    Bias Beyond English: Counterfactual Tests for Bias in Sentiment Analysis in Four Languages

    Authors: Seraphina Goldfarb-Tarrant, Adam Lopez, Roi Blanco, Diego Marcheggiani

    Abstract: Sentiment analysis (SA) systems are used in many products and hundreds of languages. Gender and racial biases are well-studied in English SA systems, but understudied in other languages, with few resources for such studies. To remedy this, we build a counterfactual evaluation corpus for gender and racial/migrant bias in four languages. We demonstrate its usefulness by answering a simple but import… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: 5 pages, accepted to Findings of ACL 2023

  7. arXiv:2204.06827  [pdf, other

    cs.CL

    How Gender Debiasing Affects Internal Model Representations, and Why It Matters

    Authors: Hadas Orgad, Seraphina Goldfarb-Tarrant, Yonatan Belinkov

    Abstract: Common studies of gender bias in NLP focus either on extrinsic bias measured by model performance on a downstream task or on intrinsic bias found in models' internal representations. However, the relationship between extrinsic and intrinsic bias is relatively unknown. In this work, we illuminate this relationship by measuring both quantities together: we debias a model during downstream fine-tunin… ▽ More

    Submitted 16 May, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted to NAACL 2022

    MSC Class: 68T50 ACM Class: I.2.7

  8. arXiv:2012.15859  [pdf, other

    cs.CL

    Intrinsic Bias Metrics Do Not Correlate with Application Bias

    Authors: Seraphina Goldfarb-Tarrant, Rebecca Marchant, Ricardo Muñoz Sanchez, Mugdha Pandya, Adam Lopez

    Abstract: Natural Language Processing (NLP) systems learn harmful societal biases that cause them to amplify inequality as they are deployed in more and more situations. To guide efforts at debiasing these systems, the NLP community relies on a variety of metrics that quantify bias in models. Some of these metrics are intrinsic, measuring bias in word embedding spaces, and some are extrinsic, measuring bias… ▽ More

    Submitted 8 June, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

    Comments: In Proceedings of ACL 2021, 9 pages

  9. arXiv:2010.04665  [pdf, other

    cs.CL cs.IR

    Scaling Systematic Literature Reviews with Machine Learning Pipelines

    Authors: Seraphina Goldfarb-Tarrant, Alexander Robertson, Jasmina Lazic, Theodora Tsouloufi, Louise Donnison, Karen Smyth

    Abstract: Systematic reviews, which entail the extraction of data from large numbers of scientific documents, are an ideal avenue for the application of machine learning. They are vital to many fields of science and philanthropy, but are very time-consuming and require experts. Yet the three main stages of a systematic review are easily done automatically: searching for documents can be done via APIs and sc… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: In EMNLP 2020 Scholarly Document Processing Workshop

  10. arXiv:2009.09870  [pdf, other

    cs.CL cs.AI

    Content Planning for Neural Story Generation with Aristotelian Rescoring

    Authors: Seraphina Goldfarb-Tarrant, Tuhin Chakrabarty, Ralph Weischedel, Nanyun Peng

    Abstract: Long-form narrative text generated from large language models manages a fluent impersonation of human writing, but only at the local sentence level, and lacks structure or global cohesion. We posit that many of the problems of story generation can be addressed via high-quality content planning, and present a system that focuses on how to learn good plot structures to guide story generation. We uti… ▽ More

    Submitted 9 October, 2020; v1 submitted 21 September, 2020; originally announced September 2020.

    Comments: EMNLP 2020, 9 pages

  11. arXiv:1904.02357  [pdf, other

    cs.CL

    Plan, Write, and Revise: an Interactive System for Open-Domain Story Generation

    Authors: Seraphina Goldfarb-Tarrant, Haining Feng, Nanyun Peng

    Abstract: Story composition is a challenging problem for machines and even for humans. We present a neural narrative generation system that interacts with humans to generate stories. Our system has different levels of human interaction, which enables us to understand at what stage of story-writing human collaboration is most productive, both to improving story quality and human engagement in the writing pro… ▽ More

    Submitted 31 May, 2019; v1 submitted 4 April, 2019; originally announced April 2019.

    Comments: Accepted to NAACL 2019 Demo Track, 5 pages