Zum Hauptinhalt springen

Showing 1–16 of 16 results for author: Abercrombie, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01294  [pdf, other

    cs.LG cs.AI cs.CY

    A Collaborative, Human-Centred Taxonomy of AI, Algorithmic, and Automation Harms

    Authors: Gavin Abercrombie, Djalel Benbouzid, Paolo Giudici, Delaram Golpayegani, Julio Hernandez, Pierre Noro, Harshvardhan Pandit, Eva Paraschou, Charlie Pownall, Jyoti Prajapati, Mark A. Sayre, Ushnish Sengupta, Arthit Suriyawongkul, Ruby Thelot, Sofia Vei, Laura Waltersdorfer

    Abstract: This paper introduces a collaborative, human-centered taxonomy of AI, algorithmic and automation harms. We argue that existing taxonomies, while valuable, can be narrow, unclear, typically cater to practitioners and government, and often overlook the needs of the wider public. Drawing on existing taxonomies and a large repository of documented incidents, we propose a taxonomy that is clear and und… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2403.20103  [pdf, other

    cs.CL

    NLP for Counterspeech against Hate: A Survey and How-To Guide

    Authors: Helena Bonaldi, Yi-Ling Chung, Gavin Abercrombie, Marco Guerini

    Abstract: In recent years, counterspeech has emerged as one of the most promising strategies to fight online hate. These non-escalatory responses tackle online abuse while preserving the freedom of speech of the users, and can have a tangible impact in reducing online and offline violence. Recently, there has been growing interest from the Natural Language Processing (NLP) community in addressing the challe… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: To appear in Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics (findings)

  3. arXiv:2403.03121  [pdf, other

    cs.CL

    Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution

    Authors: Flor Miriam Plaza-del-Arco, Amanda Cercas Curry, Alba Curry, Gavin Abercrombie, Dirk Hovy

    Abstract: Large language models (LLMs) reflect societal norms and biases, especially about gender. While societal biases and stereotypes have been extensively researched in various NLP applications, there is a surprising gap for emotion analysis. However, emotion and gender are closely linked in societal discourse. E.g., women are often thought of as more empathetic, while men's anger is more socially accep… ▽ More

    Submitted 28 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted to ACL 2024

  4. arXiv:2403.02268  [pdf, other

    cs.CL cs.AI cs.CY

    Subjective $\textit{Isms}$? On the Danger of Conflating Hate and Offence in Abusive Language Detection

    Authors: Amanda Cercas Curry, Gavin Abercrombie, Zeerak Talat

    Abstract: Natural language processing research has begun to embrace the notion of annotator subjectivity, motivated by variations in labelling. This approach understands each annotator's view as valid, which can be highly suitable for tasks that embed subjectivity, e.g., sentiment analysis. However, this construction may be inappropriate for tasks such as hate speech detection, as it affords equal validity… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  5. arXiv:2307.04761  [pdf, other

    cs.CL cs.AI cs.CY

    Understanding Counterspeech for Online Harm Mitigation

    Authors: Yi-Ling Chung, Gavin Abercrombie, Florence Enock, Jonathan Bright, Verena Rieser

    Abstract: Counterspeech offers direct rebuttals to hateful speech by challenging perpetrators of hate and showing support to targets of abuse. It provides a promising alternative to more contentious measures, such as content moderation and deplatforming, by contributing a greater amount of positive online speech rather than attempting to mitigate harmful content through removal. Advances in the development… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: 21 pages, 2 figures, 2 tables

  6. arXiv:2305.09800  [pdf, other

    cs.CL

    Mirages: On Anthropomorphism in Dialogue Systems

    Authors: Gavin Abercrombie, Amanda Cercas Curry, Tanvi Dinkar, Verena Rieser, Zeerak Talat

    Abstract: Automated dialogue or conversational systems are anthropomorphised by developers and personified by users. While a degree of anthropomorphism may be inevitable due to the choice of medium, conscious and unconscious design choices can guide users to personify such systems to varying degrees. Encouraging users to relate to automated systems as if they were human can lead to high risk scenarios cause… ▽ More

    Submitted 23 October, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted for publication at EMNLP. See ACL Anthology for published version

  7. arXiv:2305.09281  [pdf, other

    cs.CL

    On the Origins of Bias in NLP through the Lens of the Jim Code

    Authors: Fatma Elsafoury, Gavin Abercrombie

    Abstract: In this paper, we trace the biases in current natural language processing (NLP) models back to their origins in racism, sexism, and homophobia over the last 500 years. We review literature from critical race theory, gender studies, data ethics, and digital humanities studies, and summarize the origins of bias in NLP models from these social science perspective. We show how the causes of the biases… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 10 pages

  8. arXiv:2305.06074  [pdf, other

    cs.CL cs.LG

    iLab at SemEval-2023 Task 11 Le-Wi-Di: Modelling Disagreement or Modelling Perspectives?

    Authors: Nikolas Vitsakis, Amit Parekh, Tanvi Dinkar, Gavin Abercrombie, Ioannis Konstas, Verena Rieser

    Abstract: There are two competing approaches for modelling annotator disagreement: distributional soft-labelling approaches (which aim to capture the level of disagreement) or modelling perspectives of individual annotators or groups thereof. We adapt a multi-task architecture -- which has previously shown success in modelling perspectives -- to evaluate its performance on the SEMEVAL Task 11. We do so by c… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: To appear in the Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023). Association for Computational Linguistics, 2023

  9. arXiv:2305.01633  [pdf, other

    cs.CL

    Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP

    Authors: Anya Belz, Craig Thomson, Ehud Reiter, Gavin Abercrombie, Jose M. Alonso-Moral, Mohammad Arvan, Anouck Braggaar, Mark Cieliebak, Elizabeth Clark, Kees van Deemter, Tanvi Dinkar, Ondřej Dušek, Steffen Eger, Qixiang Fang, Mingqi Gao, Albert Gatt, Dimitra Gkatzia, Javier González-Corbelle, Dirk Hovy, Manuela Hürlimann, Takumi Ito, John D. Kelleher, Filip Klubicka, Emiel Krahmer, Huiyuan Lai , et al. (17 additional authors not shown)

    Abstract: We report our efforts in identifying a set of previous human evaluations in NLP that would be suitable for a coordinated study examining what makes human evaluations in NLP more/less reproducible. We present our results and findings, which include that just 13\% of papers had (i) sufficiently low barriers to reproduction, and (ii) enough obtainable information, to be considered for reproduction, a… ▽ More

    Submitted 7 August, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 5 pages plus appendix, 4 tables, 1 figure. To appear at "Workshop on Insights from Negative Results in NLP" (co-located with EACL2023). Updated author list and acknowledgements

    MSC Class: 68 ACM Class: I.2.7

  10. arXiv:2304.14803  [pdf

    cs.CL

    SemEval-2023 Task 11: Learning With Disagreements (LeWiDi)

    Authors: Elisa Leonardelli, Alexandra Uma, Gavin Abercrombie, Dina Almanea, Valerio Basile, Tommaso Fornaciari, Barbara Plank, Verena Rieser, Massimo Poesio

    Abstract: NLP datasets annotated with human judgments are rife with disagreements between the judges. This is especially true for tasks depending on subjective judgments such as sentiment analysis or offensive language detection. Particularly in these latter cases, the NLP community has come to realize that the approach of 'reconciling' these different subjective interpretations is inappropriate. Many NLP r… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

  11. arXiv:2301.10684  [pdf, other

    cs.CL

    Consistency is Key: Disentangling Label Variation in Natural Language Processing with Intra-Annotator Agreement

    Authors: Gavin Abercrombie, Verena Rieser, Dirk Hovy

    Abstract: We commonly use agreement measures to assess the utility of judgements made by human annotators in Natural Language Processing (NLP) tasks. While inter-annotator agreement is frequently used as an indication of label reliability by measuring consistency between annotators, we argue for the additional use of intra-annotator agreement to measure label stability over time. However, in a systematic re… ▽ More

    Submitted 25 January, 2023; originally announced January 2023.

  12. arXiv:2210.00572  [pdf, other

    cs.CL

    Risk-graded Safety for Handling Medical Queries in Conversational AI

    Authors: Gavin Abercrombie, Verena Rieser

    Abstract: Conversational AI systems can engage in unsafe behaviour when handling users' medical queries that can have severe consequences and could even lead to deaths. Systems therefore need to be capable of both recognising the seriousness of medical inputs and producing responses with appropriate levels of risk. We create a corpus of human written English language medical queries and the responses of dif… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

    Comments: Accepted for publication at AACL 2022

  13. arXiv:2109.09483  [pdf, other

    cs.CL cs.HC

    ConvAbuse: Data, Analysis, and Benchmarks for Nuanced Abuse Detection in Conversational AI

    Authors: Amanda Cercas Curry, Gavin Abercrombie, Verena Rieser

    Abstract: We present the first English corpus study on abusive language towards three conversational AI systems gathered "in the wild": an open-domain social bot, a rule-based chatbot, and a task-based system. To account for the complexity of the task, we take a more `nuanced' approach where our ConvAI dataset reflects fine-grained notions of abuse, as well as views from multiple expert annotators. We find… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: To be published in the 2021 Conference on Empirical Methods for Natural Language Processing (EMNLP2021)

  14. arXiv:2107.03451  [pdf, other

    cs.CL cs.AI

    Anticipating Safety Issues in E2E Conversational AI: Framework and Tooling

    Authors: Emily Dinan, Gavin Abercrombie, A. Stevie Bergman, Shannon Spruit, Dirk Hovy, Y-Lan Boureau, Verena Rieser

    Abstract: Over the last several years, end-to-end neural conversational agents have vastly improved in their ability to carry a chit-chat conversation with humans. However, these models are often trained on large datasets from the internet, and as a result, may learn undesirable behaviors from this data, such as toxic or otherwise harmful language. Researchers must thus wrestle with the issue of how and whe… ▽ More

    Submitted 23 July, 2021; v1 submitted 7 July, 2021; originally announced July 2021.

  15. arXiv:2106.02578  [pdf, other

    cs.AI

    Alexa, Google, Siri: What are Your Pronouns? Gender and Anthropomorphism in the Design and Perception of Conversational Assistants

    Authors: Gavin Abercrombie, Amanda Cercas Curry, Mugdha Pandya, Verena Rieser

    Abstract: Technology companies have produced varied responses to concerns about the effects of the design of their conversational AI systems. Some have claimed that their voice assistants are in fact not gendered or human-like -- despite design features suggesting the contrary. We compare these claims to user perceptions by analysing the pronouns they use when referring to AI assistants. We also examine sys… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: To be presented at the 3rd Workshop on Gender Bias in Natural Language Processing (GeBNLP 2021)

  16. Sentiment and position-taking analysis of parliamentary debates: A systematic literature review

    Authors: Gavin Abercrombie, Riza Batista-Navarro

    Abstract: Parliamentary and legislative debate transcripts provide access to information concerning the opinions, positions and policy preferences of elected politicians. They attract attention from researchers from a wide variety of backgrounds, from political and social sciences to computer science. As a result, the problem of automatic sentiment and position-taking analysis has been tackled from differen… ▽ More

    Submitted 16 January, 2020; v1 submitted 9 July, 2019; originally announced July 2019.

    Comments: Journal of Computational Social Science (2020)