Showing 1–16 of 16 results for author: Curry, A

Searching in archive cs.
  1. arXiv:2407.06908  [pdf, other]

    cs.CL cs.CY

    Divine LLaMAs: Bias, Stereotypes, Stigmatization, and Emotion Representation of Religion in Large Language Models

    Authors: Flor Miriam Plaza-del-Arco, Amanda Cercas Curry, Susanna Paoli, Alba Curry, Dirk Hovy

    Abstract: Emotions play important epistemological and cognitive roles in our lives, revealing our values and guiding our actions. Previous work has shown that LLMs display biases in emotion attribution along gender lines. However, unlike gender, which says little about our values, religion, as a socio-cultural system, prescribes a set of beliefs and values for its followers. Religions, therefore, cultivate…

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2406.08598  [pdf, other]

    cs.CL cs.AI

    Language Model Council: Benchmarking Foundation Models on Highly Subjective Tasks by Consensus

    Authors: Justin Zhao, Flor Miriam Plaza-del-Arco, Amanda Cercas Curry

    Abstract: The rapid advancement of Large Language Models (LLMs) necessitates robust and challenging benchmarks. Leaderboards like Chatbot Arena rank LLMs based on how well their responses align with human preferences. However, many tasks such as those related to emotional intelligence, creative writing, or persuasiveness, are highly subjective and often lack majoritarian human agreement. Judges may have irr…

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2403.04445  [pdf, other]

    cs.CL

    Classist Tools: Social Class Correlates with Performance in NLP

    Authors: Amanda Cercas Curry, Giuseppe Attanasio, Zeerak Talat, Dirk Hovy

    Abstract: Since the foundational work of William Labov on the social stratification of language (Labov, 1964), linguistics has made concentrated efforts to explore the links between sociodemographic characteristics and language production and perception. But while there is strong evidence for socio-demographic characteristics in language, they are infrequently used in Natural Language Processing (NLP). Age…

    Submitted 7 March, 2024; originally announced March 2024.

  4. arXiv:2403.03874  [pdf, ps, other]

    cs.CL cs.AI cs.CY

    Impoverished Language Technology: The Lack of (Social) Class in NLP

    Authors: Amanda Cercas Curry, Zeerak Talat, Dirk Hovy

    Abstract: Since Labov's (1964) foundational work on the social stratification of language, linguistics has dedicated concerted efforts towards understanding the relationships between socio-demographic factors and language production and perception. Despite the large body of evidence identifying significant relationships between socio-demographic factors and language production, relatively few of these facto…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted to LREC-COLING 2024

  5. arXiv:2403.03121  [pdf, other]

    cs.CL

    Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution

    Authors: Flor Miriam Plaza-del-Arco, Amanda Cercas Curry, Alba Curry, Gavin Abercrombie, Dirk Hovy

    Abstract: Large language models (LLMs) reflect societal norms and biases, especially about gender. While societal biases and stereotypes have been extensively researched in various NLP applications, there is a surprising gap for emotion analysis. However, emotion and gender are closely linked in societal discourse. E.g., women are often thought of as more empathetic, while men's anger is more socially accep…

    Submitted 28 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted to ACL 2024

  6. arXiv:2403.02268  [pdf, other]

    cs.CL cs.AI cs.CY

    Subjective $\textit{Isms}$? On the Danger of Conflating Hate and Offence in Abusive Language Detection

    Authors: Amanda Cercas Curry, Gavin Abercrombie, Zeerak Talat

    Abstract: Natural language processing research has begun to embrace the notion of annotator subjectivity, motivated by variations in labelling. This approach understands each annotator's view as valid, which can be highly suitable for tasks that embed subjectivity, e.g., sentiment analysis. However, this construction may be inappropriate for tasks such as hate speech detection, as it affords equal validity…

    Submitted 4 March, 2024; originally announced March 2024.

  7. arXiv:2403.01222  [pdf, other]

    cs.CL

    Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions

    Authors: Flor Miriam Plaza-del-Arco, Alba Curry, Amanda Cercas Curry, Dirk Hovy

    Abstract: Emotions are a central aspect of communication. Consequently, emotion analysis (EA) is a rapidly growing field in natural language processing (NLP). However, there is no consensus on scope, direction, or methods. In this paper, we conduct a thorough review of 154 relevant NLP publications from the last decade. Based on this review, we address four different questions: (1) How are EA tasks defined…

    Submitted 18 March, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

    Comments: Accepted to LREC-COLING 2024

  8. arXiv:2312.02065  [pdf, other]

    cs.CL cs.AI

    Know Your Audience: Do LLMs Adapt to Different Age and Education Levels?

    Authors: Donya Rooein, Amanda Cercas Curry, Dirk Hovy

    Abstract: Large language models (LLMs) offer a range of new possibilities, including adapting the text to different audiences and their reading needs. But how well do they adapt? We evaluate the readability of answers generated by four state-of-the-art LLMs (commercial and open-source) to science questions when prompted to target different age groups and education levels. To assess the adaptability of LLMs…

    Submitted 4 December, 2023; originally announced December 2023.

  9. arXiv:2305.09800  [pdf, other]

    cs.CL

    Mirages: On Anthropomorphism in Dialogue Systems

    Authors: Gavin Abercrombie, Amanda Cercas Curry, Tanvi Dinkar, Verena Rieser, Zeerak Talat

    Abstract: Automated dialogue or conversational systems are anthropomorphised by developers and personified by users. While a degree of anthropomorphism may be inevitable due to the choice of medium, conscious and unconscious design choices can guide users to personify such systems to varying degrees. Encouraging users to relate to automated systems as if they were human can lead to high risk scenarios cause…

    Submitted 23 October, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted for publication at EMNLP. See ACL Anthology for published version

  10. arXiv:2212.10983  [pdf, other]

    cs.CL cs.HC

    Computer says "No": The Case Against Empathetic Conversational AI

    Authors: Alba Curry, Amanda Cercas Curry

    Abstract: Emotions are an integral part of human cognition and they guide not only our understanding of the world but also our actions within it. As such, whether we soothe or flame an emotion is not inconsequential. Recent work in conversational AI has focused on responding empathetically to users, validating and soothing their emotions without a real basis. This AI-aided emotional regulation can have nega…

    Submitted 6 July, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: Accepted to Findings of the ACL 2023

  11. arXiv:2109.09483  [pdf, other]

    cs.CL cs.HC

    ConvAbuse: Data, Analysis, and Benchmarks for Nuanced Abuse Detection in Conversational AI

    Authors: Amanda Cercas Curry, Gavin Abercrombie, Verena Rieser

    Abstract: We present the first English corpus study on abusive language towards three conversational AI systems gathered "in the wild": an open-domain social bot, a rule-based chatbot, and a task-based system. To account for the complexity of the task, we take a more 'nuanced' approach where our ConvAI dataset reflects fine-grained notions of abuse, as well as views from multiple expert annotators. We find…

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: To be published in the 2021 Conference on Empirical Methods for Natural Language Processing (EMNLP2021)

  12. arXiv:2106.02578  [pdf, other]

    cs.AI

    Alexa, Google, Siri: What are Your Pronouns? Gender and Anthropomorphism in the Design and Perception of Conversational Assistants

    Authors: Gavin Abercrombie, Amanda Cercas Curry, Mugdha Pandya, Verena Rieser

    Abstract: Technology companies have produced varied responses to concerns about the effects of the design of their conversational AI systems. Some have claimed that their voice assistants are in fact not gendered or human-like -- despite design features suggesting the contrary. We compare these claims to user perceptions by analysing the pronouns they use when referring to AI assistants. We also examine sys…

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: To be presented at the 3rd Workshop on Gender Bias in Natural Language Processing (GeBNLP 2021)

  13. arXiv:1909.04387  [pdf, other]

    cs.HC cs.CL

    A Crowd-based Evaluation of Abuse Response Strategies in Conversational Agents

    Authors: Amanda Cercas Curry, Verena Rieser

    Abstract: How should conversational agents respond to verbal abuse through the user? To answer this question, we conduct a large-scale crowd-sourced evaluation of abuse response strategies employed by current state-of-the-art systems. Our results show that some strategies, such as "polite refusal" score highly across the board, while for other strategies demographic factors, such as age, as well as the seve…

    Submitted 10 September, 2019; originally announced September 2019.

  14. arXiv:1712.07558  [pdf, other]

    cs.CL

    An Ensemble Model with Ranking for Social Dialogue

    Authors: Ioannis Papaioannou, Amanda Cercas Curry, Jose L. Part, Igor Shalyminov, Xinnuo Xu, Yanchao Yu, Ondřej Dušek, Verena Rieser, Oliver Lemon

    Abstract: Open-domain social dialogue is one of the long-standing goals of Artificial Intelligence. This year, the Amazon Alexa Prize challenge was announced for the first time, where real customers get to rate systems developed by leading universities worldwide. The aim of the challenge is to converse "coherently and engagingly with humans on popular topics for 20 minutes". We describe our Alexa Prize syst…

    Submitted 20 December, 2017; originally announced December 2017.

    Comments: NIPS 2017 Workshop on Conversational AI

  15. A Review of Evaluation Techniques for Social Dialogue Systems

    Authors: Amanda Cercas Curry, Helen Hastie, Verena Rieser

    Abstract: In contrast with goal-oriented dialogue, social dialogue has no clear measure of task success. Consequently, evaluation of these systems is notoriously hard. In this paper, we review current evaluation methods, focusing on automatic metrics. We conclude that turn-based metrics often ignore the context and do not account for the fact that several replies are valid, while end-of-dialogue rewards are…

    Submitted 13 September, 2017; originally announced September 2017.

    Comments: 2 pages

    MSC Class: 68T50

  16. Why We Need New Evaluation Metrics for NLG

    Authors: Jekaterina Novikova, Ondřej Dušek, Amanda Cercas Curry, Verena Rieser

    Abstract: The majority of NLG evaluation relies on automatic metrics, such as BLEU. In this paper, we motivate the need for novel, system- and data-independent automatic evaluation methods: We investigate a wide range of metrics, including state-of-the-art word-based and novel grammar-based ones, and demonstrate that they only weakly reflect human judgements of system outputs as generated by data-driven, e…

    Submitted 21 July, 2017; originally announced July 2017.

    Comments: accepted to EMNLP 2017

    Journal ref: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pages 2231-2242, Copenhagen, Denmark, September 7-11, 2017