Showing 1–50 of 115 results for author: Tsvetkov, Y

Searching in archive cs.
  1. arXiv:2407.12043  [pdf, other]

    cs.CL cs.AI cs.HC

    The Art of Saying No: Contextual Noncompliance in Language Models

    Authors: Faeze Brahman, Sachin Kumar, Vidhisha Balachandran, Pradeep Dasigi, Valentina Pyatkin, Abhilasha Ravichander, Sarah Wiegreffe, Nouha Dziri, Khyathi Chandu, Jack Hessel, Yulia Tsvetkov, Noah A. Smith, Yejin Choi, Hannaneh Hajishirzi

    Abstract: Chat-based language models are designed to be helpful, yet they should not comply with every user request. While most existing work primarily focuses on refusal of "unsafe" queries, we posit that the scope of noncompliance should be broadened. We introduce a comprehensive taxonomy of contextual noncompliance describing when and how models should not comply with user requests. Our taxonomy spans a…

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2407.08818  [pdf]

    cs.CL

    MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization

    Authors: Orevaoghene Ahia, Sachin Kumar, Hila Gonen, Valentin Hofmann, Tomasz Limisiewicz, Yulia Tsvetkov, Noah A. Smith

    Abstract: In multilingual settings, non-Latin scripts and low-resource languages are usually disadvantaged in terms of language models' utility, efficiency, and cost. Specifically, previous studies have reported multiple modeling biases that the current tokenization algorithms introduce to non-Latin script languages, the main one being over-segmentation. In this work, we propose MAGNET; multilingual adaptiv…

    Submitted 11 July, 2024; originally announced July 2024.

  3. arXiv:2407.02472  [pdf, other]

    cs.CL

    ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions

    Authors: Chan Young Park, Shuyue Stella Li, Hayoung Jung, Svitlana Volkova, Tanushree Mitra, David Jurgens, Yulia Tsvetkov

    Abstract: This study introduces ValueScope, a framework leveraging language models to quantify social norms and values within online communities, grounded in social science perspectives on normative structures. We employ ValueScope to dissect and analyze linguistic and stylistic expressions across 13 Reddit communities categorized under gender, politics, science, and finance. Our analysis provides a quantit…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: First three authors contributed equally. 33 pages. In submission

  4. arXiv:2406.19564  [pdf, other]

    cs.CL

    Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects

    Authors: Orevaoghene Ahia, Anuoluwapo Aremu, Diana Abagyan, Hila Gonen, David Ifeoluwa Adelani, Daud Abolade, Noah A. Smith, Yulia Tsvetkov

    Abstract: Yorùbá an African language with roughly 47 million speakers encompasses a continuum with several dialects. Recent efforts to develop NLP technologies for African languages have focused on their standard dialects, resulting in disparities for dialects and varieties for which there are little to no resources or tools. We take steps towards bridging this gap by introducing a new high-quality parallel…

    Submitted 27 June, 2024; originally announced June 2024.

  5. arXiv:2406.15992  [pdf, other]

    cs.CL

    Can LLM Graph Reasoning Generalize beyond Pattern Memorization?

    Authors: Yizhuo Zhang, Heng Wang, Shangbin Feng, Zhaoxuan Tan, Xiaochuang Han, Tianxing He, Yulia Tsvetkov

    Abstract: Large language models (LLMs) demonstrate great potential for problems with implicit graphical structures, while recent works seek to enhance the graph reasoning capabilities of LLMs through specialized instruction tuning. The resulting 'graph LLMs' are evaluated with in-distribution settings only, thus it remains underexplored whether LLMs are learning generalizable graph reasoning skills or merel…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 16 pages, 6 figures, Code and data will be publicly available at https://github.com/MatthewYZhang/NLGift

    ACM Class: I.2.7

  6. arXiv:2406.15951  [pdf, other]

    cs.CL

    Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration

    Authors: Shangbin Feng, Taylor Sorensen, Yuhan Liu, Jillian Fisher, Chan Young Park, Yejin Choi, Yulia Tsvetkov

    Abstract: While existing alignment paradigms have been integral in developing large language models (LLMs), LLMs often learn an averaged human preference and struggle to model diverse preferences across cultures, demographics, and communities. We propose Modular Pluralism, a modular framework based on multi-LLM collaboration for pluralistic alignment: it "plugs into" a base LLM a pool of smaller but special…

    Submitted 22 June, 2024; originally announced June 2024.

  7. arXiv:2406.15948  [pdf, other]

    cs.CL

    Teaching LLMs to Abstain across Languages via Multilingual Feedback

    Authors: Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Orevaoghene Ahia, Shuyue Stella Li, Vidhisha Balachandran, Sunayana Sitaram, Yulia Tsvetkov

    Abstract: Multilingual LLMs often have knowledge disparities across languages, with larger gaps in under-resourced languages. Teaching LLMs to abstain in the face of knowledge gaps is thus a promising strategy to mitigate hallucinations in multilingual settings. However, previous studies on LLM abstention primarily focus on English; we find that directly applying existing solutions beyond English results in…

    Submitted 22 June, 2024; originally announced June 2024.

  8. arXiv:2406.00922  [pdf, other]

    cs.CL cs.AI

    MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning

    Authors: Shuyue Stella Li, Vidhisha Balachandran, Shangbin Feng, Jonathan Ilgen, Emma Pierson, Pang Wei Koh, Yulia Tsvetkov

    Abstract: In high-stakes domains like clinical reasoning, AI assistants powered by large language models (LLMs) are yet to be reliable and safe. We identify a key obstacle towards reliability: existing LLMs are trained to answer any question, even with incomplete context in the prompt or insufficient parametric knowledge. We propose to change this paradigm to develop more careful LLMs that ask follow-up que…

    Submitted 4 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

    Comments: 29 pages, 12 figures

  9. arXiv:2404.16367  [pdf, other]

    cs.CL cs.LG

    Learning Syntax Without Planting Trees: Understanding When and Why Transformers Generalize Hierarchically

    Authors: Kabir Ahuja, Vidhisha Balachandran, Madhur Panwar, Tianxing He, Noah A. Smith, Navin Goyal, Yulia Tsvetkov

    Abstract: Transformers trained on natural language data have been shown to learn its hierarchical structure and generalize to sentences with unseen syntactic structures without explicitly encoding any structural bias. In this work, we investigate sources of inductive bias in transformer models and their training that could cause such generalization behavior to emerge. We extensively experiment with transfor…

    Submitted 31 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Code now available: https://github.com/kabirahuja2431/transformers-hg

  10. arXiv:2404.06664  [pdf, other]

    cs.CL cs.AI cs.HC

    CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge

    Authors: Yu Ying Chiu, Liwei Jiang, Maria Antoniak, Chan Young Park, Shuyue Stella Li, Mehar Bhatia, Sahithya Ravi, Yulia Tsvetkov, Vered Shwartz, Yejin Choi

    Abstract: Frontier large language models (LLMs) are developed by researchers and practitioners with skewed cultural backgrounds and on datasets with skewed sources. However, LLMs' (lack of) multicultural knowledge cannot be effectively assessed with current methods for developing benchmarks. Existing multicultural evaluations primarily rely on expensive and restricted human annotations or potentially outdat…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Preprint (under review)

  11. arXiv:2403.11009  [pdf, other]

    cs.CL cs.AI

    DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

    Authors: Fahim Faisal, Orevaoghene Ahia, Aarohi Srivastava, Kabir Ahuja, David Chiang, Yulia Tsvetkov, Antonios Anastasopoulos

    Abstract: Language technologies should be judged on their usefulness in real-world use cases. An often overlooked aspect in natural language processing (NLP) research and evaluation is language variation in the form of non-standard dialects or language varieties (hereafter, varieties). Most NLP benchmarks are limited to standard language varieties. To fill this gap, we propose DIALECTBENCH, the first-ever l…

    Submitted 7 July, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

    Comments: Equal contribution: Fahim Faisal, Orevaoghene Ahia

  12. arXiv:2403.04801  [pdf, other]

    cs.CL

    Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs

    Authors: Aly M. Kassem, Omar Mahmoud, Niloofar Mireshghallah, Hyunwoo Kim, Yulia Tsvetkov, Yejin Choi, Sherif Saad, Santu Rana

    Abstract: In this paper, we introduce a black-box prompt optimization method that uses an attacker LLM agent to uncover higher levels of memorization in a victim agent, compared to what is revealed by prompting the target model with the training data directly, which is the dominant approach of quantifying memorization in LLMs. We use an iterative rejection-sampling optimization process to find instruction-b…

    Submitted 31 March, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  13. arXiv:2402.17914  [pdf, other]

    cs.CL cs.AI

    Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers

    Authors: Roy Xie, Orevaoghene Ahia, Yulia Tsvetkov, Antonios Anastasopoulos

    Abstract: Identifying linguistic differences between dialects of a language often requires expert knowledge and meticulous human analysis. This is largely due to the complexity and nuance involved in studying various dialects. We present a novel approach to extract distinguishing lexical features of dialects by utilizing interpretable dialect classifiers, even in the absence of human experts. We explore bot…

    Submitted 23 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Code is available at https://github.com/ruoyuxie/interpretable_dialect_classifier

  14. arXiv:2402.11638  [pdf, other]

    cs.CL

    Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks

    Authors: Yichen Wang, Shangbin Feng, Abe Bohan Hou, Xiao Pu, Chao Shen, Xiaoming Liu, Yulia Tsvetkov, Tianxing He

    Abstract: The widespread use of large language models (LLMs) is increasing the demand for methods that detect machine-generated text to prevent misuse. The goal of our study is to stress test the detectors' robustness to malicious attacks under realistic scenarios. We comprehensively study the robustness of popular machine-generated text detectors under attacks from diverse categories: editing, paraphrasing…

    Submitted 18 February, 2024; originally announced February 2024.

  15. arXiv:2402.10426  [pdf, other]

    cs.CL

    DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection

    Authors: Herun Wan, Shangbin Feng, Zhaoxuan Tan, Heng Wang, Yulia Tsvetkov, Minnan Luo

    Abstract: Large language models are limited by challenges in factuality and hallucinations to be directly employed off-the-shelf for judging the veracity of news articles, where factual accuracy is paramount. In this work, we propose DELL that identifies three key stages in misinformation detection where LLMs could be incorporated as part of the pipeline: 1) LLMs could generate news reactions to repr…

    Submitted 4 July, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  16. arXiv:2402.07841  [pdf, other]

    cs.CL

    Do Membership Inference Attacks Work on Large Language Models?

    Authors: Michael Duan, Anshuman Suri, Niloofar Mireshghallah, Sewon Min, Weijia Shi, Luke Zettlemoyer, Yulia Tsvetkov, Yejin Choi, David Evans, Hannaneh Hajishirzi

    Abstract: Membership inference attacks (MIAs) attempt to predict whether a particular datapoint is a member of a target model's training data. Despite extensive research on traditional machine learning models, there has been limited work studying MIA on the pre-training data of large language models (LLMs). We perform a large-scale evaluation of MIAs over a suite of language models (LMs) trained on the Pile…

    Submitted 12 February, 2024; originally announced February 2024.

  17. arXiv:2402.00371  [pdf, other]

    cs.CL

    What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection

    Authors: Shangbin Feng, Herun Wan, Ningnan Wang, Zhaoxuan Tan, Minnan Luo, Yulia Tsvetkov

    Abstract: Social media bot detection has always been an arms race between advancements in machine learning bot detectors and adversarial bot strategies to evade detection. In this work, we bring the arms race to the next level by investigating the opportunities and risks of state-of-the-art large language models (LLMs) in social bot detection. To investigate the opportunities, we design novel LLM-based bot…

    Submitted 4 July, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: ACL 2024

  18. arXiv:2402.00367  [pdf, other]

    cs.CL

    Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration

    Authors: Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Vidhisha Balachandran, Yulia Tsvetkov

    Abstract: Despite efforts to expand the knowledge of large language models (LLMs), knowledge gaps -- missing or outdated information in LLMs -- might always persist given the evolving nature of knowledge. In this work, we study approaches to identify LLM knowledge gaps and abstain from answering questions when knowledge gaps are present. We first adapt existing approaches to model calibration or adaptation…

    Submitted 30 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: ACL 2024

  19. arXiv:2401.08565  [pdf, other]

    cs.CL

    Tuning Language Models by Proxy

    Authors: Alisa Liu, Xiaochuang Han, Yizhong Wang, Yulia Tsvetkov, Yejin Choi, Noah A. Smith

    Abstract: Despite the general capabilities of large pretrained language models, they consistently benefit from further adaptation to better achieve desired behaviors. However, tuning these models has become increasingly resource-intensive, or impossible when model weights are private. We introduce proxy-tuning, a lightweight decoding-time algorithm that operates on top of black-box LMs to achieve the same e…

    Submitted 15 April, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: fix typo in Table 13, add acknowledgments section. code available at https://github.com/alisawuffles/proxy-tuning

  20. arXiv:2401.06855  [pdf, other]

    cs.CL

    Fine-grained Hallucination Detection and Editing for Language Models

    Authors: Abhika Mishra, Akari Asai, Vidhisha Balachandran, Yizhong Wang, Graham Neubig, Yulia Tsvetkov, Hannaneh Hajishirzi

    Abstract: Large language models (LMs) are prone to generate factual errors, which are often called hallucinations. In this paper, we introduce a comprehensive taxonomy of hallucinations and argue that hallucinations manifest in diverse forms, each requiring varying degrees of careful assessments to verify factuality. We propose a novel task of automatic fine-grained hallucination detection and construct a n…

    Submitted 21 February, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

    Comments: Our code, data, and demo are available at https://fine-grained-hallucination.github.io. Expanded human annotations adding a new LM, as well as included more baselines for comparison

  21. arXiv:2311.09741  [pdf, other]

    cs.CL cs.LG

    P^3SUM: Preserving Author's Perspective in News Summarization with Diffusion Language Models

    Authors: Yuhan Liu, Shangbin Feng, Xiaochuang Han, Vidhisha Balachandran, Chan Young Park, Sachin Kumar, Yulia Tsvetkov

    Abstract: In this work, we take a first step towards designing summarization systems that are faithful to the author's intent, not only the semantic content of the article. Focusing on a case study of preserving political perspectives in news summarization, we find that existing approaches alter the political opinions and stances of news articles in more than 50% of summaries, misrepresenting the intent and…

    Submitted 4 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  22. arXiv:2311.07115  [pdf, other]

    cs.CL

    Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions

    Authors: Sachin Kumar, Chan Young Park, Yulia Tsvetkov

    Abstract: Language model (LM) prompting--a popular paradigm for solving NLP tasks--has been shown to be susceptible to miscalibration and brittleness to slight prompt variations, caused by its discriminative prompting approach, i.e., predicting the label given the input. To address these issues, we propose Gen-Z--a generative prompting framework for zero-shot text classification. GEN-Z is generative, as it…

    Submitted 13 November, 2023; originally announced November 2023.

  23. arXiv:2310.17884  [pdf, other]

    cs.AI cs.CL cs.CR

    Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory

    Authors: Niloofar Mireshghallah, Hyunwoo Kim, Xuhui Zhou, Yulia Tsvetkov, Maarten Sap, Reza Shokri, Yejin Choi

    Abstract: The interactive use of large language models (LLMs) in AI assistants (at work, home, etc.) introduces a new set of inference-time privacy risks: LLMs are fed different types of information from multiple sources in their inputs and are expected to reason about what to share in their outputs, for what purpose and with whom, within a given context. In this work, we draw attention to the highly critic…

    Submitted 28 June, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: 2024 ICLR Spotlight. The dataset and code can be found at https://confaide.github.io

  24. arXiv:2310.11324  [pdf, other]

    cs.CL cs.AI cs.LG

    Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting

    Authors: Melanie Sclar, Yejin Choi, Yulia Tsvetkov, Alane Suhr

    Abstract: As large language models (LLMs) are adopted as a fundamental component of language technologies, it is crucial to accurately characterize their performance. Because choices in prompt design can strongly influence model behavior, this design process is critical in effectively using any modern pre-trained generative language model. In this work, we focus on LLM sensitivity to a quintessential class…

    Submitted 1 July, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 Camera Ready version. With respect to the original submission, we added text generation experiments, plots of entire accuracy distributions for each task + stdev computations, and prompt length correlation with spread analysis

  25. arXiv:2310.09725  [pdf, other]

    cs.CL

    KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models

    Authors: Yuyang Bai, Shangbin Feng, Vidhisha Balachandran, Zhaoxuan Tan, Shiqi Lou, Tianxing He, Yulia Tsvetkov

    Abstract: Large language models (LLMs) demonstrate remarkable performance on knowledge-intensive tasks, suggesting that real-world knowledge is encoded in their model parameters. However, besides explorations on a few probing tasks in limited knowledge domains, it is not well understood how to evaluate LLMs' knowledge systematically and how well their knowledge abilities generalize, across a spectrum of kno…

    Submitted 23 March, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

    Comments: TheWebConf 2024

  26. arXiv:2310.07707  [pdf, other]

    cs.LG cs.CL cs.CV

    MatFormer: Nested Transformer for Elastic Inference

    Authors: Devvrit, Sneha Kudugunta, Aditya Kusupati, Tim Dettmers, Kaifeng Chen, Inderjit Dhillon, Yulia Tsvetkov, Hannaneh Hajishirzi, Sham Kakade, Ali Farhadi, Prateek Jain

    Abstract: Transformer models are deployed in a wide range of settings, from multi-accelerator clusters to standalone mobile phones. The diverse inference constraints in these scenarios necessitate practitioners to train foundation models such as PaLM 2, Llama, & ViTs as a series of models of varying sizes. Due to significant training costs, only a select few model sizes are trained and supported, limiting m…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 31 pages, 12 figures, first three authors contributed equally

  27. arXiv:2310.05165  [pdf, other]

    cs.CL

    On the Zero-Shot Generalization of Machine-Generated Text Detectors

    Authors: Xiao Pu, Jingyu Zhang, Xiaochuang Han, Yulia Tsvetkov, Tianxing He

    Abstract: The rampant proliferation of large language models, fluent enough to generate text indistinguishable from human-written language, gives unprecedented importance to the detection of machine-generated text. This work is motivated by an important research question: How will the detectors of machine-generated text perform on outputs of a new generator, that the detectors were not trained on? We begin…

    Submitted 8 October, 2023; originally announced October 2023.

  28. arXiv:2310.03991  [pdf, other]

    cs.CL

    SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation

    Authors: Abe Bohan Hou, Jingyu Zhang, Tianxing He, Yichen Wang, Yung-Sung Chuang, Hongwei Wang, Lingfeng Shen, Benjamin Van Durme, Daniel Khashabi, Yulia Tsvetkov

    Abstract: Existing watermarking algorithms are vulnerable to paraphrase attacks because of their token-level design. To address this issue, we propose SemStamp, a robust sentence-level semantic watermarking algorithm based on locality-sensitive hashing (LSH), which partitions the semantic space of sentences. The algorithm encodes and LSH-hashes a candidate sentence generated by an LLM, and conducts sentence…

    Submitted 22 April, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Accepted to NAACL 24 Main

  29. arXiv:2310.01290  [pdf, other]

    cs.CL cs.AI

    Knowledge Crosswords: Geometric Knowledge Reasoning with Large Language Models

    Authors: Wenxuan Ding, Shangbin Feng, Yuhan Liu, Zhaoxuan Tan, Vidhisha Balachandran, Tianxing He, Yulia Tsvetkov

    Abstract: We propose Knowledge Crosswords, a geometric knowledge reasoning benchmark consisting of incomplete knowledge networks bounded by structured factual constraints, where LLMs are tasked with inferring the missing facts to meet all constraints. The novel setting of geometric knowledge reasoning necessitates new LM abilities beyond existing atomic/linear multi-hop QA, such as backtracking, verifying f…

    Submitted 25 June, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

  30. arXiv:2310.00935  [pdf, other]

    cs.CL

    Resolving Knowledge Conflicts in Large Language Models

    Authors: Yike Wang, Shangbin Feng, Heng Wang, Weijia Shi, Vidhisha Balachandran, Tianxing He, Yulia Tsvetkov

    Abstract: Large language models (LLMs) often encounter knowledge conflicts, scenarios where discrepancy arises between the internal parametric knowledge of LLMs and non-parametric information provided in the prompt context. In this work we ask what are the desiderata for LLMs when a knowledge conflict arises and whether existing LLMs fulfill them. We posit that LLMs should 1) identify knowledge conflicts, 2…

    Submitted 2 October, 2023; originally announced October 2023.

  31. arXiv:2309.17157  [pdf, other]

    cs.CL

    LatticeGen: A Cooperative Framework which Hides Generated Text in a Lattice for Privacy-Aware Generation on Cloud

    Authors: Mengke Zhang, Tianxing He, Tianle Wang, Lu Mi, Fatemehsadat Mireshghallah, Binyi Chen, Hao Wang, Yulia Tsvetkov

    Abstract: In the current user-server interaction paradigm of prompted generation with large language models (LLM) on cloud, the server fully controls the generation process, which leaves zero options for users who want to keep the generated text to themselves. We propose LatticeGen, a cooperative framework in which the server still handles most of the computation while the user controls the sampling operati…

    Submitted 5 April, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

  32. arXiv:2306.15091  [pdf, other]

    cs.CL

    Understanding In-Context Learning via Supportive Pretraining Data

    Authors: Xiaochuang Han, Daniel Simig, Todor Mihaylov, Yulia Tsvetkov, Asli Celikyilmaz, Tianlu Wang

    Abstract: In-context learning (ICL) improves language models' performance on a variety of NLP tasks by simply demonstrating a handful of examples at inference time. It is not well understood why ICL ability emerges, as the model has never been specifically trained on such demonstrations. Unlike prior work that explores implicit mechanisms behind ICL, we study ICL via investigating the pretraining data. Spec…

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: ACL 2023

  33. arXiv:2306.00924  [pdf, other]

    cs.CL cs.AI cs.LG

    Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker

    Authors: Melanie Sclar, Sachin Kumar, Peter West, Alane Suhr, Yejin Choi, Yulia Tsvetkov

    Abstract: Theory of Mind (ToM) -- the ability to reason about the mental states of other people -- is a key element of our social intelligence. Yet, despite their ever more impressive performance, large-scale neural language models still lack basic theory of mind capabilities out-of-the-box. We posit that simply scaling up models will not imbue them with theory of mind due to the i…

    Submitted 1 June, 2023; originally announced June 2023.

    Journal ref: ACL 2023

  34. Examining risks of racial biases in NLP tools for child protective services

    Authors: Anjalie Field, Amanda Coston, Nupoor Gandhi, Alexandra Chouldechova, Emily Putnam-Hornstein, David Steier, Yulia Tsvetkov

    Abstract: Although much literature has established the presence of demographic bias in natural language processing (NLP) models, most work relies on curated bias metrics that may not be reflective of real-world applications. At the same time, practitioners are increasingly using algorithmic tools in high-stakes settings, with particular recent interest in NLP. In this work, we focus on one such setting: chi…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: In 2023 ACM Conference on Fairness, Accountability, and Transparency (FAccT '23)

  35. arXiv:2305.14857  [pdf, other]

    cs.CL

    BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer

    Authors: Akari Asai, Sneha Kudugunta, Xinyan Velocity Yu, Terra Blevins, Hila Gonen, Machel Reid, Yulia Tsvetkov, Sebastian Ruder, Hannaneh Hajishirzi

    Abstract: Despite remarkable advancements in few-shot generalization in natural language processing, most models are developed and evaluated primarily in English. To facilitate research on few-shot cross-lingual transfer, we introduce a new benchmark, called BUFFET, which unifies 15 diverse tasks across 54 languages in a sequence-to-sequence format and provides a fixed set of few-shot examples and instructi…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: The data and code is available at https://buffetfs.github.io/

  36. arXiv:2305.14771  [pdf, other]

    cs.CL

    David helps Goliath: Inference-Time Collaboration Between Small Specialized and Large General Diffusion LMs

    Authors: Xiaochuang Han, Sachin Kumar, Yulia Tsvetkov, Marjan Ghazvininejad

    Abstract: Diffusion-based language models are emerging as a promising alternative to autoregressive LMs: they approach the competence of autoregressive LMs while offering nuanced controllability at inference time. While autoregressive LMs have benefited immensely from scaling and instruction-based learning, existing studies of diffusion LMs have been conducted on a smaller scale. Starting with a recently pr…

    Submitted 14 February, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

  37. arXiv:2305.14739  [pdf, other]

    cs.CL

    Trusting Your Evidence: Hallucinate Less with Context-aware Decoding

    Authors: Weijia Shi, Xiaochuang Han, Mike Lewis, Yulia Tsvetkov, Luke Zettlemoyer, Scott Wen-tau Yih

    Abstract: Language models (LMs) often struggle to pay enough attention to the input context, and generate texts that are unfaithful or contain hallucinations. To mitigate this issue, we present context-aware decoding (CAD), which follows a contrastive output distribution that amplifies the difference between the output probabilities when a model is used with and without context. Our experiments show that CA…

    Submitted 24 May, 2023; originally announced May 2023.

  38. arXiv:2305.14716  [pdf, other]

    cs.CL

    GlobalBench: A Benchmark for Global Progress in Natural Language Processing

    Authors: Yueqi Song, Catherine Cui, Simran Khanuja, Pengfei Liu, Fahim Faisal, Alissa Ostapenko, Genta Indra Winata, Alham Fikri Aji, Samuel Cahyawijaya, Yulia Tsvetkov, Antonios Anastasopoulos, Graham Neubig

    Abstract: Despite the major advances in NLP, significant disparities in NLP system performance across languages still exist. Arguably, these are due to uneven resource allocation and sub-optimal incentives to work on less resourced languages. To track and further incentivize the global development of equitable language technology, we introduce GlobalBench. Prior multilingual benchmarks are static and have f…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Preprint, 9 pages

  39. arXiv:2305.14326  [pdf, other]

    cs.CL

    TalkUp: Paving the Way for Understanding Empowering Language

    Authors: Lucille Njoo, Chan Young Park, Octavia Stappart, Marvin Thielk, Yi Chu, Yulia Tsvetkov

    Abstract: Empowering language is important in many real-world contexts, from education to workplace dynamics to healthcare. Though language technologies are growing more prevalent in these contexts, empowerment has seldom been studied in NLP, and moreover, it is inherently challenging to operationalize because of its implicit nature. This work builds from linguistic and social psychology literature to explo…

    Submitted 23 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Findings of EMNLP 2023

  40. arXiv:2305.13707  [pdf, other]

    cs.CL

    Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models

    Authors: Orevaoghene Ahia, Sachin Kumar, Hila Gonen, Jungo Kasai, David R. Mortensen, Noah A. Smith, Yulia Tsvetkov

    Abstract: Language models have graduated from being research prototypes to commercialized products offered as web APIs, and recent works have highlighted the multilingual capabilities of these products. The API vendors charge their users based on usage, more specifically on the number of "tokens" processed or generated by the underlying language models. What constitutes a token, however, is training data…

    Submitted 23 May, 2023; originally announced May 2023.

  41. arXiv:2305.10037  [pdf, other]

    cs.CL cs.AI

    Can Language Models Solve Graph Problems in Natural Language?

    Authors: Heng Wang, Shangbin Feng, Tianxing He, Zhaoxuan Tan, Xiaochuang Han, Yulia Tsvetkov

    Abstract: Large language models (LLMs) are increasingly adopted for a variety of tasks with implicit graphical structures, such as planning in robotics, multi-hop question answering or knowledge probing, structured commonsense reasoning, and more. While LLMs have advanced the state-of-the-art on these tasks with structure implications, whether LLMs could explicitly process textual descriptions of graphs and…

    Submitted 5 January, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 Spotlight

  42. arXiv:2305.09955  [pdf, other]

    cs.CL

    Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models

    Authors: Shangbin Feng, Weijia Shi, Yuyang Bai, Vidhisha Balachandran, Tianxing He, Yulia Tsvetkov

    Abstract: By design, large language models (LLMs) are static general-purpose models, expensive to retrain or update frequently. As they are increasingly adopted for knowledge-intensive tasks, it becomes evident that these design choices lead to failures to generate factual, relevant, and up-to-date knowledge. To this end, we propose Knowledge Card, a modular framework to plug in new factual and relevant kno…

    Submitted 22 March, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: ICLR 2024, oral

  43. arXiv:2305.08283  [pdf, other]

    cs.CL

    From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models

    Authors: Shangbin Feng, Chan Young Park, Yuhan Liu, Yulia Tsvetkov

    Abstract: Language models (LMs) are pretrained on diverse data sources, including news, discussion forums, books, and online encyclopedias. A significant portion of this data includes opinions and perspectives which, on one hand, celebrate democracy and diversity of ideas, and on the other hand are inherently socially biased. Our work develops new methods to (1) measure political biases in LMs trained on su…

    Submitted 5 July, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  44. arXiv:2305.08281  [pdf, other]

    cs.CL

    FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge

    Authors: Shangbin Feng, Vidhisha Balachandran, Yuyang Bai, Yulia Tsvetkov

    Abstract: Evaluating the factual consistency of automatically generated summaries is essential for the progress and adoption of reliable summarization systems. Despite recent advances, existing factuality evaluation models are not robust, being especially prone to entity and relation errors in new domains. We propose FactKB, a simple new approach to factuality evaluation that is generalizable across domains…

    Submitted 18 October, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  45. arXiv:2303.18190  [pdf, other]

    cs.CL

    Assessing Language Model Deployment with Risk Cards

    Authors: Leon Derczynski, Hannah Rose Kirk, Vidhisha Balachandran, Sachin Kumar, Yulia Tsvetkov, M. R. Leiser, Saif Mohammad

    Abstract: This paper introduces RiskCards, a framework for structured assessment and documentation of risks associated with an application of language models. As with all language, text generated by language models can be harmful, or used to bring about harm. Automating language generation adds both an element of scale and also more subtle or emergent undesirable tendencies to the generated text. Prior work…

    Submitted 31 March, 2023; originally announced March 2023.

  46. arXiv:2302.00381  [pdf, other]

    cs.SI

    BotPercent: Estimating Bot Populations in Twitter Communities

    Authors: Zhaoxuan Tan, Shangbin Feng, Melanie Sclar, Herun Wan, Minnan Luo, Yejin Choi, Yulia Tsvetkov

    Abstract: Twitter bot detection is vital in combating misinformation and safeguarding the integrity of social media discourse. While malicious bots are becoming more and more sophisticated and personalized, standard bot detection approaches are still agnostic to social environments (henceforth, communities) the bots operate at. In this work, we introduce community-specific bot detection, estimating the perc…

    Submitted 18 October, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: Accepted to findings of EMNLP 2023

  47. arXiv:2212.10539  [pdf, other]

    cs.CL

    Toward Human Readable Prompt Tuning: Kubrick's The Shining is a good movie, and a good prompt too?

    Authors: Weijia Shi, Xiaochuang Han, Hila Gonen, Ari Holtzman, Yulia Tsvetkov, Luke Zettlemoyer

    Abstract: Large language models can perform new tasks in a zero-shot fashion, given natural language prompts that specify the desired behavior. Such prompts are typically hand engineered, but can also be learned with gradient-based methods from labeled data. However, it is underexplored what factors make the prompts effective, especially when the prompts are natural language. In this paper, we investigate c…

    Submitted 20 December, 2022; originally announced December 2022.

  48. arXiv:2212.10020  [pdf, other]

    cs.CL

    On the Blind Spots of Model-Based Evaluation Metrics for Text Generation

    Authors: Tianxing He, Jingyu Zhang, Tianle Wang, Sachin Kumar, Kyunghyun Cho, James Glass, Yulia Tsvetkov

    Abstract: In this work, we explore a useful but often neglected methodology for robustness analysis of text generation evaluation metrics: stress tests with synthetic data. Basically, we design and synthesize a wide range of potential errors and check whether they result in a commensurate drop in the metric scores. We examine a range of recently proposed evaluation metrics based on pretrained language model…

    Submitted 18 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  49. arXiv:2210.17432  [pdf, other]

    cs.CL cs.LG

    SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control

    Authors: Xiaochuang Han, Sachin Kumar, Yulia Tsvetkov

    Abstract: Despite the growing success of diffusion models in continuous-valued domains (e.g., images), similar efforts for discrete domains such as text have yet to match the performance of autoregressive language models. In this work, we present SSD-LM -- a diffusion-based language model with two key design choices. First, SSD-LM is semi-autoregressive, iteratively generating blocks of text, allowing for f…

    Submitted 26 June, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: ACL 2023

  50. arXiv:2210.15144  [pdf, other]

    cs.CL cs.CY

    Gendered Mental Health Stigma in Masked Language Models

    Authors: Inna Wanyin Lin, Lucille Njoo, Anjalie Field, Ashish Sharma, Katharina Reinecke, Tim Althoff, Yulia Tsvetkov

    Abstract: Mental health stigma prevents many individuals from receiving the appropriate care, and social psychology studies have shown that mental health tends to be overlooked in men. In this work, we investigate gendered mental health stigma in masked language models. In doing so, we operationalize mental health stigma by developing a framework grounded in psychology research: we use clinical psychology l…

    Submitted 11 April, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022