Zum Hauptinhalt springen

Showing 1–11 of 11 results for author: Havaldar, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11622  [pdf, other

    cs.CL

    Building Knowledge-Guided Lexica to Model Cultural Variation

    Authors: Shreya Havaldar, Salvatore Giorgi, Sunny Rai, Thomas Talhelm, Sharath Chandra Guntuku, Lyle Ungar

    Abstract: Cultural variation exists between nations (e.g., the United States vs. China), but also within regions (e.g., California vs. Texas, Los Angeles vs. San Francisco). Measuring this regional cultural variation can illuminate how and why people think and behave differently. Historically, it has been difficult to computationally model cultural variation due to a lack of training data and scalability co… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted at NAACL 2024

  2. arXiv:2402.11333  [pdf, other

    cs.CY

    Social Norms in Cinema: A Cross-Cultural Analysis of Shame, Pride and Prejudice

    Authors: Sunny Rai, Khushang Jilesh Zaveri, Shreya Havaldar, Soumna Nema, Lyle Ungar, Sharath Chandra Guntuku

    Abstract: Social emotions such as shame and pride reflect social sanctions or approvals in society. In this paper, we examine how expressions of shame and pride vary across cultures and harness them to extract unspoken normative expectations across cultures. We introduce the first cross-cultural shame/pride emotions movie dialogue dataset, obtained from ~5.4K Bollywood and Hollywood movies, along with over… ▽ More

    Submitted 16 June, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  3. arXiv:2310.10092  [pdf, ps, other

    cs.LG stat.ML

    Label Differential Privacy via Aggregation

    Authors: Anand Brahmbhatt, Rishi Saket, Shreyas Havaldar, Anshul Nasery, Aravindan Raghuveer

    Abstract: In many real-world applications, due to recent developments in the privacy landscape, training data may be aggregated to preserve the privacy of sensitive training labels. In the learning from label proportions (LLP) framework, the dataset is partitioned into bags of feature-vectors which are available only with the sum of the labels per bag. A further restriction, which we call learning from bag… ▽ More

    Submitted 27 November, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

  4. arXiv:2310.08056  [pdf, other

    cs.LG cs.AI

    Learning from Label Proportions: Bootstrapping Supervised Learners via Belief Propagation

    Authors: Shreyas Havaldar, Navodita Sharma, Shubhi Sareen, Karthikeyan Shanmugam, Aravindan Raghuveer

    Abstract: Learning from Label Proportions (LLP) is a learning problem where only aggregate level labels are available for groups of instances, called bags, during training, and the aim is to get the best performance at the instance-level on the test data. This setting arises in domains like advertising and medicine due to privacy considerations. We propose a novel algorithmic framework for this problem that… ▽ More

    Submitted 20 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Published as a conference paper at The Twelfth International Conference on Learning Representations (ICLR 2024) & Oral Presentation at Regulatable ML @ NeurIPS 2023

  5. arXiv:2310.07535  [pdf, other

    cs.LG cs.AI

    Fairness under Covariate Shift: Improving Fairness-Accuracy tradeoff with few Unlabeled Test Samples

    Authors: Shreyas Havaldar, Jatin Chauhan, Karthikeyan Shanmugam, Jay Nandy, Aravindan Raghuveer

    Abstract: Covariate shift in the test data is a common practical phenomena that can significantly downgrade both the accuracy and the fairness performance of the model. Ensuring fairness across different sensitive groups under covariate shift is of paramount importance due to societal implications like criminal justice. We operate in the unsupervised regime where only a small set of unlabeled test samples a… ▽ More

    Submitted 8 January, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Accepted at The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)

  6. arXiv:2310.07135  [pdf, other

    cs.CL

    Comparing Styles across Languages

    Authors: Shreya Havaldar, Matthew Pressimone, Eric Wong, Lyle Ungar

    Abstract: Understanding how styles differ across languages is advantageous for training both humans and computers to generate culturally appropriate text. We introduce an explanation framework to extract stylistic differences from multilingual LMs and compare styles across languages. Our framework (1) generates comprehensive style lexica in any language and (2) consolidates feature importances from LMs into… ▽ More

    Submitted 4 December, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023

  7. arXiv:2307.01370  [pdf, other

    cs.CL

    Multilingual Language Models are not Multicultural: A Case Study in Emotion

    Authors: Shreya Havaldar, Sunny Rai, Bhumika Singhal, Langchen Liu, Sharath Chandra Guntuku, Lyle Ungar

    Abstract: Emotions are experienced and expressed differently across the world. In order to use Large Language Models (LMs) for multilingual tasks that require emotional sensitivity, LMs must reflect this cultural variation in emotion. In this study, we investigate whether the widely-used multilingual LMs in 2023 reflect differences in emotional expressions across cultures and languages. We find that embeddi… ▽ More

    Submitted 9 July, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: Accepted to WASSA at ACL 2023

  8. arXiv:2306.00976  [pdf, other

    cs.CL

    TopEx: Topic-based Explanations for Model Comparison

    Authors: Shreya Havaldar, Adam Stein, Eric Wong, Lyle Ungar

    Abstract: Meaningfully comparing language models is challenging with current explanation methods. Current explanations are overwhelming for humans due to large vocabularies or incomparable across models. We present TopEx, an explanation method that enables a level playing field for comparing language models via model-agnostic topics. We demonstrate how TopEx can identify similarities and differences between… ▽ More

    Submitted 1 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to ICLR 2023, Tiny Papers Track

  9. arXiv:2305.14757  [pdf, other

    cs.CL

    Psychological Metrics for Dialog System Evaluation

    Authors: Salvatore Giorgi, Shreya Havaldar, Farhan Ahmed, Zuhaib Akhtar, Shalaka Vaidya, Gary Pan, Lyle H. Ungar, H. Andrew Schwartz, Joao Sedoc

    Abstract: We present metrics for evaluating dialog systems through a psychologically-grounded "human" lens in which conversational agents express a diversity of both states (e.g., emotion) and traits (e.g., personality), just as people do. We present five interpretable metrics from established psychology that are fundamental to human communication and relationships: emotional entropy, linguistic style and e… ▽ More

    Submitted 15 September, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  10. arXiv:2301.13379  [pdf, other

    cs.CL

    Faithful Chain-of-Thought Reasoning

    Authors: Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong, Marianna Apidianaki, Chris Callison-Burch

    Abstract: While Chain-of-Thought (CoT) prompting boosts Language Models' (LM) performance on a gamut of complex reasoning tasks, the generated reasoning chain does not necessarily reflect how the model arrives at the answer (aka. faithfulness). We propose Faithful CoT, a reasoning framework involving two stages: Translation (Natural Language query $\rightarrow$ symbolic reasoning chain) and Problem Solving… ▽ More

    Submitted 20 September, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: IJCNLP-AACL 2023 camera-ready version

  11. arXiv:2003.06566  [pdf, other

    cs.LG cs.CV stat.ML

    On the benefits of defining vicinal distributions in latent space

    Authors: Puneet Mangla, Vedant Singh, Shreyas Jayant Havaldar, Vineeth N Balasubramanian

    Abstract: The vicinal risk minimization (VRM) principle is an empirical risk minimization (ERM) variant that replaces Dirac masses with vicinal functions. There is strong numerical and theoretical evidence showing that VRM outperforms ERM in terms of generalization if appropriate vicinal functions are chosen. Mixup Training (MT), a popular choice of vicinal distribution, improves the generalization performa… ▽ More

    Submitted 18 October, 2021; v1 submitted 14 March, 2020; originally announced March 2020.

    Comments: Accepted at Elsevier Pattern Recognition Letters (2021), Best Paper Award at CVPR 2021 Workshop on Adversarial Machine Learning in Real-World Computer Vision (AML-CV), Also accepted at ICLR 2021 Workshops on Robust-Reliable Machine Learning (Oral) and Generalization beyond the training distribution (Abstract)