Showing 1–13 of 13 results for author: Sathe, A

Searching in archive cs.
  1. arXiv:2407.03678  [pdf, other]

    cs.CL cs.LG

    Improving Self Consistency in LLMs through Probabilistic Tokenization

    Authors: Ashutosh Sathe, Divyanshu Aggarwal, Sunayana Sitaram

    Abstract: Prior research has demonstrated noticeable performance gains through the use of probabilistic tokenizations, an approach that involves employing multiple tokenizations of the same input string during the training phase of a language model. Despite these promising findings, modern large language models (LLMs) have yet to be trained using probabilistic tokenizations. Interestingly, while the tokeniz…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: ICML 2024 Workshop on LLMs and Cognition

  2. arXiv:2407.02819  [pdf, other]

    cs.CL cs.LG

    Efficient Training of Language Models with Compact and Consistent Next Token Distributions

    Authors: Ashutosh Sathe, Sunita Sarawagi

    Abstract: Maximizing the likelihood of the next token is an established, statistically sound objective for pre-training language models. In this paper we show that we can train better models faster by pre-aggregating the corpus with a collapsed $n$-gram distribution. Previous studies have proposed corpus-level $n$-gram statistics as a regularizer; however, the construction and querying of such $n$-grams, if…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: ACL 2024

  3. arXiv:2405.09605  [pdf, other]

    cs.CL cs.AI cs.LG

    Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language models

    Authors: Anna A. Ivanova, Aalok Sathe, Benjamin Lipkin, Unnathi Kumar, Setayesh Radkani, Thomas H. Clark, Carina Kauf, Jennifer Hu, R. T. Pramod, Gabriel Grand, Vivian Paulun, Maria Ryskina, Ekin Akyürek, Ethan Wilcox, Nafisa Rashid, Leshem Choshen, Roger Levy, Evelina Fedorenko, Joshua Tenenbaum, Jacob Andreas

    Abstract: The ability to build and leverage world models is essential for a general-purpose AI agent. Testing such capabilities is hard, in part because the building blocks of world models are ill-defined. We present Elements of World Knowledge (EWOK), a framework for evaluating world modeling in language models by testing their ability to use knowledge of a concept to match a target text with a plausible/i…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 21 pages (11 main), 7 figures. Authors Anna Ivanova, Aalok Sathe, Benjamin Lipkin contributed equally

  4. arXiv:2402.13636  [pdf, other]

    cs.CV cs.CL cs.CY

    A Unified Framework and Dataset for Assessing Societal Bias in Vision-Language Models

    Authors: Ashutosh Sathe, Prachi Jain, Sunayana Sitaram

    Abstract: Vision-language models (VLMs) have gained widespread adoption in both industry and academia. In this study, we propose a unified framework for systematically evaluating gender, race, and age biases in VLMs with respect to professions. Our evaluation encompasses all supported inference modes of the recent VLMs, including image-to-text, text-to-text, text-to-image, and image-to-image. Additionally,…

    Submitted 17 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  5. arXiv:2402.07519  [pdf, other]

    cs.CL cs.CY

    MAFIA: Multi-Adapter Fused Inclusive LanguAge Models

    Authors: Prachi Jain, Ashutosh Sathe, Varun Gumma, Kabir Ahuja, Sunayana Sitaram

    Abstract: Pretrained Language Models (PLMs) are widely used in NLP for various tasks. Recent studies have identified various biases that such models exhibit and have proposed methods to correct these biases. However, most of the works address a limited set of bias dimensions independently such as gender, race, or religion. Moreover, the methods typically involve finetuning the full model to maintain the per…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to EACL 2024

  6. arXiv:2401.07598  [pdf, other]

    cs.CL

    MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models

    Authors: Divyanshu Aggarwal, Ashutosh Sathe, Ishaan Watts, Sunayana Sitaram

    Abstract: Parameter Efficient Finetuning (PEFT) has emerged as a viable solution for improving the performance of Large Language Models (LLMs) without requiring massive resources and compute. Prior work on multilingual evaluation has shown that there is a large gap between the performance of LLMs on English and other languages. Further, there is also a large gap between the performance of smaller open-sourc…

    Submitted 20 February, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: 46 pages, 23 figures, 45 tables

  7. arXiv:2311.07463  [pdf, other]

    cs.CL

    MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks

    Authors: Sanchit Ahuja, Divyanshu Aggarwal, Varun Gumma, Ishaan Watts, Ashutosh Sathe, Millicent Ochieng, Rishav Hada, Prachi Jain, Maxamed Axmed, Kalika Bali, Sunayana Sitaram

    Abstract: There has been a surge in LLM evaluation research to understand LLM capabilities and limitations. However, much of this research has been confined to English, leaving LLM building and evaluation for non-English languages relatively unexplored. Several new LLMs have been introduced recently, necessitating their evaluation on non-English languages. This study aims to perform a thorough evaluation of…

    Submitted 2 April, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: 40 pages, 35 figures and 34 tables

  8. arXiv:2310.16071  [pdf]

    cs.LG cs.AI

    Grid Frequency Forecasting in University Campuses using Convolutional LSTM

    Authors: Aneesh Sathe, Wen Ren Yang

    Abstract: The modern power grid is facing increasing complexities, primarily stemming from the integration of renewable energy sources and evolving consumption patterns. This paper introduces an innovative methodology that harnesses Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) networks to establish robust time series forecasting models for grid frequency. These models effectively ca…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 9 pages, 20 figures

  9. arXiv:2310.13659  [pdf, other]

    cs.CL

    Benchmarking and Improving Text-to-SQL Generation under Ambiguity

    Authors: Adithya Bhaskar, Tushar Tomar, Ashutosh Sathe, Sunita Sarawagi

    Abstract: Research in Text-to-SQL conversion has been largely benchmarked against datasets where each text query corresponds to one correct SQL. However, natural language queries over real-life databases frequently involve significant ambiguity about the intended SQL due to overlapping schema names and multiple confusing relationship paths. To bridge this gap, we develop a novel benchmark called AmbiQT with…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: To appear at EMNLP 2023 (Main)

  10. arXiv:2210.16613  [pdf, other]

    cs.CL cs.AI cs.LG

    Diverse Parallel Data Synthesis for Cross-Database Adaptation of Text-to-SQL Parsers

    Authors: Abhijeet Awasthi, Ashutosh Sathe, Sunita Sarawagi

    Abstract: Text-to-SQL parsers typically struggle with databases unseen during the train time. Adapting parsers to new databases is a challenging problem due to the lack of natural language queries in the new schemas. We present ReFill, a framework for synthesizing high-quality and textually diverse parallel datasets for adapting a Text-to-SQL parser to a target schema. ReFill learns to retrieve-and-edit tex…

    Submitted 29 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  11. arXiv:2110.02386  [pdf]

    cs.CL cs.AI

    Analyzing the Effects of Reasoning Types on Cross-Lingual Transfer Performance

    Authors: Karthikeyan K, Aalok Sathe, Somak Aditya, Monojit Choudhury

    Abstract: Multilingual language models achieve impressive zero-shot accuracies in many languages in complex tasks such as Natural Language Inference (NLI). Examples in NLI (and equivalent complex tasks) often pertain to various types of sub-tasks, requiring different kinds of reasoning. Certain types of reasoning have proven to be more difficult to learn in a monolingual context, and in the crosslingual con…

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: Workshop on Multilingual Representation Learning (MRL 2021), at Empirical Methods in Natural Language Processing (EMNLP 2021)

  12. arXiv:2106.06566  [pdf, other]

    cs.CL

    Sample-efficient Linguistic Generalizations through Program Synthesis: Experiments with Phonology Problems

    Authors: Saujas Vaduguru, Aalok Sathe, Monojit Choudhury, Dipti Misra Sharma

    Abstract: Neural models excel at extracting statistical patterns from large amounts of data, but struggle to learn patterns or reason about language from only a few examples. In this paper, we ask: Can we learn explicit rules that generalize well from only a few examples? We explore this question using program synthesis. We develop a synthesis model to learn phonology rules as programs in a domain-specific…

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: SIGMORPHON 2021

  13. arXiv:2009.14505  [pdf, other]

    cs.AI cs.CL

    TaxiNLI: Taking a Ride up the NLU Hill

    Authors: Pratik Joshi, Somak Aditya, Aalok Sathe, Monojit Choudhury

    Abstract: Pre-trained Transformer-based neural architectures have consistently achieved state-of-the-art performance in the Natural Language Inference (NLI) task. Since NLI examples encompass a variety of linguistic, logical, and reasoning phenomena, it remains unclear as to which specific concepts are learnt by the trained systems and where they can achieve strong generalization. To investigate this questi…

    Submitted 9 October, 2020; v1 submitted 30 September, 2020; originally announced September 2020.

    Comments: 15 pages, 9 figures, 4 tables. Accepted at CoNLL 2020