Showing 1–4 of 4 results for author: Anantheswaran, U

  1. arXiv:2406.15444  [pdf, other]

    cs.CL

    Investigating the Robustness of LLMs on Math Word Problems

    Authors: Ujjwala Anantheswaran, Himanshu Gupta, Kevin Scaria, Shreyas Verma, Chitta Baral, Swaroop Mishra

    Abstract: Large Language Models (LLMs) excel at various tasks, including solving math word problems (MWPs), but struggle with real-world problems containing irrelevant information. To address this, we propose a prompting framework that generates adversarial variants of MWPs by adding irrelevant variables. We introduce a dataset, ProbleMATHIC, containing both adversarial and non-adversarial MWPs. Our experim…

    Submitted 30 May, 2024; originally announced June 2024.

  2. arXiv:2310.17876  [pdf, other]

    cs.CL

    TarGEN: Targeted Data Generation with Large Language Models

    Authors: Himanshu Gupta, Kevin Scaria, Ujjwala Anantheswaran, Shreyas Verma, Mihir Parmar, Saurabh Arjun Sawant, Chitta Baral, Swaroop Mishra

    Abstract: The rapid advancement of large language models (LLMs) has sparked interest in data synthesis techniques, aiming to generate diverse and high-quality synthetic datasets. However, these synthetic datasets often suffer from a lack of diversity and added noise. In this paper, we present TarGEN, a multi-step prompting strategy for generating high-quality synthetic datasets utilizing an LLM. An advantage…

    Submitted 8 August, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: COLM 2024, 35 pages

  3. arXiv:2305.16357  [pdf, other]

    cs.CL

    EDM3: Event Detection as Multi-task Text Generation

    Authors: Ujjwala Anantheswaran, Himanshu Gupta, Mihir Parmar, Kuntal Kumar Pal, Chitta Baral

    Abstract: Event detection refers to identifying event occurrences in a text and comprises two subtasks: event identification and classification. We present EDM3, a novel approach for Event Detection that formulates three generative tasks: identification, classification, and combined detection. We show that EDM3 helps to learn transferable knowledge that can be leveraged to perform Event Detection and its…

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 9 pages, 4 figures, 10 tables, 5-page appendix

  4. arXiv:2302.10346  [pdf, other]

    cs.CL cs.AI cs.CR

    Exploring the Limits of Transfer Learning with Unified Model in the Cybersecurity Domain

    Authors: Kuntal Kumar Pal, Kazuaki Kashihara, Ujjwala Anantheswaran, Kirby C. Kuznia, Siddhesh Jagtap, Chitta Baral

    Abstract: With the increase in cybersecurity vulnerabilities of software systems, the ways to exploit them are also increasing. Besides these, malware threats, irregular network interactions, and discussions about exploits in public forums are also on the rise. To identify these threats faster, to detect potentially relevant entities from any texts, and to be aware of software vulnerabilities, automated app…

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: 8 pages