Zum Hauptinhalt springen

Showing 1–17 of 17 results for author: Potdar, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.05948  [pdf, other

    cs.CL cs.IR cs.LG

    ConvKGYarn: Spinning Configurable and Scalable Conversational Knowledge Graph QA datasets with Large Language Models

    Authors: Ronak Pradeep, Daniel Lee, Ali Mousavi, Jeff Pound, Yisi Sang, Jimmy Lin, Ihab Ilyas, Saloni Potdar, Mostafa Arefiyan, Yunyao Li

    Abstract: The rapid advancement of Large Language Models (LLMs) and conversational assistants necessitates dynamic, scalable, and configurable conversational datasets for training and evaluation. These datasets must accommodate diverse user interaction modes, including text and voice, each presenting unique modeling challenges. Knowledge Graphs (KGs), with their structured and evolving nature, offer an idea… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  2. arXiv:2405.15028  [pdf, other

    cs.CL cs.IR

    AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings

    Authors: Revanth Gangi Reddy, Omar Attia, Yunyao Li, Heng Ji, Saloni Potdar

    Abstract: Ranking is a fundamental and popular problem in search. However, existing ranking algorithms usually restrict the granularity of ranking to full passages or require a specific dense index for each desired level of granularity. Such lack of flexibility in granularity negatively affects many applications that can benefit from more granular ranking, such as sentence-level ranking for open-domain ques… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2404.01626  [pdf, other

    cs.CL cs.IR

    Entity Disambiguation via Fusion Entity Decoding

    Authors: Junxiong Wang, Ali Mousavi, Omar Attia, Ronak Pradeep, Saloni Potdar, Alexander M. Rush, Umar Farooq Minhas, Yunyao Li

    Abstract: Entity disambiguation (ED), which links the mentions of ambiguous entities to their referent entities in a knowledge base, serves as a core component in entity linking (EL). Existing generative approaches demonstrate improved accuracy compared to classification approaches under the standardized ZELDA benchmark. Nevertheless, generative approaches suffer from the need for large-scale pre-training a… ▽ More

    Submitted 7 May, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted at NAACL'24 main

  4. arXiv:2301.06544  [pdf, other

    cs.CL

    Distinguish Sense from Nonsense: Out-of-Scope Detection for Virtual Assistants

    Authors: Cheng Qian, Haode Qi, Gengyu Wang, Ladislav Kunc, Saloni Potdar

    Abstract: Out of Scope (OOS) detection in Conversational AI solutions enables a chatbot to handle a conversation gracefully when it is unable to make sense of the end-user query. Accurately tagging a query as out-of-domain is particularly hard in scenarios when the chatbot is not equipped to handle a topic which has semantic overlap with an existing topic it is trained on. We propose a simple yet effective… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: Accepted to EMNLP 2022 Industry Track

  5. arXiv:2205.14226  [pdf, other

    cs.IR cs.CL

    Fast and Light-Weight Answer Text Retrieval in Dialogue Systems

    Authors: Hui Wan, Siva Sankalp Patel, J. William Murdock, Saloni Potdar, Sachindra Joshi

    Abstract: Dialogue systems can benefit from being able to search through a corpus of text to find information relevant to user requests, especially when encountering a request for which no manually curated response is available. The state-of-the-art technology for neural dense retrieval or re-ranking involves deep learning models with hundreds of millions of parameters. However, it is difficult and expensiv… ▽ More

    Submitted 31 May, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted in NAACL-HLT 2022 Industry Track

  6. arXiv:2107.10137  [pdf, other

    cs.CL

    Improved Text Classification via Contrastive Adversarial Training

    Authors: Lin Pan, Chung-Wei Hang, Avirup Sil, Saloni Potdar

    Abstract: We propose a simple and general method to regularize the fine-tuning of Transformer-based encoders for text classification tasks. Specifically, during fine-tuning we generate adversarial examples by perturbing the word embeddings of the model and perform contrastive learning on clean and adversarial examples in order to teach the model to learn noise-invariant representations. By training on both… ▽ More

    Submitted 17 February, 2022; v1 submitted 21 July, 2021; originally announced July 2021.

  7. arXiv:2106.03826  [pdf, other

    cs.CL cs.AI

    Narrative Question Answering with Cutting-Edge Open-Domain QA Techniques: A Comprehensive Study

    Authors: Xiangyang Mou, Chenghao Yang, Mo Yu, Bingsheng Yao, Xiaoxiao Guo, Saloni Potdar, Hui Su

    Abstract: Recent advancements in open-domain question answering (ODQA), i.e., finding answers from large open-domain corpus like Wikipedia, have led to human-level performance on many datasets. However, progress in QA over book stories (Book QA) lags behind despite its similar task formulation to ODQA. This work provides a comprehensive and quantitative analysis about the difficulty of Book QA: (1) We bench… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: Accepted to TACL

  8. arXiv:2104.03424  [pdf, other

    cs.CV

    Track, Check, Repeat: An EM Approach to Unsupervised Tracking

    Authors: Adam W. Harley, Yiming Zuo, Jing Wen, Ayush Mangal, Shubhankar Potdar, Ritwick Chaudhry, Katerina Fragkiadaki

    Abstract: We propose an unsupervised method for detecting and tracking moving objects in 3D, in unlabelled RGB-D videos. The method begins with classic handcrafted techniques for segmenting objects using motion cues: we estimate optical flow and camera motion, and conservatively segment regions that appear to be moving independently of the background. Treating these initial segments as pseudo-labels, we lea… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

  9. arXiv:2012.03929  [pdf, other

    cs.CL

    Benchmarking Commercial Intent Detection Services with Practice-Driven Evaluations

    Authors: Haode Qi, Lin Pan, Atin Sood, Abhishek Shah, Ladislav Kunc, Mo Yu, Saloni Potdar

    Abstract: Intent detection is a key component of modern goal-oriented dialog systems that accomplish a user task by predicting the intent of users' text input. There are three primary challenges in designing robust and accurate intent detection models. First, typical intent detection models require a large amount of labeled data to achieve high accuracy. Unfortunately, in practical scenarios it is more comm… ▽ More

    Submitted 2 June, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

    Comments: Accepted at NAACL2021 Industry Track

  10. arXiv:2010.16279  [pdf, other

    cs.CV

    3D Object Recognition By Corresponding and Quantizing Neural 3D Scene Representations

    Authors: Mihir Prabhudesai, Shamit Lal, Hsiao-Yu Fish Tung, Adam W. Harley, Shubhankar Potdar, Katerina Fragkiadaki

    Abstract: We propose a system that learns to detect objects and infer their 3D poses in RGB-D images. Many existing systems can identify objects and infer 3D poses, but they heavily rely on human labels and 3D annotations. The challenge here is to achieve this without relying on strong supervision signals. To address this challenge, we propose a model that maps RGB-D images to a set of 3D visual feature map… ▽ More

    Submitted 30 October, 2020; originally announced October 2020.

  11. arXiv:2010.12547  [pdf, other

    cs.CL

    Multilingual BERT Post-Pretraining Alignment

    Authors: Lin Pan, Chung-Wei Hang, Haode Qi, Abhishek Shah, Saloni Potdar, Mo Yu

    Abstract: We propose a simple method to align multilingual contextual embeddings as a post-pretraining step for improved zero-shot cross-lingual transferability of the pretrained models. Using parallel data, our method aligns embeddings on the word level through the recently proposed Translation Language Modeling objective as well as on the sentence level via contrastive learning and random input shuffling.… ▽ More

    Submitted 10 April, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted at NAACL2021

  12. arXiv:2007.09878  [pdf, ps, other

    cs.CL cs.IR

    Frustratingly Hard Evidence Retrieval for QA Over Books

    Authors: Xiangyang Mou, Mo Yu, Bingsheng Yao, Chenghao Yang, Xiaoxiao Guo, Saloni Potdar, Hui Su

    Abstract: A lot of progress has been made to improve question answering (QA) in recent years, but the special problem of QA over narrative book stories has not been explored in-depth. We formulate BookQA as an open-domain QA task given its similar dependency on evidence retrieval. We further investigate how state-of-the-art open-domain QA approaches can help BookQA. Besides achieving state-of-the-art on the… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: ACL 2020 NUSE Workshop, 6 pages

  13. arXiv:1909.05357  [pdf, other

    cs.CL cs.LG stat.ML

    Out-of-Domain Detection for Low-Resource Text Classification Tasks

    Authors: Ming Tan, Yang Yu, Haoyu Wang, Dakuo Wang, Saloni Potdar, Shiyu Chang, Mo Yu

    Abstract: Out-of-domain (OOD) detection for low-resource text classification is a realistic but understudied task. The goal is to detect the OOD cases with limited in-domain (ID) training data, since we observe that training data is often insufficient in machine learning applications. In this work, we propose an OOD-resistant Prototypical Network to tackle this zero-shot OOD detection and few-shot ID classi… ▽ More

    Submitted 31 August, 2019; originally announced September 2019.

    Comments: EMNLP 2019

  14. arXiv:1902.01030  [pdf, other

    cs.CL cs.AI

    Extracting Multiple-Relations in One-Pass with Pre-Trained Transformers

    Authors: Haoyu Wang, Ming Tan, Mo Yu, Shiyu Chang, Dakuo Wang, Kun Xu, Xiaoxiao Guo, Saloni Potdar

    Abstract: Most approaches to extraction multiple relations from a paragraph require multiple passes over the paragraph. In practice, multiple passes are computationally expensive and this makes difficult to scale to longer paragraphs and larger text corpora. In this work, we focus on the task of multiple relation extraction by encoding the paragraph only once (one-pass). We build our solution on the pre-tra… ▽ More

    Submitted 3 June, 2019; v1 submitted 3 February, 2019; originally announced February 2019.

    Comments: 7 pages

  15. arXiv:1805.07513  [pdf, other

    cs.CL cs.LG

    Diverse Few-Shot Text Classification with Multiple Metrics

    Authors: Mo Yu, Xiaoxiao Guo, Jinfeng Yi, Shiyu Chang, Saloni Potdar, Yu Cheng, Gerald Tesauro, Haoyu Wang, Bowen Zhou

    Abstract: We study few-shot learning in natural language domains. Compared to many existing works that apply either metric-based or optimization-based meta-learning to image domain with low inter-task variance, we consider a more realistic setting, where tasks are diverse. However, it imposes tremendous difficulties to existing state-of-the-art metric-based algorithms since a single metric is insufficient t… ▽ More

    Submitted 19 May, 2018; originally announced May 2018.

    Comments: NAACL 2018. 11+5 pages. arXiv admin note: text overlap with arXiv:1708.07918

  16. arXiv:1708.07918  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Robust Task Clustering for Deep Many-Task Learning

    Authors: Mo Yu, Xiaoxiao Guo, Jinfeng Yi, Shiyu Chang, Saloni Potdar, Gerald Tesauro, Haoyu Wang, Bowen Zhou

    Abstract: We investigate task clustering for deep-learning based multi-task and few-shot learning in a many-task setting. We propose a new method to measure task similarities with cross-task transfer performance matrix for the deep learning scenario. Although this matrix provides us critical information regarding similarity between tasks, its asymmetric property and unreliable performance scores can affect… ▽ More

    Submitted 17 May, 2018; v1 submitted 25 August, 2017; originally announced August 2017.

  17. arXiv:1701.04027  [pdf, other

    cs.CL

    Neural Models for Sequence Chunking

    Authors: Feifei Zhai, Saloni Potdar, Bing Xiang, Bowen Zhou

    Abstract: Many natural language understanding (NLU) tasks, such as shallow parsing (i.e., text chunking) and semantic slot filling, require the assignment of representative labels to the meaningful chunks in a sentence. Most of the current deep neural network (DNN) based methods consider these tasks as a sequence labeling problem, in which a word, rather than a chunk, is treated as the basic unit for labeli… ▽ More

    Submitted 15 January, 2017; originally announced January 2017.

    Comments: Accepted by AAAI 2017