
Showing 1–24 of 24 results for author: Mazumder, S

Searching in archive cs.
  1. arXiv:2408.03907  [pdf, other]

    cs.CL cs.AI

    Decoding Biases: Automated Methods and LLM Judges for Gender Bias Detection in Language Models

    Authors: Shachi H Kumar, Saurav Sahay, Sahisnu Mazumder, Eda Okur, Ramesh Manuvinakurike, Nicole Beckage, Hsuan Su, Hung-yi Lee, Lama Nachman

    Abstract: Large Language Models (LLMs) have excelled at language understanding and generating human-level text. However, even with supervised training and human alignment, these LLMs are susceptible to adversarial attacks where malicious users can prompt the model to generate undesirable text. LLMs also inherently encode potential biases that can cause various harmful effects during interactions. Bias evalu…

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 6 pages paper content, 17 pages of appendix

  2. arXiv:2405.19414  [pdf, other]

    cs.LG

    Safety through Permissibility: Shield Construction for Fast and Safe Reinforcement Learning

    Authors: Alexander Politowicz, Sahisnu Mazumder, Bing Liu

    Abstract: Designing Reinforcement Learning (RL) solutions for real-life problems remains a significant challenge. A major area of concern is safety. "Shielding" is a popular technique to enforce safety in RL by turning user-defined safety specifications into safe agent behavior. However, these methods either suffer from extreme learning delays, demand extensive human effort in designing models and safe doma…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 9 pages, 3 figures

  3. arXiv:2404.19195  [pdf]

    eess.SY cs.AR physics.app-ph

    Evaluation of Thermal Performance of a Wick-free Vapor Chamber in Power Electronics Cooling

    Authors: Arani Mukhopadhyay, Anish Pal, Congbo Bao, Mohamad Jafari Gukeh, Sudip K. Mazumder, Constantine M. Megaridis

    Abstract: Efficient thermal management in high-power electronics cooling can be achieved using phase-change heat transfer devices, such as vapor chambers. Traditional vapor chambers use wicks to transport condensate for efficient thermal exchange and to prevent "dry-out" of the evaporator. However, wicks in vapor chambers present significant design challenges arising out of large pressure drops across the w…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Presented at IEEE ITherm (Intersociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems) 2023, Orlando FL. Corresponding author: [email protected]

  4. arXiv:2305.19400  [pdf, other]

    cs.CE

    Automating GPU Scalability for Complex Scientific Models: Phonon Boltzman Transport Equation

    Authors: Eric Heisler, Siddharth Saurav, Aadesh Deshmukh, Sandip Mazumder, Ponnuswamy Sadayappan, Hari Sundar

    Abstract: Heterogeneous computing environments combining CPU and GPU resources provide a great boost to large-scale scientific computing applications. Code generation utilities that partition the work into CPU and GPU tasks while considering data movement costs allow researchers to more quickly and easily develop high-performance solutions, and make these resources accessible to a larger user base. We pre…

    Submitted 30 May, 2023; originally announced May 2023.

  5. arXiv:2303.04361  [pdf, other]

    cs.CL cs.CV

    Sample Efficient Multimodal Semantic Augmentation for Incremental Summarization

    Authors: Sumanta Bhattacharyya, Ramesh Manuvinakurike, Sahisnu Mazumder, Saurav Sahay

    Abstract: In this work, we develop a prompting approach for incremental summarization of task videos. We develop a sample-efficient few-shot approach for extracting semantic concepts as an intermediate step. We leverage an existing model for extracting the concepts from the images and extend it to videos and introduce a clustering and querying approach for sample efficiency, motivated by the recent advances…

    Submitted 7 March, 2023; originally announced March 2023.

  6. arXiv:2302.05888  [pdf, other]

    cs.CL cs.AI cs.LG

    Position Matters! Empirical Study of Order Effect in Knowledge-grounded Dialogue

    Authors: Hsuan Su, Shachi H Kumar, Sahisnu Mazumder, Wenda Chen, Ramesh Manuvinakurike, Eda Okur, Saurav Sahay, Lama Nachman, Shang-Tse Chen, Hung-yi Lee

    Abstract: With the power of large pretrained language models, various research works have integrated knowledge into dialogue systems. The traditional techniques treat knowledge as part of the input sequence for the dialogue system, prepending a set of knowledge statements in front of dialogue history. However, such a mechanism forces knowledge sets to be concatenated in an ordered manner, making models impl…

    Submitted 12 February, 2023; originally announced February 2023.

  7. arXiv:2211.06553  [pdf, other]

    cs.CL cs.AI cs.HC cs.LG

    Lifelong and Continual Learning Dialogue Systems

    Authors: Sahisnu Mazumder, Bing Liu

    Abstract: Dialogue systems, commonly known as chatbots, have gained escalating popularity in recent times due to their wide-spread applications in carrying out chit-chat conversations with users and task-oriented dialogues to accomplish various user tasks. Existing chatbots are usually trained from pre-collected and manually-labeled data and/or written with handcrafted rules. Many also use manually-compiled…

    Submitted 16 June, 2024; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: Book - draft version. arXiv admin note: text overlap with arXiv:2203.08994 ... Full version of this book has been published by Springer Nature and is now available on Web

    Journal ref: Springer Nature 2024

  8. arXiv:2210.17440  [pdf, other]

    cs.CL

    Semantic Novelty Detection and Characterization in Factual Text Involving Named Entities

    Authors: Nianzu Ma, Sahisnu Mazumder, Alexander Politowicz, Bing Liu, Eric Robertson, Scott Grigsby

    Abstract: Much of the existing work on text novelty detection has been studied at the topic level, i.e., identifying whether the topic of a document or a sentence is novel or not. Little work has been done at the fine-grained semantic level (or contextual level). For example, given that we know Elon Musk is the CEO of a technology company, the sentence "Elon Musk acted in the sitcom The Big Bang Theory" is…

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: 28 pages, 2 figures

    ACM Class: I.2.7

  9. arXiv:2210.15670  [pdf, other]

    cs.LG cs.AI

    Knowledge-Guided Exploration in Deep Reinforcement Learning

    Authors: Sahisnu Mazumder, Bing Liu, Shuai Wang, Yingxuan Zhu, Xiaotian Yin, Lifeng Liu, Jian Li

    Abstract: This paper proposes a new method to drastically speed up deep reinforcement learning (deep RL) training for problems that have the property of state-action permissibility (SAP). Two types of permissibility are defined under SAP. The first type says that after an action $a_t$ is performed in a state $s_t$ and the agent has reached the new state $s_{t+1}$, the agent can decide whether $a_t$ is permi…

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: This paper is an extended and revised version of the work: "Action permissibility in deep reinforcement learning and application to autonomous driving", KDD'18 Deep Learning Day (2018)

  10. arXiv:2203.08994  [pdf]

    cs.AI cs.CL cs.HC cs.LG

    AI Autonomy : Self-Initiated Open-World Continual Learning and Adaptation

    Authors: Bing Liu, Sahisnu Mazumder, Eric Robertson, Scott Grigsby

    Abstract: As more and more AI agents are used in practice, it is time to think about how to make these agents fully autonomous so that they can (1) learn by themselves continually in a self-motivated and self-initiated manner rather than being retrained offline periodically on the initiation of human engineers and (2) accommodate or adapt to unexpected or novel circumstances. As the real-world is an open en…

    Submitted 19 April, 2023; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: To appear in AI Magazine (AAAI), 2023. This draft is an extended and revised version of the previous work - "Self-initiated Open World Learning for Autonomous AI Agents" arXiv preprint arXiv:2110.11385 (2021), which was published in AAAI 2022 Spring Symposium Series

  11. arXiv:2110.11385  [pdf, ps, other]

    cs.AI cs.HC cs.LG

    Self-Initiated Open World Learning for Autonomous AI Agents

    Authors: Bing Liu, Eric Robertson, Scott Grigsby, Sahisnu Mazumder

    Abstract: As more and more AI agents are used in practice, it is time to think about how to make these agents fully autonomous so that they can learn by themselves in a self-motivated and self-supervised manner rather than being retrained periodically on the initiation of human engineers using expanded training data. As the real-world is an open environment with unknowns or novelties, detecting novelties or…

    Submitted 28 February, 2024; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: Published in AAAI 2022 Spring Symposium Series

  12. arXiv:2106.13959  [pdf, other]

    stat.ML cs.CV cs.LG

    Functional Classwise Principal Component Analysis: A Novel Classification Framework

    Authors: Avishek Chatterjee, Satyaki Mazumder, Koel Das

    Abstract: In recent times, functional data analysis (FDA) has been successfully applied in the field of high dimensional data classification. In this paper, we present a novel classification framework using functional data and classwise Principal Component Analysis (PCA). Our proposed method can be used in high dimensional time series data which typically suffers from small sample size problem. Our method e…

    Submitted 26 June, 2021; originally announced June 2021.

  13. Deep Image Compositing

    Authors: Shivangi Aneja, Soham Mazumder

    Abstract: In image editing, the most common task is pasting objects from one image to the other and then eventually adjusting the manifestation of the foreground object with the background object. This task is called image compositing. But image compositing is a challenging problem that requires professional editing skills and a considerable amount of time. Not only these professionals are expensive to hire…

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: ESSE 2020: Proceedings of the 2020 European Symposium on Software Engineering

    Journal ref: In Proceedings of the 2020 European Symposium on Software Engineering (pp. 101-104) 2020

  14. arXiv:2010.12844  [pdf, other]

    cs.CL cs.AI

    FLIN: A Flexible Natural Language Interface for Web Navigation

    Authors: Sahisnu Mazumder, Oriana Riva

    Abstract: AI assistants can now carry out tasks for users by directly interacting with website UIs. Current semantic parsing and slot-filling techniques cannot flexibly adapt to many different websites without being constantly re-trained. We propose FLIN, a natural language interface for web navigation that maps user commands to concept-level actions (rather than low-level UI actions), thus being able to fl…

    Submitted 13 April, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: Accepted to NAACL-HLT 2021

  15. arXiv:2010.05357  [pdf, other]

    cs.CL cs.AI

    A Knowledge-Driven Approach to Classifying Object and Attribute Coreferences in Opinion Mining

    Authors: Jiahua Chen, Shuai Wang, Sahisnu Mazumder, Bing Liu

    Abstract: Classifying and resolving coreferences of objects (e.g., product names) and attributes (e.g., product aspects) in opinionated reviews is crucial for improving the opinion mining performance. However, the task is challenging as one often needs to consider domain-specific knowledge (e.g., iPad is a tablet and has aspect resolution) to identify coreferences in opinionated reviews. Also, compiling a h…

    Submitted 17 July, 2021; v1 submitted 11 October, 2020; originally announced October 2020.

    Comments: Accepted to Proceedings of EMNLP 2020 (Findings)

  16. arXiv:2009.10750  [pdf, ps, other]

    cs.CL cs.AI cs.HC

    Lifelong Learning Dialogue Systems: Chatbots that Self-Learn On the Job

    Authors: Bing Liu, Sahisnu Mazumder

    Abstract: Dialogue systems, also called chatbots, are now used in a wide range of applications. However, they still have some major weaknesses. One key weakness is that they are typically trained from manually-labeled data and/or written with handcrafted rules, and their knowledge bases (KBs) are also compiled by human experts. Due to the huge amount of manual effort involved, they are difficult to scale an…

    Submitted 23 February, 2021; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: A revised version of this work has been published in AAAI-2021 with title: "Lifelong and Continual Learning Dialogue Systems: Learning during Conversation". Please use this revised AAAI-21 version for citation

  17. arXiv:2004.14357  [pdf, ps, other]

    cs.CL

    Detecting Domain Polarity-Changes of Words in a Sentiment Lexicon

    Authors: Shuai Wang, Guangyi Lv, Sahisnu Mazumder, Bing Liu

    Abstract: Sentiment lexicons are instrumental for sentiment analysis. One can use a set of sentiment words provided in a sentiment lexicon and a lexicon-based classifier to perform sentiment classification. One major issue with this approach is that many sentiment words are domain dependent. That is, they may be positive in some domains but negative in some others. We refer to this problem as domain polarit…

    Submitted 29 April, 2020; originally announced April 2020.

  18. arXiv:1912.12652  [pdf]

    cs.HC

    An assistive HCI system based on block scanning objects using eye blinks

    Authors: Supriya Sarker, Md. Shahraduan Mazumder, Md. Sajedur Rahman, Md. Anayt Rabbi

    Abstract: Human-Computer Interaction (HCI) provides a new communication channel between human and the computer. We develop an assistive system based on block scanning techniques using eye blinks that presents a hands-free interface between human and computer for people with motor impairments. The developed system has been tested by 12 users who performed 10 common in computer tasks using eye blinks with sca…

    Submitted 29 December, 2019; originally announced December 2019.

    Comments: 6 pages

  19. arXiv:1910.14084  [pdf, other]

    cs.CL cs.HC cs.IR

    Building an Application Independent Natural Language Interface

    Authors: Sahisnu Mazumder, Bing Liu, Shuai Wang, Sepideh Esmaeilpour

    Abstract: Traditional approaches to building natural language (NL) interfaces typically use a semantic parser to parse the user command and convert it to a logical form, which is then translated to an executable action in an application. However, it is still challenging for a semantic parser to correctly parse natural language. For a different domain, the parser may need to be retrained or tuned, and a new…

    Submitted 25 November, 2021; v1 submitted 30 October, 2019; originally announced October 2019.

  20. arXiv:1907.13295  [pdf, ps, other]

    cs.CL cs.AI cs.HC

    Lifelong and Interactive Learning of Factual Knowledge in Dialogues

    Authors: Sahisnu Mazumder, Bing Liu, Shuai Wang, Nianzu Ma

    Abstract: Dialogue systems are increasingly using knowledge bases (KBs) storing real-world facts to help generate quality responses. However, as the KBs are inherently incomplete and remain fixed during conversation, it limits dialogue systems' ability to answer questions and to handle questions involving entities or relations that are not in the KB. In this paper, we make an attempt to propose an engine fo…

    Submitted 20 December, 2019; v1 submitted 30 July, 2019; originally announced July 2019.

    Comments: Published in SIGDIAL 2019

  21. arXiv:1802.06024  [pdf, ps, other]

    cs.CL cs.AI cs.HC

    Towards a Continuous Knowledge Learning Engine for Chatbots

    Authors: Sahisnu Mazumder, Nianzu Ma, Bing Liu

    Abstract: Although chatbots have been very popular in recent years, they still have some serious weaknesses which limit the scope of their applications. One major weakness is that they cannot learn new knowledge during the conversation process, i.e., their knowledge is fixed beforehand and cannot be expanded or updated during conversation. In this paper, we propose to build a general knowledge learning engi…

    Submitted 24 February, 2018; v1 submitted 16 February, 2018; originally announced February 2018.

  22. arXiv:1802.05818  [pdf, other]

    cs.CL cs.AI

    Disentangling Aspect and Opinion Words in Target-based Sentiment Analysis using Lifelong Learning

    Authors: Shuai Wang, Mianwei Zhou, Sahisnu Mazumder, Bing Liu, Yi Chang

    Abstract: Given a target name, which can be a product aspect or entity, identifying its aspect words and opinion words in a given corpus is a fine-grained task in target-based sentiment analysis (TSA). This task is challenging, especially when we have no labeled data and we want to perform it for any given domain. To address it, we propose a general two-stage approach. Stage one extracts/groups the target-r…

    Submitted 15 February, 2018; originally announced February 2018.

  23. Context-aware Path Ranking for Knowledge Base Completion

    Authors: Sahisnu Mazumder, Bing Liu

    Abstract: Knowledge base (KB) completion aims to infer missing facts from existing ones in a KB. Among various approaches, path ranking (PR) algorithms have received increasing attention in recent years. PR algorithms enumerate paths between entity pairs in a KB and use those paths as features to train a model for missing fact prediction. Due to their good performances and high model interpretability, sever…

    Submitted 20 December, 2017; originally announced December 2017.

    Journal ref: Published in IJCAI 2017

  24. arXiv:1606.03480  [pdf]

    cs.SI

    LANet: An Enriched Knowledgebase for Location-aware Activity Recommendation System

    Authors: Sahisnu Mazumder

    Abstract: Accumulation of large amount of location-specific reviews on web due to escalating popularity of Location-based Social Networking platforms like Yelp, Foursquare, Brightkite etc. in recent years, has created the opportunity to discover location-specific activities and develop myriads of location-aware activity recommendation applications. The performance and popularity of such recommendation appli…

    Submitted 10 June, 2016; originally announced June 2016.

    Comments: Part of the work presented in this dissertation has been published as: Sahisnu Mazumder, Dhaval Patel and Sameep Mehta. "ActMiner: Discovering Location-Specific Activities from Community-Authored Reviews", in 16th International Conference on Data Warehousing and Knowledge Discovery (DaWaK 2014), Springer International Publishing