Skip to main content

Showing 1–50 of 293 results for author: Mukherjee, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.06558  [pdf

    cs.SI

    Sampling-Based Attack for Centrality Disruption in Complex Networks

    Authors: Fariba Afrin Irany, Soumya Sarakar, Animesh Mukherjee, Sanjukta Bhowmick

    Abstract: Many mobile networks are represented as graphs to obtain insight to their connectivity and transmission properties. Among these properties centrality resilience, that is, how well centralities, such as closeness and betweennesss, are maintained under attacks is a critical factor for proper functioning of a network. In this paper, we study the centrality resilience of complex networks by developing… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 10 pages, 8 figures, 3 tables

  2. arXiv:2407.06137  [pdf, ps, other

    cs.CV

    OMuSense-23: A Multimodal Dataset for Contactless Breathing Pattern Recognition and Biometric Analysis

    Authors: Manuel Lage Cañellas, Le Nguyen, Anirban Mukherjee, Constantino Álvarez Casado, Xiaoting Wu, Praneeth Susarla, Sasan Sharifipour, Dinesh B. Jayagopi, Miguel Bordallo López

    Abstract: In the domain of non-contact biometrics and human activity recognition, the lack of a versatile, multimodal dataset poses a significant bottleneck. To address this, we introduce the Oulu Multi Sensing (OMuSense-23) dataset that includes biosignals obtained from a mmWave radar, and an RGB-D camera. The dataset features data from 50 individuals in three distinct poses -- standing, sitting, and lying… ▽ More

    Submitted 22 May, 2024; originally announced July 2024.

  3. arXiv:2407.02067  [pdf, other

    cs.CL

    Crossroads of Continents: Automated Artifact Extraction for Cultural Adaptation with Large Multimodal Models

    Authors: Anjishnu Mukherjee, Ziwei Zhu, Antonios Anastasopoulos

    Abstract: In this work, we present a comprehensive three-phase study to examine (1) the effectiveness of large multimodal models (LMMs) in recognizing cultural contexts; (2) the accuracy of their representations of diverse cultures; and (3) their ability to adapt content across cultural boundaries. We first introduce Dalle Street, a large-scale dataset generated by DALL-E 3 and validated by humans, containi… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: under review

  4. arXiv:2407.02066  [pdf, other

    cs.CL

    BiasDora: Exploring Hidden Biased Associations in Vision-Language Models

    Authors: Chahat Raj, Anjishnu Mukherjee, Aylin Caliskan, Antonios Anastasopoulos, Ziwei Zhu

    Abstract: Existing works examining Vision Language Models (VLMs) for social biases predominantly focus on a limited set of documented bias associations, such as gender:profession or race:crime. This narrow scope often overlooks a vast range of unexamined implicit associations, restricting the identification and, hence, mitigation of such biases. We address this gap by probing VLMs to (1) uncover hidden, imp… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Under Review

  5. arXiv:2407.02030  [pdf, other

    cs.CL

    Breaking Bias, Building Bridges: Evaluation and Mitigation of Social Biases in LLMs via Contact Hypothesis

    Authors: Chahat Raj, Anjishnu Mukherjee, Aylin Caliskan, Antonios Anastasopoulos, Ziwei Zhu

    Abstract: Large Language Models (LLMs) perpetuate social biases, reflecting prejudices in their training data and reinforcing societal stereotypes and inequalities. Our work explores the potential of the Contact Hypothesis, a concept from social psychology for debiasing LLMs. We simulate various forms of social contact through LLM prompting to measure their influence on the model's biases, mirroring how int… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Under Review

  6. arXiv:2407.01732  [pdf, other

    cs.CY cs.HC cs.IR

    Investigating Nudges toward Related Sellers on E-commerce Marketplaces: A Case Study on Amazon

    Authors: Abhisek Dash, Abhijnan Chakraborty, Saptarshi Ghosh, Animesh Mukherjee, Krishna P. Gummadi

    Abstract: E-commerce marketplaces provide business opportunities to millions of sellers worldwide. Some of these sellers have special relationships with the marketplace by virtue of using their subsidiary services (e.g., fulfillment and/or shipping services provided by the marketplace) -- we refer to such sellers collectively as Related Sellers. When multiple sellers offer to sell the same product, the mark… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: This work has been accepted for presentation at the ACM Conference on Computer-Supported Cooperative Work and Social Computing (CSCW) 2024. It will appear in Proceedings of the ACM on Human-Computer Interaction

  7. arXiv:2407.00229  [pdf, other

    cs.CV cs.AI

    SemUV: Deep Learning based semantic manipulation over UV texture map of virtual human heads

    Authors: Anirban Mukherjee, Venkat Suprabath Bitra, Vignesh Bondugula, Tarun Reddy Tallapureddy, Dinesh Babu Jayagopi

    Abstract: Designing and manipulating virtual human heads is essential across various applications, including AR, VR, gaming, human-computer interaction and VFX. Traditional graphic-based approaches require manual effort and resources to achieve accurate representation of human heads. While modern deep learning techniques can generate and edit highly photorealistic images of faces, their focus remains predom… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: CVIP 2024 Preprint

  8. arXiv:2406.19543  [pdf, other

    cs.CL cs.SI

    Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management

    Authors: Seid Muhie Yimam, Daryna Dementieva, Tim Fischer, Daniil Moskovskiy, Naquee Rizwan, Punyajoy Saha, Sarthak Roy, Martin Semmann, Alexander Panchenko, Chris Biemann, Animesh Mukherjee

    Abstract: Despite regulations imposed by nations and social media platforms, such as recent EU regulations targeting digital violence, abusive content persists as a significant challenge. Existing approaches primarily rely on binary solutions, such as outright blocking or banning, yet fail to address the complex nature of abusive speech. In this work, we propose a more comprehensive approach called Demarcat… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  9. arXiv:2406.14012  [pdf, other

    cs.CL cs.AI

    Seeing Through AI's Lens: Enhancing Human Skepticism Towards LLM-Generated Fake News

    Authors: Navid Ayoobi, Sadat Shahriar, Arjun Mukherjee

    Abstract: LLMs offer valuable capabilities, yet they can be utilized by malicious users to disseminate deceptive information and generate fake news. The growing prevalence of LLMs poses difficulties in crafting detection approaches that remain effective across various text domains. Additionally, the absence of precautionary measures for AI-generated news on online social platforms is concerning. Therefore,… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  10. arXiv:2406.12274  [pdf, other

    cs.CL

    SafeInfer: Context Adaptive Decoding Time Safety Alignment for Large Language Models

    Authors: Somnath Banerjee, Soham Tripathy, Sayan Layek, Shanu Kumar, Animesh Mukherjee, Rima Hazra

    Abstract: Safety-aligned language models often exhibit fragile and imbalanced safety mechanisms, increasing the likelihood of generating unsafe content. In addition, incorporating new knowledge through editing techniques to language models can further compromise safety. To address these issues, we propose SafeInfer, a context-adaptive, decoding-time safety alignment strategy for generating safe responses to… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Under review

  11. arXiv:2406.11139  [pdf, other

    cs.CL

    Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance

    Authors: Somnath Banerjee, Avik Halder, Rajarshi Mandal, Sayan Layek, Ian Soboroff, Rima Hazra, Animesh Mukherjee

    Abstract: The integration of pretrained language models (PLMs) like BERT and GPT has revolutionized NLP, particularly for English, but it has also created linguistic imbalances. This paper strategically identifies the need for linguistic equity by examining several knowledge editing techniques in multilingual contexts. We evaluate the performance of models such as Mistral, TowerInstruct, OpenHathi, Tamil-Ll… ▽ More

    Submitted 17 July, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: Under review

  12. arXiv:2405.13559  [pdf

    cs.CE

    Identification of microstructure from macroscopic measurement using inverse multiscale analysis

    Authors: Anjan Mukherjee, Biswanth Banerjee

    Abstract: Most of the tailored materials are heterogeneous at the ingredient level. Analysis of those heterogeneous structures requires the knowledge of microstructure. With the knowledge of microstructure, multiscale analysis is carried out with homogenization at the micro level. Second-order homogenization is carried out whenever the ingredient size is comparable to the structure size. Therefore, knowledg… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Structural Engineering Convention SEC 2023

  13. arXiv:2405.13384  [pdf, other

    cs.CE

    Elastic-gap free strain gradient crystal plasticity model that effectively account for plastic slip gradient and grain boundary dissipation

    Authors: Anjan Mukherjee, Biswanath Banerjee

    Abstract: This paper proposes an elastic-gap free strain gradient crystal plasticity model that addresses dissipation caused by plastic slip gradient and grain boundary (GB) Burger tensor. The model involves splitting plastic slip gradient and GB Burger tensor into energetic dissipative quantities. Unlike conventional models, the bulk and GB defect energy are considered to be a quadratic functional of the e… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Submitted in Journal of the Mechanics and Physics of Solids

  14. arXiv:2405.07795  [pdf, other

    stat.ML cs.LG

    Improved Bound for Robust Causal Bandits with Linear Models

    Authors: Zirui Yan, Arpan Mukherjee, Burak Varıcı, Ali Tajer

    Abstract: This paper investigates the robustness of causal bandits (CBs) in the face of temporal model fluctuations. This setting deviates from the existing literature's widely-adopted assumption of constant causal models. The focus is on causal systems with linear structural equation models (SEMs). The SEMs and the time-varying pre- and post-interventional statistical models are all unknown and subject to… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 11 pages, 3 figures. arXiv admin note: substantial text overlap with arXiv:2310.19794

  15. arXiv:2405.03963  [pdf, other

    cs.AI cs.LG

    ERATTA: Extreme RAG for Table To Answers with Large Language Models

    Authors: Sohini Roychowdhury, Marko Krema, Anvar Mahammad, Brian Moore, Arijit Mukherjee, Punit Prakashchandra

    Abstract: Large language models (LLMs) with retrieval augmented-generation (RAG) have been the optimal choice for scalable generative AI solutions in the recent past. However, the choice of use-cases that incorporate RAG with LLMs have been either generic or extremely domain specific, thereby questioning the scalability and generalizability of RAG-LLM approaches. In this work, we propose a unique LLM-based… ▽ More

    Submitted 14 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: 5 pages, 3 tables, Asilomar SSC Conference, 2024

  16. arXiv:2404.08624  [pdf, other

    cs.LG math.OC

    Regularized Gradient Clipping Provably Trains Wide and Deep Neural Networks

    Authors: Matteo Tucat, Anirbit Mukherjee

    Abstract: In this work, we instantiate a regularized form of the gradient clipping algorithm and prove that it can converge to the global minima of deep neural network loss functions provided that the net is of sufficient width. We present empirical evidence that our theoretically founded regularized gradient clipping algorithm is also competitive with the state-of-the-art deep-learning heuristics. Hence th… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 16 pages, 4 figures

  17. arXiv:2404.04979  [pdf, other

    econ.EM cs.LG

    CAVIAR: Categorical-Variable Embeddings for Accurate and Robust Inference

    Authors: Anirban Mukherjee, Hannah Hanwen Chang

    Abstract: Social science research often hinges on the relationship between categorical variables and outcomes. We introduce CAVIAR, a novel method for embedding categorical variables that assume values in a high-dimensional ambient space but are sampled from an underlying manifold. Our theoretical and numerical analyses outline challenges posed by such categorical variables in causal inference. Specifically… ▽ More

    Submitted 11 April, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

  18. arXiv:2404.04436  [pdf, other

    cs.AI

    AI Knowledge and Reasoning: Emulating Expert Creativity in Scientific Research

    Authors: Anirban Mukherjee, Hannah Hanwen Chang

    Abstract: We investigate whether modern AI can emulate expert creativity in complex scientific endeavors. We introduce novel methodology that utilizes original research articles published after the AI's training cutoff, ensuring no prior exposure, mitigating concerns of rote memorization and prior training. The AI are tasked with redacting findings, predicting outcomes from redacted research, and assessing… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  19. arXiv:2404.00185  [pdf, other

    cs.CV cs.AI

    On Inherent Adversarial Robustness of Active Vision Systems

    Authors: Amitangshu Mukherjee, Timur Ibrayev, Kaushik Roy

    Abstract: Current Deep Neural Networks are vulnerable to adversarial examples, which alter their predictions by adding carefully crafted noise. Since human eyes are robust to such inputs, it is possible that the vulnerability stems from the standard way of processing inputs in one shot by processing every pixel with the same importance. In contrast, neuroscience suggests that the human vision system can dif… ▽ More

    Submitted 5 April, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

  20. arXiv:2404.00017  [pdf, other

    cs.AI cs.CL cs.HC

    Psittacines of Innovation? Assessing the True Novelty of AI Creations

    Authors: Anirban Mukherjee

    Abstract: We examine whether Artificial Intelligence (AI) systems generate truly novel ideas rather than merely regurgitating patterns learned during training. Utilizing a novel experimental design, we task an AI with generating project titles for hypothetical crowdfunding campaigns. We compare within AI-generated project titles, measuring repetition and complexity. We compare between the AI-generated title… ▽ More

    Submitted 17 March, 2024; originally announced April 2024.

  21. arXiv:2403.18623  [pdf, other

    cs.CY cs.HC cs.IR

    Antitrust, Amazon, and Algorithmic Auditing

    Authors: Abhisek Dash, Abhijnan Chakraborty, Saptarshi Ghosh, Animesh Mukherjee, Jens Frankenreiter, Stefan Bechtold, Krishna P. Gummadi

    Abstract: In digital markets, antitrust law and special regulations aim to ensure that markets remain competitive despite the dominating role that digital platforms play today in everyone's life. Unlike traditional markets, market participant behavior is easily observable in these markets. We present a series of empirical investigations into the extent to which Amazon engages in practices that are typically… ▽ More

    Submitted 25 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: The paper has been accepted to appear at Journal of Institutional and Theoretical Economics (JITE) 2024

  22. arXiv:2403.17692  [pdf, other

    cs.CV cs.LG math.DG math.OC stat.CO

    Manifold-Guided Lyapunov Control with Diffusion Models

    Authors: Amartya Mukherjee, Thanin Quartz, Jun Liu

    Abstract: This paper presents a novel approach to generating stabilizing controllers for a large class of dynamical systems using diffusion models. The core objective is to develop stabilizing control functions by identifying the closest asymptotically stable vector field relative to a predetermined manifold and adjusting the control function based on this finding. To achieve this, we employ a diffusion mod… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 14 pages

  23. Towards Two-Stream Foveation-based Active Vision Learning

    Authors: Timur Ibrayev, Amitangshu Mukherjee, Sai Aparna Aketi, Kaushik Roy

    Abstract: Deep neural network (DNN) based machine perception frameworks process the entire input in a one-shot manner to provide answers to both "what object is being observed" and "where it is located". In contrast, the "two-stream hypothesis" from neuroscience explains the neural processing in the human visual cortex as an active vision system that utilizes two separate regions of the brain to answer the… ▽ More

    Submitted 20 April, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: Accepted version of the article, 18 pages, 14 figures

    Journal ref: IEEE Transactions on Cognitive and Developmental Systems, 2024

  24. arXiv:2403.14938  [pdf, ps, other

    cs.CL

    On Zero-Shot Counterspeech Generation by LLMs

    Authors: Punyajoy Saha, Aalok Agrawal, Abhik Jana, Chris Biemann, Animesh Mukherjee

    Abstract: With the emergence of numerous Large Language Models (LLM), the usage of such models in various Natural Language Processing (NLP) applications is increasing extensively. Counterspeech generation is one such key task where efforts are made to develop generative models by fine-tuning LLMs with hatespeech - counterspeech pairs, but none of these attempts explores the intrinsic properties of large lan… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 12 pages, 7 tables, accepted at LREC-COLING 2024

  25. arXiv:2403.14706  [pdf, other

    cs.CY cs.AI

    Safeguarding Marketing Research: The Generation, Identification, and Mitigation of AI-Fabricated Disinformation

    Authors: Anirban Mukherjee

    Abstract: Generative AI has ushered in the ability to generate content that closely mimics human contributions, introducing an unprecedented threat: Deployed en masse, these models can be used to manipulate public opinion and distort perceptions, resulting in a decline in trust towards digital platforms. This study contributes to marketing literature and practice in three ways. First, it demonstrates the pr… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  26. arXiv:2403.10058  [pdf, other

    cs.CV

    RID-TWIN: An end-to-end pipeline for automatic face de-identification in videos

    Authors: Anirban Mukherjee, Monjoy Narayan Choudhury, Dinesh Babu Jayagopi

    Abstract: Face de-identification in videos is a challenging task in the domain of computer vision, primarily used in privacy-preserving applications. Despite the considerable progress achieved through generative vision models, there remain multiple challenges in the latest approaches. They lack a comprehensive discussion and evaluation of aspects such as realism, temporal coherence, and preservation of non-… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: This work has been submitted to IEEE ICIP 2024

  27. arXiv:2403.09404  [pdf, other

    cs.AI

    Heuristic Reasoning in AI: Instrumental Use and Mimetic Absorption

    Authors: Anirban Mukherjee, Hannah Hanwen Chang

    Abstract: Deviating from conventional perspectives that frame artificial intelligence (AI) systems solely as logic emulators, we propose a novel program of heuristic reasoning. We distinguish between the 'instrumental' use of heuristics to match resources with objectives, and 'mimetic absorption,' whereby heuristics manifest randomly and universally. Through a series of innovative experiments, including var… ▽ More

    Submitted 18 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  28. arXiv:2403.09289  [pdf, other

    cs.AI

    Silico-centric Theory of Mind

    Authors: Anirban Mukherjee, Hannah Hanwen Chang

    Abstract: Theory of Mind (ToM) refers to the ability to attribute mental states, such as beliefs, desires, intentions, and knowledge, to oneself and others, and to understand that these mental states can differ from one's own and from reality. We investigate ToM in environments with multiple, distinct, independent AI agents, each possessing unique internal states, information, and objectives. Inspired by hu… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  29. arXiv:2403.05434  [pdf

    cs.CL

    Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs

    Authors: Arijit Nag, Animesh Mukherjee, Niloy Ganguly, Soumen Chakrabarti

    Abstract: Large Language Models (LLMs) exhibit impressive zero/few-shot inference and generation quality for high-resource languages (HRLs). A few of them have been trained on low-resource languages (LRLs) and give decent performance. Owing to the prohibitive costs of training LLMs, they are usually used as a network service, with the client charged by the count of input and output tokens. The number of tok… ▽ More

    Submitted 18 April, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  30. Understanding how social discussion platforms like Reddit are influencing financial behavior

    Authors: Sachin Thukral, Suyash Sangwan, Arnab Chatterjee, Lipika Dey, Aaditya Agrawal, Pramit Kumar Chandra, Animesh Mukherjee

    Abstract: This study proposes content and interaction analysis techniques for a large repository created from social media content. Though we have presented our study for a large platform dedicated to discussions around financial topics, the proposed methods are generic and applicable to all platforms. Along with an extension of topic extraction method using Latent Dirichlet Allocation, we propose a few mea… ▽ More

    Submitted 12 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: 8 pages, 8 figures, 3 tables, and 1 algorithm; Published in WI-IAT 2022 (The 21st IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology)

    Journal ref: IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT) 2022 (pp. 612-619)

  31. arXiv:2402.16159  [pdf, other

    cs.CL

    DistALANER: Distantly Supervised Active Learning Augmented Named Entity Recognition in the Open Source Software Ecosystem

    Authors: Somnath Banerjee, Avik Dutta, Aaditya Agrawal, Rima Hazra, Animesh Mukherjee

    Abstract: With the AI revolution in place, the trend for building automated systems to support professionals in different domains such as the open source software systems, healthcare systems, banking systems, transportation systems and many others have become increasingly prominent. A crucial requirement in the automation of support tools for such systems is the early identification of named entities, which… ▽ More

    Submitted 20 June, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: Accepted at ECML-PKDD 2024 (Long Paper)

  32. arXiv:2402.15302  [pdf, other

    cs.CL cs.CR

    How (un)ethical are instruction-centric responses of LLMs? Unveiling the vulnerabilities of safety guardrails to harmful queries

    Authors: Somnath Banerjee, Sayan Layek, Rima Hazra, Animesh Mukherjee

    Abstract: In this study, we tackle a growing concern around the safety and ethical use of large language models (LLMs). Despite their potential, these models can be tricked into producing harmful or unethical content through various sophisticated methods, including 'jailbreaking' techniques and targeted manipulation. Our work zeroes in on a specific issue: to what extent LLMs can be led astray by asking the… ▽ More

    Submitted 15 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Under review. {https://huggingface.co/datasets/SoftMINER-Group/TechHazardQA}

  33. arXiv:2402.14702  [pdf, other

    cs.CL

    InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks

    Authors: Somnath Banerjee, Maulindu Sarkar, Punyajoy Saha, Binny Mathew, Animesh Mukherjee

    Abstract: Recently, influence functions present an apparatus for achieving explainability for deep neural models by quantifying the perturbation of individual train instances that might impact a test prediction. Our objectives in this paper are twofold. First we incorporate influence functions as a feedback into the model to improve its performance. Second, in a dataset extension exercise, using influence f… ▽ More

    Submitted 9 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted at LREC-COLING 2024 (Long Paper)

  34. arXiv:2402.13771  [pdf, other

    cs.CV cs.AI cs.CY cs.HC

    Mask-up: Investigating Biases in Face Re-identification for Masked Faces

    Authors: Siddharth D Jaiswal, Ankit Kr. Verma, Animesh Mukherjee

    Abstract: AI based Face Recognition Systems (FRSs) are now widely distributed and deployed as MLaaS solutions all over the world, moreso since the COVID-19 pandemic for tasks ranging from validating individuals' faces while buying SIM cards to surveillance of citizens. Extensive biases have been reported against marginalized groups in these systems and have led to highly discriminatory outcomes. The post-pa… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  35. arXiv:2402.12881  [pdf, other

    cs.CL

    GRAFFORD: A Benchmark Dataset for Testing the Knowledge of Object Affordances of Language and Vision Models

    Authors: Sayantan Adak, Daivik Agrawal, Animesh Mukherjee, Somak Aditya

    Abstract: We investigate the knowledge of object affordances in pre-trained language models (LMs) and pre-trained Vision-Language models (VLMs). Transformers-based large pre-trained language models (PTLM) learn contextual representation from massive amounts of unlabeled text and are shown to perform impressively in downstream NLU tasks. In parallel, a growing body of literature shows that PTLMs fail inconsi… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  36. arXiv:2402.12198  [pdf, other

    cs.CL cs.CV cs.LG

    Zero shot VLMs for hate meme detection: Are we there yet?

    Authors: Naquee Rizwan, Paramananda Bhaskar, Mithun Das, Swadhin Satyaprakash Majhi, Punyajoy Saha, Animesh Mukherjee

    Abstract: Multimedia content on social media is rapidly evolving, with memes gaining prominence as a distinctive form. Unfortunately, some malicious users exploit memes to target individuals or vulnerable communities, making it imperative to identify and address such instances of hateful memes. Extensive research has been conducted to address this issue by developing hate meme detection models. However, a n… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  37. arXiv:2402.08563  [pdf, other

    cs.LG cs.CV math.AP

    Denoising Diffusion Restoration Tackles Forward and Inverse Problems for the Laplace Operator

    Authors: Amartya Mukherjee, Melissa M. Stadt, Lena Podina, Mohammad Kohandel, Jun Liu

    Abstract: Diffusion models have emerged as a promising class of generative models that map noisy inputs to realistic images. More recently, they have been employed to generate solutions to partial differential equations (PDEs). However, they still struggle with inverse problems in the Laplacian operator, for instance, the Poisson equation, because the eigenvalues that are large in magnitude amplify the meas… ▽ More

    Submitted 14 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 29 pages

  38. arXiv:2402.07262  [pdf, other

    cs.CL cs.HC

    Low-Resource Counterspeech Generation for Indic Languages: The Case of Bengali and Hindi

    Authors: Mithun Das, Saurabh Kumar Pandey, Shivansh Sethi, Punyajoy Saha, Animesh Mukherjee

    Abstract: With the rise of online abuse, the NLP community has begun investigating the use of neural architectures to generate counterspeech that can "counter" the vicious tone of such abusive speech and dilute/ameliorate their rippling effect over the social network. However, most of the efforts so far have been primarily focused on English. To bridge the gap for low-resource languages such as Bengali and… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: Accepted to the Findings of the ACL: EACL 2024

  39. arXiv:2401.12671  [pdf, other

    cs.CL

    Context Matters: Pushing the Boundaries of Open-Ended Answer Generation with Graph-Structured Knowledge Context

    Authors: Somnath Banerjee, Amruit Sahoo, Sayan Layek, Avik Dutta, Rima Hazra, Animesh Mukherjee

    Abstract: In the continuously advancing AI landscape, crafting context-rich and meaningful responses via Large Language Models (LLMs) is essential. Researchers are becoming more aware of the challenges that LLMs with fewer parameters encounter when trying to provide suitable answers to open-ended questions. To address these hurdles, the integration of cutting-edge strategies, augmentation of rich external d… ▽ More

    Submitted 5 March, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  40. arXiv:2401.05834  [pdf, ps, other

    cs.DS

    Modeling Online Paging in Multi-Core Systems

    Authors: Mathieu Mari, Anish Mukherjee, Runtian Ren, Piotr Sankowski

    Abstract: Web requests are growing exponentially since the 90s due to the rapid development of the Internet. This process was further accelerated by the introduction of cloud services. It has been observed statistically that memory or web requests generally follow power-law distribution, Breslau et al. INFOCOM'99. That is, the $i^{\text{th}}$ most popular web page is requested with a probability proportiona… ▽ More

    Submitted 12 January, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  41. arXiv:2401.05538  [pdf, other

    cs.LG

    Multi-objective Feature Selection in Remote Health Monitoring Applications

    Authors: Le Ngu Nguyen, Constantino Álvarez Casado, Manuel Lage Cañellas, Anirban Mukherjee, Nhi Nguyen, Dinesh Babu Jayagopi, Miguel Bordallo López

    Abstract: Radio frequency (RF) signals have facilitated the development of non-contact human monitoring tasks, such as vital signs measurement, activity recognition, and user identification. In some specific scenarios, an RF signal analysis framework may prioritize the performance of one task over that of others. In response to this requirement, we employ a multi-objective optimization approach inspired by… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: Under review

  42. arXiv:2401.02649  [pdf, other

    cs.CV

    Enhancing 3D-Air Signature by Pen Tip Tail Trajectory Awareness: Dataset and Featuring by Novel Spatio-temporal CNN

    Authors: Saurabh Atreya, Maheswar Bora, Aritra Mukherjee, Abhijit Das

    Abstract: This work proposes a novel process of using pen tip and tail 3D trajectory for air signature. To acquire the trajectories we developed a new pen tool and a stereo camera was used. We proposed SliT-CNN, a novel 2D spatial-temporal convolutional neural network (CNN) for better featuring of the air signature. In addition, we also collected an air signature dataset from $45$ signers. Skilled forgery s… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted and presented in IJCB 2023

  43. arXiv:2401.02646  [pdf, other

    cs.CV

    Recent Advancement in 3D Biometrics using Monocular Camera

    Authors: Aritra Mukherjee, Abhijit Das

    Abstract: Recent literature has witnessed significant interest towards 3D biometrics employing monocular vision for robust authentication methods. Motivated by this, in this work we seek to provide insight on recent development in the area of 3D biometrics employing monocular vision. We present the similarity and dissimilarity of 3D monocular biometrics and classical biometrics, listing the strengths and ch… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted and presented in IJCB 2023

  44. arXiv:2312.16256  [pdf, other

    cs.CV cs.AI

    DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision

    Authors: Lu Ling, Yichen Sheng, Zhi Tu, Wentian Zhao, Cheng Xin, Kun Wan, Lantao Yu, Qianyu Guo, Zixun Yu, Yawen Lu, Xuanmao Li, Xingpeng Sun, Rohan Ashok, Aniruddha Mukherjee, Hao Kang, Xiangrui Kong, Gang Hua, Tianyi Zhang, Bedrich Benes, Aniket Bera

    Abstract: We have witnessed significant progress in deep learning-based 3D vision, ranging from neural radiance field (NeRF) based 3D representation learning to applications in novel view synthesis (NVS). However, existing scene-level datasets for deep learning-based 3D vision, limited to either synthetic environments or a narrow selection of real-world scenes, are quite insufficient. This insufficiency not… ▽ More

    Submitted 29 December, 2023; v1 submitted 25 December, 2023; originally announced December 2023.

  45. arXiv:2312.07601  [pdf, other

    eess.SP cs.LG

    Non-contact Multimodal Indoor Human Monitoring Systems: A Survey

    Authors: Le Ngu Nguyen, Praneeth Susarla, Anirban Mukherjee, Manuel Lage Cañellas, Constantino Álvarez Casado, Xiaoting Wu, Olli~Silvén, Dinesh Babu Jayagopi, Miguel Bordallo López

    Abstract: Indoor human monitoring systems leverage a wide range of sensors, including cameras, radio devices, and inertial measurement units, to collect extensive data from users and the environment. These sensors contribute diverse data modalities, such as video feeds from cameras, received signal strength indicators and channel state information from WiFi devices, and three-axis acceleration data from ine… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 19 pages, 5 figures

  46. arXiv:2312.05686  [pdf, other

    cs.AI

    Privacy Preserving Multi-Agent Reinforcement Learning in Supply Chains

    Authors: Ananta Mukherjee, Peeyush Kumar, Boling Yang, Nishanth Chandran, Divya Gupta

    Abstract: This paper addresses privacy concerns in multi-agent reinforcement learning (MARL), specifically within the context of supply chains where individual strategic data must remain confidential. Organizations within the supply chain are modeled as agents, each seeking to optimize their own objectives while interacting with others. As each organization's strategy is contingent on neighboring strategies… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  47. arXiv:2312.01500  [pdf, other

    cs.CL

    Unsupervised Approach to Evaluate Sentence-Level Fluency: Do We Really Need Reference?

    Authors: Gopichand Kanumolu, Lokesh Madasu, Pavan Baswani, Ananya Mukherjee, Manish Shrivastava

    Abstract: Fluency is a crucial goal of all Natural Language Generation (NLG) systems. Widely used automatic evaluation metrics fall short in capturing the fluency of machine-generated text. Assessing the fluency of NLG systems poses a challenge since these models are not limited to simply reusing words from the input but may also generate abstractions. Existing reference-based fluency evaluations, such as w… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: Accepted at IJCNLP-AACL SEALP Workshop

  48. arXiv:2311.07592  [pdf, other

    cs.CL cs.AI cs.IR

    Hallucination-minimized Data-to-answer Framework for Financial Decision-makers

    Authors: Sohini Roychowdhury, Andres Alvarez, Brian Moore, Marko Krema, Maria Paz Gelpi, Federico Martin Rodriguez, Angel Rodriguez, Jose Ramon Cabrejas, Pablo Martinez Serrano, Punit Agrawal, Arijit Mukherjee

    Abstract: Large Language Models (LLMs) have been applied to build several automation and personalized question-answering prototypes so far. However, scaling such prototypes to robust products with minimized hallucinations or fake responses still remains an open challenge, especially in niche data-table heavy domains such as financial decision making. In this work, we present a novel Langchain-based framewor… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 11 pages, 5 figures, 4 tables

  49. arXiv:2311.05870  [pdf

    cs.CV math.OC

    Automated Heterogeneous Low-Bit Quantization of Multi-Model Deep Learning Inference Pipeline

    Authors: Jayeeta Mondal, Swarnava Dey, Arijit Mukherjee

    Abstract: Multiple Deep Neural Networks (DNNs) integrated into single Deep Learning (DL) inference pipelines e.g. Multi-Task Learning (MTL) or Ensemble Learning (EL), etc., albeit very accurate, pose challenges for edge deployment. In these systems, models vary in their quantization tolerance and resource demands, requiring meticulous tuning for accuracy-latency balance. This paper introduces an automated h… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Journal ref: LBQNN@ICCV2023

  50. arXiv:2310.19794  [pdf, other

    stat.ML cs.LG

    Robust Causal Bandits for Linear Models

    Authors: Zirui Yan, Arpan Mukherjee, Burak Varıcı, Ali Tajer

    Abstract: Sequential design of experiments for optimizing a reward function in causal systems can be effectively modeled by the sequential design of interventions in causal bandits (CBs). In the existing literature on CBs, a critical assumption is that the causal models remain constant over time. However, this assumption does not necessarily hold in complex systems, which constantly undergo temporal model f… ▽ More

    Submitted 4 March, 2024; v1 submitted 30 October, 2023; originally announced October 2023.