Showing 1–50 of 329 results for author: Banerjee, S

  1. arXiv:2407.04174  [pdf, other]

    cs.NI eess.SP

    Gemini: Integrating Full-fledged Sensing upon Millimeter Wave Communications

    Authors: Yilong Li, Zhe Chen, Jun Luo, Suman Banerjee

    Abstract: Integrating millimeter wave (mmWave) technology in both communication and sensing is promising as it enables the reuse of existing spectrum and infrastructure without draining resources. Most existing systems piggyback sensing onto conventional communication modes without fully exploiting the potential of integrated sensing and communication (ISAC) in mmWave radios (not full-fledged). In this paper…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 12 pages

  2. arXiv:2407.00091  [pdf, other]

    cs.IR cs.HC cs.LG

    Learning to Rank for Maps at Airbnb

    Authors: Malay Haldar, Hongwei Zhang, Kedar Bellare, Sherry Chen, Soumyadip Banerjee, Xiaotang Wang, Mustafa Abdool, Huiji Gao, Pavan Tapadia, Liwei He, Sanjeev Katariya

    Abstract: As a two-sided marketplace, Airbnb brings together hosts who own listings for rent with prospective guests from around the globe. Results from a guest's search for listings are displayed primarily through two interfaces: (1) as a list of rectangular cards that contain on them the listing image, price, rating, and other details, referred to as list-results (2) as oval pins on a map showing the list…

    Submitted 25 June, 2024; originally announced July 2024.

  3. arXiv:2406.12274  [pdf, other]

    cs.CL

    SafeInfer: Context Adaptive Decoding Time Safety Alignment for Large Language Models

    Authors: Somnath Banerjee, Soham Tripathy, Sayan Layek, Shanu Kumar, Animesh Mukherjee, Rima Hazra

    Abstract: Safety-aligned language models often exhibit fragile and imbalanced safety mechanisms, increasing the likelihood of generating unsafe content. In addition, incorporating new knowledge through editing techniques to language models can further compromise safety. To address these issues, we propose SafeInfer, a context-adaptive, decoding-time safety alignment strategy for generating safe responses to…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Under review

  4. arXiv:2406.11801  [pdf, other]

    cs.CL

    Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations

    Authors: Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria

    Abstract: Ensuring the safe alignment of large language models (LLMs) with human values is critical as they become integral to applications like translation and question answering. Current alignment methods struggle with dynamic user intentions and complex objectives, making models vulnerable to generating harmful content. We propose Safety Arithmetic, a training-free framework enhancing LLM safety across d…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Under Review. Codes are available at: https://github.com/declare-lab/safety-arithmetic

  5. arXiv:2406.11139  [pdf, other]

    cs.CL

    Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance

    Authors: Somnath Banerjee, Avik Halder, Rajarshi Mandal, Sayan Layek, Ian Soboroff, Rima Hazra, Animesh Mukherjee

    Abstract: The integration of pretrained language models (PLMs) like BERT and GPT has revolutionized NLP, particularly for English, but it has also created linguistic imbalances. This paper strategically identifies the need for linguistic equity by examining several knowledge editing techniques in multilingual contexts. We evaluate the performance of models such as Mistral, TowerInstruct, OpenHathi, Tamil-Ll…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Under review

  6. arXiv:2406.10886  [pdf, other]

    cs.CL cs.LG

    Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM

    Authors: Sri Raghava Muddu, Rupasai Rangaraju, Tejpalsingh Siledar, Swaroop Nath, Pushpak Bhattacharyya, Swaprava Nath, Suman Banerjee, Amey Patil, Muthusamy Chelliah, Sudhanshu Shekhar Singh, Nikesh Garera

    Abstract: Opinion summarization in e-commerce encapsulates the collective views of numerous users about a product based on their reviews. Typically, a product on an e-commerce platform has thousands of reviews, each review comprising around 10-15 words. While Large Language Models (LLMs) have shown proficiency in summarization tasks, they struggle to handle such a large volume of reviews due to context limi…

    Submitted 16 June, 2024; originally announced June 2024.

  7. arXiv:2406.08307  [pdf, other]

    stat.ML cs.LG

    Measuring model variability using robust non-parametric testing

    Authors: Sinjini Banerjee, Tim Marrinan, Reilly Cannon, Tony Chiang, Anand D. Sarwate

    Abstract: Training a deep neural network often involves stochastic optimization, meaning each run will produce a different model. The seed used to initialize random elements of the optimization procedure heavily influences the quality of a trained model, which may be obscure from many commonly reported summary statistics, like accuracy. However, random seed is often not included in hyper-parameter optimizat…

    Submitted 12 June, 2024; originally announced June 2024.

  8. arXiv:2406.02402  [pdf, other]

    math.OC cs.GT stat.ML

    Online Fair Allocation of Perishable Resources

    Authors: Siddhartha Banerjee, Chamsi Hssaine, Sean R. Sinclair

    Abstract: We consider a practically motivated variant of the canonical online fair allocation problem: a decision-maker has a budget of perishable resources to allocate over a fixed number of rounds. Each round sees a random number of arrivals, and the decision-maker must commit to an allocation for these individuals before moving on to the next round. The goal is to construct a sequence of allocations that…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 51 pages, 8 figures

    MSC Class: 91B32

  9. arXiv:2406.00375  [pdf, other]

    cs.RO

    Teledrive: An Embodied AI based Telepresence System

    Authors: Snehasis Banerjee, Sayan Paul, Ruddradev Roychoudhury, Abhijan Bhattacharya, Chayan Sarkar, Ashis Sau, Pradip Pramanick, Brojeshwar Bhowmick

    Abstract: This article presents Teledrive, a telepresence robotic system with embodied AI features that empowers an operator to navigate the telerobot in any unknown remote place with minimal human intervention. We conceive Teledrive in the context of democratizing remote care-giving for elderly citizens as well as for isolated patients, affected by contagious diseases. In particular, this paper focuses on…

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Accepted in Journal of Intelligent Robotic System

    Journal ref: Journal of Intelligent Robotic System 2024

  10. arXiv:2404.12913  [pdf, other]

    cs.DB

    Influential Billboard Slot Selection under Zonal Influence Constraint

    Authors: Dildar Ali, Suman Banerjee, Yamuna Prasad

    Abstract: Given billboard and trajectory database, finding a limited number of billboard slots for maximizing the influence is an important problem in the context of billboard advertisement. Most of the existing literature focused on the influential slot selection problem without considering any specific zonal influence constraint. To bridge this gap in this paper, we introduce and study the Influential Bil…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 14 Pages

  11. arXiv:2404.05243  [pdf, other]

    cs.CL cs.AI

    Product Description and QA Assisted Self-Supervised Opinion Summarization

    Authors: Tejpalsingh Siledar, Rupasai Rangaraju, Sankara Sri Raghava Ravindra Muddu, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera, Swaprava Nath, Pushpak Bhattacharyya

    Abstract: In e-commerce, opinion summarization is the process of summarizing the consensus opinions found in product reviews. However, the potential of additional sources such as product description and question-answers (QA) has been considered less often. Moreover, the absence of any supervised training data makes this task challenging. To address this, we propose a novel synthetic dataset creation (SDC) s…

    Submitted 8 April, 2024; originally announced April 2024.

  12. arXiv:2404.03587  [pdf, other]

    cs.RO cs.AI

    Anticipate & Collab: Data-driven Task Anticipation and Knowledge-driven Planning for Human-robot Collaboration

    Authors: Shivam Singh, Karthik Swaminathan, Raghav Arora, Ramandeep Singh, Ahana Datta, Dipanjan Das, Snehasis Banerjee, Mohan Sridharan, Madhava Krishna

    Abstract: An agent assisting humans in daily living activities can collaborate more effectively by anticipating upcoming tasks. Data-driven methods represent the state of the art in task anticipation, planning, and related problems, but these methods are resource-hungry and opaque. Our prior work introduced a proof of concept framework that used an LLM to anticipate 3 high-level tasks that served as goals f…

    Submitted 4 April, 2024; originally announced April 2024.

  13. arXiv:2404.01632  [pdf, other]

    cs.LG eess.SY

    Enhancing Functional Safety in Automotive AMS Circuits through Unsupervised Machine Learning

    Authors: Ayush Arunachalam, Ian Kintz, Suvadeep Banerjee, Arnab Raha, Xiankun Jin, Fei Su, Viswanathan Pillai Prasanth, Rubin A. Parekhji, Suriyaprakash Natarajan, Kanad Basu

    Abstract: Given the widespread use of safety-critical applications in the automotive field, it is crucial to ensure the Functional Safety (FuSa) of circuits and components within automotive systems. The Analog and Mixed-Signal (AMS) circuits prevalent in these systems are more vulnerable to faults induced by parametric perturbations, noise, environmental stress, and other factors, in comparison to their dig…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 12 pages, 12 figures

  14. arXiv:2403.19717  [pdf, other]

    cs.LG cs.CR cs.CY

    A Picture is Worth 500 Labels: A Case Study of Demographic Disparities in Local Machine Learning Models for Instagram and TikTok

    Authors: Jack West, Lea Thiemt, Shimaa Ahmed, Maggie Bartig, Kassem Fawaz, Suman Banerjee

    Abstract: Mobile apps have embraced user privacy by moving their data processing to the user's smartphone. Advanced machine learning (ML) models, such as vision models, can now locally analyze user images to extract insights that drive several functionalities. Capitalizing on this new processing model of locally analyzing user images, we analyze two popular social media apps, TikTok and Instagram, to reveal…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 18 pages, 13 figures, to appear at IEEE Symposium on Security and Privacy 2024

    ACM Class: K.4.2; C.4; D.2.2

  15. arXiv:2403.12047  [pdf, other]

    cs.CV

    Alpha-wolves and Alpha-mammals: Exploring Dictionary Attacks on Iris Recognition Systems

    Authors: Sudipta Banerjee, Anubhav Jain, Zehua Jiang, Nasir Memon, Julian Togelius, Arun Ross

    Abstract: A dictionary attack in a biometric system entails the use of a small number of strategically generated images or templates to successfully match with a large number of identities, thereby compromising security. We focus on dictionary attacks at the template level, specifically the IrisCodes used in iris recognition systems. We present a hitherto unknown vulnerability wherein we mix IrisCodes usin…

    Submitted 20 November, 2023; originally announced March 2024.

    Comments: 8 pages, 5 figures, 13 tables, Workshop on Manipulation, Adversarial, and Presentation Attacks in Biometrics, Winter Conference on Applications of Computer Vision

  16. arXiv:2403.08092  [pdf, other]

    cs.CV

    Mitigating the Impact of Attribute Editing on Face Recognition

    Authors: Sudipta Banerjee, Sai Pranaswi Mullangi, Shruti Wagle, Chinmay Hegde, Nasir Memon

    Abstract: Through a large-scale study over diverse face images, we show that facial attribute editing using modern generative AI models can severely degrade automated face recognition systems. This degradation persists even with identity-preserving generative models. To mitigate this issue, we propose two novel techniques for local and global attribute editing. We empirically ablate twenty-six facial semant…

    Submitted 9 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Under review

  17. arXiv:2403.04660  [pdf, other]

    cs.HC

    Exploring the Design Space of Optical See-through AR Head-Mounted Displays to Support First Responders in the Field

    Authors: Kexin Zhang, Brianna Cochran, Ruijia Chen, Lance Hartung, Bryce Sprecher, Ross Tredinnick, Kevin Ponto, Suman Banerjee, Yuhang Zhao

    Abstract: First responders (FRs) navigate hazardous, unfamiliar environments in the field (e.g., mass-casualty incidents), making life-changing decisions in a split second. AR head-mounted displays (HMDs) have shown promise in supporting them due to their capability of recognizing and augmenting the challenging environments in a hands-free manner. However, the design space has not been thoroughly explored by…

    Submitted 7 March, 2024; originally announced March 2024.

    Journal ref: CHI 2024

  18. arXiv:2403.02688  [pdf, other]

    cs.ET cs.AI cs.LG

    DOCTOR: Dynamic On-Chip Temporal Variation Remediation Toward Self-Corrected Photonic Tensor Accelerators

    Authors: Haotian Lu, Sanmitra Banerjee, Jiaqi Gu

    Abstract: Photonic computing has emerged as a promising solution for accelerating computation-intensive artificial intelligence (AI) workloads, offering unparalleled speed and energy efficiency, especially in resource-limited, latency-sensitive edge computing environments. However, the deployment of analog photonic tensor accelerators encounters reliability challenges due to hardware noise and environmental…

    Submitted 31 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 9 pages. Accepted to IEEE JLT 2024

  19. arXiv:2402.17720  [pdf, other]

    cs.LG cs.DS cs.IT

    The SMART approach to instance-optimal online learning

    Authors: Siddhartha Banerjee, Alankrita Bhatt, Christina Lee Yu

    Abstract: We devise an online learning algorithm -- titled Switching via Monotone Adapted Regret Traces (SMART) -- that adapts to the data and achieves regret that is instance optimal, i.e., simultaneously competitive on every input sequence compared to the performance of the follow-the-leader (FTL) policy and the worst case guarantee of any other input policy. We show that the regret of the SMART policy on…

    Submitted 27 February, 2024; originally announced February 2024.

  20. arXiv:2402.16342  [pdf, other]

    cs.AI cs.RO

    Contingency Planning Using Bi-level Markov Decision Processes for Space Missions

    Authors: Somrita Banerjee, Edward Balaban, Mark Shirley, Kevin Bradner, Marco Pavone

    Abstract: This work focuses on autonomous contingency planning for scientific missions by enabling rapid policy computation from any off-nominal point in the state space in the event of a delay or deviation from the nominal mission plan. Successful contingency planning involves managing risks and rewards, often probabilistically associated with actions, in stochastic scenarios. Markov Decision Processes (MD…

    Submitted 26 February, 2024; originally announced February 2024.

  21. arXiv:2402.16159  [pdf, other]

    cs.CL

    DistALANER: Distantly Supervised Active Learning Augmented Named Entity Recognition in the Open Source Software Ecosystem

    Authors: Somnath Banerjee, Avik Dutta, Aaditya Agrawal, Rima Hazra, Animesh Mukherjee

    Abstract: With the AI revolution in place, the trend for building automated systems to support professionals in different domains such as the open source software systems, healthcare systems, banking systems, transportation systems and many others has become increasingly prominent. A crucial requirement in the automation of support tools for such systems is the early identification of named entities, which…

    Submitted 20 June, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: Accepted at ECML-PKDD 2024 (Long Paper)

  22. arXiv:2402.15473  [pdf, other]

    cs.CL cs.LG

    Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization

    Authors: Swaroop Nath, Tejpalsingh Siledar, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Harshad Khadilkar, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has become a dominating strategy in aligning Language Models (LMs) with human values/goals. The key to the strategy is learning a reward model ($\varphi$), which can reflect the latent reward model of humans. While this strategy has proven effective, the training methodology requires a lot of human preference annotation (usually in the order of ten…

    Submitted 18 April, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 19 pages, 6 figures, 21 tables

  23. arXiv:2402.15302  [pdf, other]

    cs.CL cs.CR

    How (un)ethical are instruction-centric responses of LLMs? Unveiling the vulnerabilities of safety guardrails to harmful queries

    Authors: Somnath Banerjee, Sayan Layek, Rima Hazra, Animesh Mukherjee

    Abstract: In this study, we tackle a growing concern around the safety and ethical use of large language models (LLMs). Despite their potential, these models can be tricked into producing harmful or unethical content through various sophisticated methods, including 'jailbreaking' techniques and targeted manipulation. Our work zeroes in on a specific issue: to what extent LLMs can be led astray by asking the…

    Submitted 15 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Under review. {https://huggingface.co/datasets/SoftMINER-Group/TechHazardQA}

  24. arXiv:2402.14702  [pdf, other]

    cs.CL

    InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks

    Authors: Somnath Banerjee, Maulindu Sarkar, Punyajoy Saha, Binny Mathew, Animesh Mukherjee

    Abstract: Recently, influence functions have presented an apparatus for achieving explainability for deep neural models by quantifying the perturbation of individual train instances that might impact a test prediction. Our objectives in this paper are twofold. First, we incorporate influence functions as a feedback into the model to improve its performance. Second, in a dataset extension exercise, using influence f…

    Submitted 9 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted at LREC-COLING 2024 (Long Paper)

  25. arXiv:2402.11683  [pdf, other]

    cs.CL

    One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation

    Authors: Tejpalsingh Siledar, Swaroop Nath, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Swaprava Nath, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

    Abstract: Evaluation of opinion summaries using conventional reference-based metrics rarely provides a holistic evaluation and has been shown to have a relatively low correlation with human judgments. Recent studies suggest using Large Language Models (LLMs) as reference-free metrics for NLG evaluation; however, they remain unexplored for opinion summary evaluation. Moreover, limited opinion summary evaluat…

    Submitted 9 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  26. Publicly auditable privacy-preserving electoral rolls

    Authors: Prashant Agrawal, Mahabir Prasad Jhanwar, Subodh Vishnu Sharma, Subhashis Banerjee

    Abstract: While existing literature on electronic voting has extensively addressed verifiability of voting protocols, the vulnerability of electoral rolls in large public elections remains a critical concern. To ensure integrity of electoral rolls, the current practice is to either make electoral rolls public or share them with the political parties. However, this enables construction of detailed voter prof…

    Submitted 2 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Report number: CSF 2024

    Journal ref: 2024 IEEE 37th Computer Security Foundations Symposium (CSF)

  27. arXiv:2402.03507  [pdf, other]

    cs.AI cs.CL cs.LG

    Neural networks for abstraction and reasoning: Towards broad generalization in machines

    Authors: Mikel Bober-Irizar, Soumya Banerjee

    Abstract: For half a century, artificial intelligence research has attempted to reproduce the human qualities of abstraction and reasoning - creating computer systems that can learn new concepts from a minimal set of examples, in settings where humans find this easy. While specific neural networks are able to solve an impressive range of problems, broad generalisation to situations outside their training da…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 32 pages main text, 17 pages

  28. arXiv:2402.01294  [pdf, other]

    cs.DB cs.IR cs.MA

    Minimizing Regret in Billboard Advertisement under Zonal Influence Constraint

    Authors: Dildar Ali, Suman Banerjee, Yamuna Prasad

    Abstract: In a typical billboard advertisement technique, a number of digital billboards are owned by an influence provider, and many advertisers approach the influence provider for a specific number of views of their advertisement content on a payment basis. If the influence provider provides the demanded or more influence, then he will receive the full payment or else a partial payment. In the context of…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 32 Pages

  29. arXiv:2402.00845  [pdf, ps, other]

    cs.IT cs.GT cs.NI eess.SP eess.SY

    When to Preempt in a Status Update System?

    Authors: Subhankar Banerjee, Sennur Ulukus

    Abstract: We consider a time-slotted status update system with an error-free preemptive queue. The goal of the sampler-scheduler pair is to minimize the age of information at the monitor by sampling and transmitting the freshly sampled update packets to the monitor. The sampler-scheduler pair also has a choice to preempt an old update packet from the server and transmit a new update packet to the server. We…

    Submitted 20 May, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  30. arXiv:2401.16649  [pdf, other]

    cs.LG cs.CR

    Using Motion Forecasting for Behavior-Based Virtual Reality (VR) Authentication

    Authors: Mingjun Li, Natasha Kholgade Banerjee, Sean Banerjee

    Abstract: Task-based behavioral biometric authentication of users interacting in virtual reality (VR) environments enables seamless continuous authentication by using only the motion trajectories of the person's body as a unique signature. Deep learning-based approaches for behavioral biometrics show high accuracy when using complete or near complete portions of the user trajectory, but show lower performan…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: AIxVR 2024 Best Paper Award

  31. arXiv:2401.16464  [pdf, other]

    cs.IR cs.DB cs.LG cs.MA

    Towards Regret Free Slot Allocation in Billboard Advertisement

    Authors: Dildar Ali, Suman Banerjee, Yamuna Prasad

    Abstract: Creating and maximizing influence among the customers is one of the central goals of an advertiser, and hence, remains an active area of research in recent times. In this advertisement technique, the advertisers approach an influence provider for a specific number of views of their content on a payment basis. Now, if the influence provider can provide the required number of views or more, he will…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 37 Pages

  32. arXiv:2401.16443  [pdf, other]

    cs.HC cs.AI cs.LG

    Evaluating Deep Networks for Detecting User Familiarity with VR from Hand Interactions

    Authors: Mingjun Li, Numan Zafar, Natasha Kholgade Banerjee, Sean Banerjee

    Abstract: As VR devices become more prevalent in the consumer space, VR applications are likely to be increasingly used by users unfamiliar with VR. Detecting the familiarity level of a user with VR as an interaction medium provides the potential of providing on-demand training for acclimatization and prevents the user from being burdened by the VR environment in accomplishing their tasks. In this work, we…

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: AIxVR 2024 poster paper

  33. arXiv:2401.12671  [pdf, other]

    cs.CL

    Context Matters: Pushing the Boundaries of Open-Ended Answer Generation with Graph-Structured Knowledge Context

    Authors: Somnath Banerjee, Amruit Sahoo, Sayan Layek, Avik Dutta, Rima Hazra, Animesh Mukherjee

    Abstract: In the continuously advancing AI landscape, crafting context-rich and meaningful responses via Large Language Models (LLMs) is essential. Researchers are becoming more aware of the challenges that LLMs with fewer parameters encounter when trying to provide suitable answers to open-ended questions. To address these hurdles, the integration of cutting-edge strategies, augmentation of rich external d…

    Submitted 5 March, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  34. Automatic Recognition of Learning Resource Category in a Digital Library

    Authors: Soumya Banerjee, Debarshi Kumar Sanyal, Samiran Chattopadhyay, Plaban Kumar Bhowmick, Partha Pratim Das

    Abstract: Digital libraries often face the challenge of processing a large volume of diverse document types. The manual collection and tagging of metadata can be a time-consuming and error-prone task. To address this, we aim to develop an automatic metadata extractor for digital libraries. In this work, we introduce the Heterogeneous Learning Resources (HLR) dataset designed for document image classificatio…

    Submitted 28 November, 2023; originally announced January 2024.

    Comments: 2 pages, 3 figures, Published in JCDL 21

  35. arXiv:2401.10647  [pdf, other]

    cs.CL

    Sowing the Wind, Reaping the Whirlwind: The Impact of Editing Language Models

    Authors: Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria

    Abstract: In the rapidly advancing field of artificial intelligence, the concept of Red-Teaming or Jailbreaking large language models (LLMs) has emerged as a crucial area of study. This approach is especially significant in terms of assessing and enhancing the safety and robustness of these models. This paper investigates the intricate consequences of such modifications through model editing, uncovering a c…

    Submitted 16 May, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Accepted at ACL 2024

  36. arXiv:2401.10601  [pdf, other]

    cs.DS cs.DB

    Influential Slot and Tag Selection in Billboard Advertisement

    Authors: Dildar Ali, Tejash Gupta, Suman Banerjee, Yamuna Prasad

    Abstract: The selection of influential billboard slots remains an important problem in billboard advertisements. Existing studies on this problem have not considered the case of context-specific influence probability. To bridge this gap, in this paper, we introduce the Context Dependent Influential Billboard Slot Selection Problem. First, we show that the problem is NP-hard. We also show that the influence…

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 15 pages

  37. arXiv:2312.16071  [pdf, other]

    cs.NE cs.AI cs.GR cs.LG

    Event-based Shape from Polarization with Spiking Neural Networks

    Authors: Peng Kang, Srutarshi Banerjee, Henry Chopp, Aggelos Katsaggelos, Oliver Cossairt

    Abstract: Recent advances in event-based shape determination from polarization offer a transformative approach that tackles the trade-off between speed and accuracy in capturing surface geometries. In this paper, we investigate event-based shape from polarization using Spiking Neural Networks (SNNs), introducing the Single-Timestep and Multi-Timestep Spiking UNets for effective and efficient surface normal…

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 25 pages

  38. Performance Analysis of Fixed Broadband Wireless Access in mmWave Band in 5G

    Authors: Soumya Banerjee, Sarada Prasad Gochhayat, Sachin Shetty

    Abstract: An end-to-end fiber-based network holds the potential to provide multi-gigabit fixed access to end-users. However, deploying fiber access, especially in areas where fiber is non-existent, can be time-consuming and costly, resulting in delayed returns for Operators. This work investigates transmission data from fixed broadband wireless access in the mmWave band in 5G. Given the growing interest in…

    Submitted 28 November, 2023; originally announced December 2023.

    Comments: 6 pages, 16 figures, Published in ICNC 22

  39. arXiv:2312.05626  [pdf, other]

    cs.SE cs.AI

    Redefining Developer Assistance: Through Large Language Models in Software Ecosystem

    Authors: Somnath Banerjee, Avik Dutta, Sayan Layek, Amruit Sahoo, Sam Conrad Joyce, Rima Hazra

    Abstract: In this paper, we delve into the advancement of domain-specific Large Language Models (LLMs) with a focus on their application in software development. We introduce DevAssistLlama, a model developed through instruction tuning, to assist developers in processing software-related natural language queries. This model, a variant of instruction tuned LLM, is particularly adept at handling intricate tec…

    Submitted 15 March, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: Under review

  40. arXiv:2312.00507  [pdf, other]

    cs.PL cs.CR cs.LG

    VEXIR2Vec: An Architecture-Neutral Embedding Framework for Binary Similarity

    Authors: S. VenkataKeerthy, Yashas Andaluri, Sayan Dey, Soumya Banerjee, Ramakrishna Upadrasta

    Abstract: We propose VEXIR2Vec, a code embedding framework for finding similar functions in binaries. Our representations rely on VEX IR, the intermediate representation used by binary analysis tools like Valgrind and angr. Our proposed embeddings encode both syntactic and semantic information to represent a function, and are both application and architecture independent. We also propose POV, a custom Peepho…

    Submitted 1 December, 2023; originally announced December 2023.

  41. arXiv:2312.00051  [pdf, other]

    cs.CR cs.AI cs.LG

    MIA-BAD: An Approach for Enhancing Membership Inference Attack and its Mitigation with Federated Learning

    Authors: Soumya Banerjee, Sandip Roy, Sayyed Farid Ahamed, Devin Quinn, Marc Vucovich, Dhruv Nandakumar, Kevin Choi, Abdul Rahman, Edward Bowen, Sachin Shetty

    Abstract: The membership inference attack (MIA) is a popular paradigm for compromising the privacy of a machine learning (ML) model. MIA exploits the natural inclination of ML models to overfit upon the training data. MIAs are trained to distinguish between training and testing prediction confidence to infer membership information. Federated Learning (FL) is a privacy-preserving ML paradigm that enables mul…

    Submitted 28 November, 2023; originally announced December 2023.

    Comments: 6 pages, 5 figures, Accepted to be published in ICNC 23

  42. arXiv:2311.17097  [pdf, other]

    cs.LG cs.AI cs.CR cs.NI

    Anonymous Jamming Detection in 5G with Bayesian Network Model Based Inference Analysis

    Authors: Ying Wang, Shashank Jere, Soumya Banerjee, Lingjia Liu, Sachin Shetty, Shehadi Dayekh

    Abstract: Jamming and intrusion detection are critical in 5G research, aiming to maintain reliability, prevent user experience degradation, and avoid infrastructure failure. This paper introduces an anonymous jamming detection model for 5G based on signal parameters from the protocol stacks. The system uses supervised and unsupervised learning for real-time, high-accuracy detection of jamming, including unk…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 6 pages, 9 figures, Published in HPSR22. arXiv admin note: text overlap with arXiv:2304.13660

  43. arXiv:2311.15679  [pdf, other]

    cs.CV

    Model-agnostic Body Part Relevance Assessment for Pedestrian Detection

    Authors: Maurice Günder, Sneha Banerjee, Rafet Sifa, Christian Bauckhage

    Abstract: Model-agnostic explanation methods for deep learning models are flexible regarding usability and availability. However, due to the fact that they can only manipulate input to see changes in output, they suffer from weak performance when used with complex model architectures. For models with large inputs as, for instance, in object detection, sampling-based methods like KernelSHAP are inefficient d…

    Submitted 1 February, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  44. arXiv:2311.00176  [pdf, other]

    cs.CL

    ChipNeMo: Domain-Adapted LLMs for Chip Design

    Authors: Mingjie Liu, Teodor-Dumitru Ene, Robert Kirby, Chris Cheng, Nathaniel Pinckney, Rongjian Liang, Jonah Alben, Himyanshu Anand, Sanmitra Banerjee, Ismet Bayraktaroglu, Bonita Bhaskaran, Bryan Catanzaro, Arjun Chaudhuri, Sharon Clay, Bill Dally, Laura Dang, Parikshit Deshpande, Siddhanth Dhodhi, Sameer Halepete, Eric Hill, Jiashang Hu, Sumit Jain, Ankit Jindal, Brucek Khailany, George Kokai, et al. (17 additional authors not shown)

    Abstract: ChipNeMo aims to explore the applications of large language models (LLMs) for industrial chip design. Instead of directly deploying off-the-shelf commercial or open-source LLMs, we adopt the following domain adaptation techniques: domain-adaptive tokenization, domain-adaptive continued pretraining, model alignment with domain-specific instructions, and domain-adapted retrieval models. We e…

    Submitted 4 April, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: Updated results for ChipNeMo-70B model

  45. arXiv:2310.08881  [pdf, ps, other]

    cs.GT

    Online Resource Sharing via Dynamic Max-Min Fairness: Efficiency, Robustness and Non-Stationarity

    Authors: Giannis Fikioris, Siddhartha Banerjee, Éva Tardos

    Abstract: We study the allocation of shared resources over multiple rounds among competing agents, via a dynamic max-min fair (DMMF) mechanism: the good in each round is allocated to the requesting agent with the least number of allocations received to date. Previous work has shown that when an agent has i.i.d. values across rounds, then in the worst case, she can never get more than a constant strictly les…

    Submitted 13 February, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

  46. arXiv:2310.05507  [pdf, other]

    cs.AR eess.SP

    MEDUSA: Scalable Biometric Sensing in the Wild through Distributed MIMO Radars

    Authors: Yilong Li, Ramanujan K Sheshadri, Karthik Sundaresan, Eugene Chai, Suman Banerjee

    Abstract: Radar-based techniques for detecting vital signs have shown promise for continuous contactless vital sign sensing and healthcare applications. However, real-world indoor environments face significant challenges for existing vital sign monitoring systems. These include signal blockage in non-line-of-sight (NLOS) situations, movement of human subjects, and alterations in location and orientation. Ad…

    Submitted 9 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Preprint. Under Review

  47. arXiv:2310.00723  [pdf, other]

    cs.CV cs.RO

    HOH: Markerless Multimodal Human-Object-Human Handover Dataset with Large Object Count

    Authors: Noah Wiederhold, Ava Megyeri, DiMaggio Paris, Sean Banerjee, Natasha Kholgade Banerjee

    Abstract: We present the HOH (Human-Object-Human) Handover Dataset, a large object count dataset with 136 objects, to accelerate data-driven research on handover studies, human-robot handover implementation, and artificial intelligence (AI) on handover parameter estimation from 2D and 3D data of person interactions. HOH contains multi-view RGB and depth data, skeletons, fused point clouds, grasp type and ha…

    Submitted 3 May, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 Datasets and Benchmarks

  48. arXiv:2310.00541  [pdf, other]

    stat.ML cs.LG

    Robust Nonparametric Hypothesis Testing to Understand Variability in Training Neural Networks

    Authors: Sinjini Banerjee, Reilly Cannon, Tim Marrinan, Tony Chiang, Anand D. Sarwate

    Abstract: Training a deep neural network (DNN) often involves stochastic optimization, which means each run will produce a different model. Several works suggest this variability is negligible when models have the same performance, which in the case of classification is test accuracy. However, models with similar test accuracy may not be computing the same function. We propose a new measure of closeness bet…

    Submitted 30 September, 2023; originally announced October 2023.

  49. arXiv:2309.08227  [pdf, other]

    cs.LG cs.AI cs.CV

    VERSE: Virtual-Gradient Aware Streaming Lifelong Learning with Anytime Inference

    Authors: Soumya Banerjee, Vinay K. Verma, Avideep Mukherjee, Deepak Gupta, Vinay P. Namboodiri, Piyush Rai

    Abstract: Lifelong learning or continual learning is the problem of training an AI agent continuously while also preventing it from forgetting its previously acquired knowledge. Streaming lifelong learning is a challenging setting of lifelong learning with the goal of continuous learning in a dynamic non-stationary environment without forgetting. We introduce a novel approach to lifelong learning, which is…

    Submitted 19 February, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

  50. arXiv:2309.06579  [pdf, other]

    cs.ET physics.app-ph

    Compact and Low-Loss PCM-based Silicon Photonic MZIs for Photonic Neural Networks

    Authors: Amin Shafiee, Sanmitra Banerjee, Benoit Charbonnier, Sudeep Pasricha, Mahdi Nikdast

    Abstract: We present an optimized Mach-Zehnder Interferometer (MZI) with phase change materials for photonic neural networks (PNNs). With 0.2 dB loss, -38 dB crosstalk, and length of 52 micrometer, the designed MZI significantly improves the scalability and accuracy of PNNs under loss and crosstalk.

    Submitted 17 August, 2023; originally announced September 2023.

    Comments: This paper is accepted at IEEE Photonics Conference (IPC) 2023