Showing 1–50 of 85 results for author: Mondal, A

Searching in archive cs.
  1. arXiv:2407.03941  [pdf, other]

    cs.SE cs.AI cs.CL

    Narrow Transformer: Starcoder-Based Java-LM For Desktop

    Authors: Kamalkumar Rathinasamy, Balaji A J, Ankush Kumar, Gagan Gayari, Harshini K, Rajab Ali Mondal, Sreenivasa Raghavan K S, Swayam Singh

    Abstract: This paper presents NT-Java-1.1B, an open-source specialized code language model built on StarCoderBase-1.1B, designed for coding tasks in Java programming. NT-Java-1.1B achieves state-of-the-art performance, surpassing its base model and the majority of other models of similar size on the MultiPL-E Java code benchmark. While there have been studies on extending large, generic pre-trained models to improv…

    Submitted 4 July, 2024; originally announced July 2024.

    ACM Class: I.2.7

  2. arXiv:2407.03216  [pdf, other]

    cs.CV cs.AI

    Learning Disentangled Representation in Object-Centric Models for Visual Dynamics Prediction via Transformers

    Authors: Sanket Gandhi, Atul, Samanyu Mahajan, Vishal Sharma, Rushil Gupta, Arnab Kumar Mondal, Parag Singla

    Abstract: Recent work has shown that object-centric representations can greatly help improve the accuracy of learning dynamics while also bringing interpretability. In this work, we take this idea one step further and ask the following question: "can learning disentangled representation further improve the accuracy of visual dynamics prediction in object-centric models?" While there has been some attempt to le…

    Submitted 3 July, 2024; originally announced July 2024.

  3. arXiv:2406.17437  [pdf, other]

    cs.CV

    Advancing Question Answering on Handwritten Documents: A State-of-the-Art Recognition-Based Model for HW-SQuAD

    Authors: Aniket Pal, Ajoy Mondal, C. V. Jawahar

    Abstract: Question answering on handwritten documents is a challenging task with numerous real-world applications. This paper proposes a novel recognition-based approach that improves upon the previous state-of-the-art on the HW-SQuAD and BenthamQA datasets. Our model incorporates transformer-based document retrieval and ensemble methods at the model level, achieving an Exact Match score of 82.02% and 69% in H…

    Submitted 15 July, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

    Comments: 16 pages

  4. arXiv:2406.00550  [pdf, other]

    cs.DB cs.DC

    Demystifying Object-based Big Data Storage Systems

    Authors: Anindita Sarkar Mondal, Madhupa Sanyal, Ari Kusumastuti, Hrishav Bakul Barua, Kartick Chandra Mondal

    Abstract: Today's era is the digitized era. Managing the big data it generates is an important concern for data scientists. Day by day, the demand for big data storage systems increases. Different organizations are involved in providing storage-related services. They follow different architectures or storage models for storing big data. In this survey paper, our target is to highlight such storage archi…

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 32 Pages

  5. arXiv:2406.00010  [pdf, other]

    cs.IR cs.CL

    EnterpriseEM: Fine-tuned Embeddings for Enterprise Semantic Search

    Authors: Kamalkumar Rathinasamy, Jayarama Nettar, Amit Kumar, Vishal Manchanda, Arun Vijayakumar, Ayush Kataria, Venkateshprasanna Manjunath, Chidambaram GS, Jaskirat Singh Sodhi, Shoeb Shaikh, Wasim Akhtar Khan, Prashant Singh, Tanishq Dattatray Ige, Vipin Tiwari, Rajab Ali Mondal, Harshini K, S Reka, Chetana Amancharla, Faiz ur Rahman, Harikrishnan P A, Indraneel Saha, Bhavya Tiwary, Navin Shankar Patel, Pradeep T S, Balaji A J, et al. (2 additional authors not shown)

    Abstract: Enterprises grapple with the significant challenge of managing proprietary unstructured data, hindering efficient information retrieval. This has led to the emergence of AI-driven information retrieval solutions, designed to adeptly extract relevant insights to address employee inquiries. These solutions often leverage pre-trained embedding models and generative models as foundational components.…

    Submitted 18 May, 2024; originally announced June 2024.

    ACM Class: I.2.7

  6. arXiv:2405.14089  [pdf, other]

    cs.LG

    Improved Canonicalization for Model Agnostic Equivariance

    Authors: Siba Smarak Panigrahi, Arnab Kumar Mondal

    Abstract: This work introduces a novel approach to achieving architecture-agnostic equivariance in deep learning, particularly addressing the limitations of traditional equivariant architectures and the inefficiencies of the existing architecture-agnostic methods. Building equivariant models using traditional methods requires designing equivariant versions of existing models and training them from scratch,…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted to EquiVision workshop, CVPR 2024. 7 pages, 1 figure

  7. arXiv:2405.02523  [pdf, other]

    quant-ph cs.ET

    Optimal Toffoli-Depth Quantum Adder

    Authors: Siyi Wang, Suman Deb, Ankit Mondal, Anupam Chattopadhyay

    Abstract: Efficient quantum arithmetic circuits are commonly found in numerous quantum algorithms of practical significance. To date, logarithmic-depth quantum adders include a constant coefficient k >= 2 while achieving a Toffoli-Depth of k log n + O(1). In this work, 160 alternative compositions of the carry-propagation structure are comprehensively explored to determine the optimal depth structur…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: This paper is under review in ACM Transactions on Quantum Computing

  8. arXiv:2404.10880  [pdf, other]

    cs.CV cs.AI

    HumMUSS: Human Motion Understanding using State Space Models

    Authors: Arnab Kumar Mondal, Stefano Alletto, Denis Tome

    Abstract: Understanding human motion from video is essential for a range of applications, including pose estimation, mesh recovery and action recognition. While state-of-the-art methods predominantly rely on transformer-based architectures, these approaches have limitations in practical scenarios. Transformers are slower when sequentially predicting on a continuous stream of frames in real-time, and do not…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: CVPR 24

  9. arXiv:2404.04642  [pdf]

    eess.IV cs.AI cs.LG

    Power-Efficient Image Storage: Leveraging Super Resolution Generative Adversarial Network for Sustainable Compression and Reduced Carbon Footprint

    Authors: Ashok Mondal, Satyam Singh

    Abstract: In recent years, large-scale adoption of cloud storage solutions has revolutionized the way we think about digital data storage. However, the exponential increase in data volume, especially images, has raised environmental concerns regarding power and resource consumption, as well as the rising digital carbon footprint emissions. The aim of this research is to propose a methodology for cloud-based…

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 5 pages, 5 figures

    MSC Class: 68T07

    ACM Class: I.2.m; H.3.2

  10. IndicSTR12: A Dataset for Indic Scene Text Recognition

    Authors: Harsh Lunia, Ajoy Mondal, C V Jawahar

    Abstract: The importance of Scene Text Recognition (STR) in today's increasingly digital world cannot be overstated. Given the significance of STR, data intensive deep learning approaches that auto-learn feature mappings have primarily driven the development of STR solutions. Several benchmark datasets and substantial work on deep learning models are available for Latin languages to meet this need. On more…

    Submitted 12 March, 2024; originally announced March 2024.

    Journal ref: ICDAR 2023 Workshops. Lecture Notes in Computer Science, vol 14193. Springer, Cham (2023)

  11. arXiv:2403.05435  [pdf, other]

    cs.CV eess.IV eess.SP

    OmniCount: Multi-label Object Counting with Semantic-Geometric Priors

    Authors: Anindya Mondal, Sauradip Nag, Xiatian Zhu, Anjan Dutta

    Abstract: Object counting is pivotal for understanding the composition of scenes. Previously, this task was dominated by class-specific methods, which have gradually evolved into more adaptable class-agnostic strategies. However, these strategies come with their own set of limitations, such as the need for manual exemplar input and multiple passes for multiple categories, resulting in significant inefficien…

    Submitted 20 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  12. arXiv:2403.04178  [pdf, other]

    cs.CL cs.SD eess.AS

    Attempt Towards Stress Transfer in Speech-to-Speech Machine Translation

    Authors: Sai Akarsh, Vamshi Raghusimha, Anindita Mondal, Anil Vuppala

    Abstract: The language diversity in India's education sector poses a significant challenge, hindering inclusivity. Despite the democratization of knowledge through online educational content, the dominance of English, as the internet's lingua franca, limits accessibility, emphasizing the crucial need for translation into Indian languages. Despite existing Speech-to-Speech Machine Translation (SSMT) technolo…

    Submitted 6 March, 2024; originally announced March 2024.

  13. arXiv:2402.02957   

    eess.SY cs.LG

    Multi-Agent Reinforcement Learning for Offloading Cellular Communications with Cooperating UAVs

    Authors: Abhishek Mondal, Deepak Mishra, Ganesh Prasad, George C. Alexandropoulos, Azzam Alnahari, Riku Jantti

    Abstract: Effective solutions for intelligent data collection in terrestrial cellular networks are crucial, especially in the context of Internet of Things applications. The limited spectrum and coverage area of terrestrial base stations pose challenges in meeting the escalating data rate demands of network users. Unmanned aerial vehicles, known for their high agility, mobility, and flexibility, present an…

    Submitted 31 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Significant modification required to get novel results

  14. arXiv:2402.02036  [pdf, other]

    cs.LG

    Generating In-Distribution Proxy Graphs for Explaining Graph Neural Networks

    Authors: Zhuomin Chen, Jiaxing Zhang, Jingchao Ni, Xiaoting Li, Yuchen Bian, Md Mezbahul Islam, Ananda Mohan Mondal, Hua Wei, Dongsheng Luo

    Abstract: Graph Neural Networks (GNNs) have become a building block in graph data processing, with wide applications in critical domains. The growing need to deploy GNNs in high-stakes applications necessitates explainability for users in the decision-making processes. A popular paradigm for the explainability of GNNs is to identify explainable subgraphs by comparing their labels with the ones of original g…

    Submitted 29 May, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    Comments: Accepted to International Conference on Machine Learning (ICML 2024)

  15. arXiv:2310.12723  [pdf, other]

    cs.CR

    Tight Short-Lived Signatures

    Authors: Arup Mondal, Ruthu Hulikal Rooparaghunath, Debayan Gupta

    Abstract: A Time-lock puzzle (TLP) sends information into the future: a predetermined number of sequential computations must occur (i.e., a predetermined amount of time must pass) to retrieve the information, regardless of parallelization. Buoyed by the excitement around secure decentralized applications and cryptocurrencies, the last decade has witnessed numerous constructions of TLP variants and related a…

    Submitted 19 October, 2023; originally announced October 2023.

  16. arXiv:2310.12693  [pdf, ps, other]

    cs.CR

    RANDGENER: Distributed Randomness Beacon from Verifiable Delay Function

    Authors: Arup Mondal, Ruthu Hulikal Rooparaghunath, Debayan Gupta

    Abstract: Buoyed by the excitement around secure decentralized applications, the last few decades have seen numerous constructions of distributed randomness beacons (DRB) along with use cases; however, a secure DRB (in many variations) remains an open problem. We further note that it is natural to want some kind of reward for participants who spend time and energy evaluating the randomness beacon value -- t…

    Submitted 19 October, 2023; originally announced October 2023.

  17. arXiv:2310.01647  [pdf, other]

    cs.LG

    Equivariant Adaptation of Large Pretrained Models

    Authors: Arnab Kumar Mondal, Siba Smarak Panigrahi, Sékou-Oumar Kaba, Sai Rajeswar, Siamak Ravanbakhsh

    Abstract: Equivariant networks are specifically designed to ensure consistent behavior with respect to a set of input transformations, leading to higher sample efficiency and more accurate and robust predictions. However, redesigning each component of prevalent deep neural network architectures to achieve chosen equivariance is a difficult problem and can result in a computationally expensive network during…

    Submitted 29 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 17 pages, 6 figures. Accepted to NeurIPS 2023

  18. arXiv:2307.10763  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    Actor-agnostic Multi-label Action Recognition with Multi-modal Query

    Authors: Anindya Mondal, Sauradip Nag, Joaquin M Prada, Xiatian Zhu, Anjan Dutta

    Abstract: Existing action recognition methods are typically actor-specific due to the intrinsic topological and apparent differences among the actors. This requires actor-specific pose estimation (e.g., humans vs. animals), leading to cumbersome model design complexity and high maintenance costs. Moreover, they often focus on learning the visual modality alone and single-label classification whilst neglecti…

    Submitted 10 January, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: Published at the 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Paris, France

  19. arXiv:2306.11941  [pdf, other]

    cs.LG cs.AI

    Efficient Dynamics Modeling in Interactive Environments with Koopman Theory

    Authors: Arnab Kumar Mondal, Siba Smarak Panigrahi, Sai Rajeswar, Kaleem Siddiqi, Siamak Ravanbakhsh

    Abstract: The accurate modeling of dynamics in interactive environments is critical for successful long-range prediction. Such a capability could advance Reinforcement Learning (RL) and Planning algorithms, but achieving it is challenging. Inaccuracies in model estimates can compound, resulting in increased errors over long horizons. We approach this problem from the lens of Koopman theory, where the nonlin…

    Submitted 12 May, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Accepted to ICLR 2024 and EWRL 2023

  20. arXiv:2305.14410  [pdf, other]

    cs.CV cs.AI cs.CL

    Image Manipulation via Multi-Hop Instructions -- A New Dataset and Weakly-Supervised Neuro-Symbolic Approach

    Authors: Harman Singh, Poorva Garg, Mohit Gupta, Kevin Shah, Ashish Goswami, Satyam Modi, Arnab Kumar Mondal, Dinesh Khandelwal, Dinesh Garg, Parag Singla

    Abstract: We are interested in image manipulation via natural language text -- a task that is useful for multiple AI applications but requires complex reasoning over multi-modal spaces. We extend recently proposed Neuro Symbolic Concept Learning (NSCL), which has been quite effective for the task of Visual Question Answering (VQA), for the task of image manipulation. Our system referred to as NeuroSIM can p…

    Submitted 24 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 (long paper, main conference)

  21. arXiv:2304.10951  [pdf, ps, other]

    cs.LG math.OC stat.ML

    A Cubic-regularized Policy Newton Algorithm for Reinforcement Learning

    Authors: Mizhaan Prajit Maniyar, Akash Mondal, Prashanth L. A., Shalabh Bhatnagar

    Abstract: We consider the problem of control in the setting of reinforcement learning (RL), where model information is not available. Policy gradient algorithms are a popular solution approach for this problem and are usually shown to converge to a stationary point of the value function. In this paper, we propose two policy Newton algorithms that incorporate cubic regularization. Both algorithms employ the…

    Submitted 21 April, 2023; originally announced April 2023.

  22. Time-varying Signals Recovery via Graph Neural Networks

    Authors: Jhon A. Castro-Correa, Jhony H. Giraldo, Anindya Mondal, Mohsen Badiey, Thierry Bouwmans, Fragkiskos D. Malliaros

    Abstract: The recovery of time-varying graph signals is a fundamental problem with numerous applications in sensor networks and forecasting in time series. Effectively capturing the spatio-temporal information in these signals is essential for the downstream tasks. Previous studies have used the smoothness of the temporal differences of such graph signals as an initial assumption. Nevertheless, this smoothn…

    Submitted 12 August, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

    Comments: Published in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023, Greece

  23. arXiv:2302.06004  [pdf, other]

    cs.NI

    Improving UE Energy Efficiency through Network-aware Video Streaming over 5G

    Authors: Basabdatta Palit, Argha Sen, Abhijit Mondal, Ayan Zunaid, Jay Jayatheerthan, Sandip Chakraborty

    Abstract: Adaptive Bitrate (ABR) Streaming over the cellular networks has been well studied in the literature; however, existing ABR algorithms primarily focus on improving the end-users' Quality of Experience (QoE) while ignoring the resource consumption aspect of the underlying device. Consequently, proactive attempts to download video data to maintain the user's QoE often impact the battery life of the u…

    Submitted 12 February, 2023; originally announced February 2023.

    Comments: 14 pages, 11 figures, journal

  24. arXiv:2212.08834  [pdf, other]

    cs.CV

    Towards Robust Handwritten Text Recognition with On-the-fly User Participation

    Authors: Ajoy Mondal, Rohit Saluja, C. V. Jawahar

    Abstract: Long-term OCR services aim to provide high-quality output to their users at competitive costs. It is essential to upgrade the models because of the complex data loaded by the users. The service providers encourage the users who provide data where the OCR model fails by rewarding them based on data complexity, readability, and available budget. Hitherto, the OCR works include preparing the models o…

    Submitted 17 December, 2022; originally announced December 2022.

  25. arXiv:2212.07776  [pdf, other]

    cs.CV

    Enhancing Indic Handwritten Text Recognition Using Global Semantic Information

    Authors: Ajoy Mondal, C. V. Jawahar

    Abstract: Handwritten Text Recognition (HTR) is more interesting and challenging than printed text due to uneven variations in the handwriting style of the writers, content, and time. HTR becomes more challenging for the Indic languages because of (i) multiple characters combined to form conjuncts which increase the number of characters of respective languages, and (ii) near to 100 unique basic Unicode char…

    Submitted 15 December, 2022; originally announced December 2022.

  26. arXiv:2211.11583  [pdf, other]

    cs.IR cs.LG

    Recommending Related Products Using Graph Neural Networks in Directed Graphs

    Authors: Srinivas Virinchi, Anoop Saladi, Abhirup Mondal

    Abstract: Related product recommendation (RPR) is pivotal to the success of any e-commerce service. In this paper, we deal with the problem of recommending related products i.e., given a query product, we would like to suggest top-k products that have high likelihood to be bought together with it. Our problem implicitly assumes asymmetry i.e., for a phone, we would like to recommend a suitable phone case, b…

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: This work was accepted in ECML PKDD 2022

  27. arXiv:2211.06489  [pdf, other]

    cs.LG cs.AI

    Equivariance with Learned Canonicalization Functions

    Authors: Sékou-Oumar Kaba, Arnab Kumar Mondal, Yan Zhang, Yoshua Bengio, Siamak Ravanbakhsh

    Abstract: Symmetry-based neural networks often constrain the architecture in order to achieve invariance or equivariance to a group of transformations. In this paper, we propose an alternative that avoids this architectural constraint by learning to produce canonical representations of the data. These canonicalization functions can readily be plugged into non-equivariant backbone architectures. We offer exp…

    Submitted 7 July, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 21 pages, 5 figures

  28. arXiv:2208.08697  [pdf, other]

    cs.LG cs.CR cs.CV

    Resisting Adversarial Attacks in Deep Neural Networks using Diverse Decision Boundaries

    Authors: Manaar Alam, Shubhajit Datta, Debdeep Mukhopadhyay, Arijit Mondal, Partha Pratim Chakrabarti

    Abstract: The security of deep learning (DL) systems is an extremely important field of study as they are being deployed in several applications due to their ever-improving performance to solve challenging tasks. Despite overwhelming promises, the deep learning systems are vulnerable to crafted adversarial examples, which may be imperceptible to the human eye, but can lead the model to misclassify. Protecti…

    Submitted 18 August, 2022; originally announced August 2022.

  29. arXiv:2208.00290  [pdf, ps, other]

    math.OC cs.LG

    A Gradient Smoothed Functional Algorithm with Truncated Cauchy Random Perturbations for Stochastic Optimization

    Authors: Akash Mondal, Prashanth L. A., Shalabh Bhatnagar

    Abstract: In this paper, we present a stochastic gradient algorithm for minimizing a smooth objective function that is an expectation over noisy cost samples, and only the latter are observed for any given parameter. Our algorithm employs a gradient estimation scheme with random perturbations, which are formed using the truncated Cauchy distribution from the delta sphere. We analyze the bias and variance of…

    Submitted 30 June, 2023; v1 submitted 30 July, 2022; originally announced August 2022.

  30. ImAiR: Airwriting Recognition framework using Image Representation of IMU Signals

    Authors: Ayush Tripathi, Arnab Kumar Mondal, Lalan Kumar, Prathosh A. P

    Abstract: The problem of Airwriting Recognition is focused on identifying letters written by movement of finger in free space. It is a type of gesture recognition where the dictionary corresponds to letters in a specific language. In particular, airwriting recognition using sensor data from wrist-worn devices can be used as a medium of user input for applications in Human-Computer Interaction (HCI). Recogni…

    Submitted 8 September, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

  31. arXiv:2203.11977  [pdf, other]

    cs.NI eess.SY

    YouTube over Google's QUIC vs Internet Middleboxes: A Tug of War between Protocol Sustainability and Application QoE

    Authors: Sapna Chaudhary, Prince Sachdeva, Abhijit Mondal, Sandip Chakraborty, Mukulika Maity

    Abstract: Middleboxes such as web proxies, firewalls, etc. are widely deployed in today's network infrastructure. As a result, most protocols need to adapt their behavior to co-exist. One of the most commonly used transport protocols, QUIC, adapts to such middleboxes by falling back to TCP, where they block it. In this paper, we argue that the blind fallback behavior of QUIC, i.e., not distinguishing betwee…

    Submitted 22 March, 2022; originally announced March 2022.

  32. Recovery of Missing Sensor Data by Reconstructing Time-varying Graph Signals

    Authors: Anindya Mondal, Mayukhmali Das, Aditi Chatterjee, Palaniandavar Venkateswaran

    Abstract: Wireless sensor networks are among the most promising technologies of the current era because of their small size, lower cost, and ease of deployment. With the increasing number of wireless sensors, the probability of generating missing data also rises. This incomplete data could lead to disastrous consequences if used for decision-making. There is rich literature dealing with this problem. Howeve…

    Submitted 23 December, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: Five pages, two figures, 2022 30th European Signal Processing Conference (EUSIPCO). Published version available at: https://ieeexplore.ieee.org/document/9909940

  33. arXiv:2202.10930  [pdf, other]

    cs.LG cs.AI

    Transformation Coding: Simple Objectives for Equivariant Representations

    Authors: Mehran Shakerinava, Arnab Kumar Mondal, Siamak Ravanbakhsh

    Abstract: We present a simple non-generative approach to deep representation learning that seeks equivariant deep embedding through simple objectives. In contrast to existing equivariant networks, our transformation coding approach does not constrain the choice of the feed-forward layer or the architecture and allows for an unknown group action on the input space. We introduce several such transformation co…

    Submitted 18 February, 2022; originally announced February 2022.

  34. arXiv:2202.05808  [pdf, other]

    cs.LG cs.AI q-bio.NC

    Investigating Power laws in Deep Representation Learning

    Authors: Arna Ghosh, Arnab Kumar Mondal, Kumar Krishna Agrawal, Blake Richards

    Abstract: Representation learning that leverages large-scale labelled datasets is central to recent progress in machine learning. Access to task relevant labels at scale is often scarce or expensive, motivating the need to learn from unlabelled datasets with self-supervised learning (SSL). Such large unlabelled datasets (with data augmentations) often provide a good coverage of the underlying input distrib…

    Submitted 11 February, 2022; originally announced February 2022.

  35. arXiv:2202.02817  [pdf, other]

    cs.CR cs.LG

    BEAS: Blockchain Enabled Asynchronous & Secure Federated Machine Learning

    Authors: Arup Mondal, Harpreet Virk, Debayan Gupta

    Abstract: Federated Learning (FL) enables multiple parties to distributively train a ML model without revealing their private datasets. However, it assumes trust in the centralized aggregator which stores and aggregates model updates. This makes it prone to gradient tampering and privacy leakage by a malicious aggregator. Malicious parties can also introduce backdoors into the joint model by poisoning the t…

    Submitted 6 February, 2022; originally announced February 2022.

    Comments: The Third AAAI Workshop on Privacy-Preserving Artificial Intelligence (PPAI-22) at the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)

  36. arXiv:2201.08574  [pdf, other]

    cs.CV cs.AI cs.MM

    Classroom Slide Narration System

    Authors: Jobin K. V., Ajoy Mondal, C. V. Jawahar

    Abstract: Slide presentations are an effective and efficient tool used by the teaching community for classroom communication. However, this teaching model can be challenging for blind and visually impaired (VI) students. VI students require personal human assistance to understand the presented slides. This shortcoming motivates us to design a Classroom Slide Narration System (CSNS) that generates audio…

    Submitted 21 January, 2022; originally announced January 2022.

    Journal ref: CVIP 2021

  37. arXiv:2201.07730  [pdf, other]

    cs.CR cs.LG

    SCOTCH: An Efficient Secure Computation Framework for Secure Aggregation

    Authors: Yash More, Prashanthi Ramachandran, Priyam Panda, Arup Mondal, Harpreet Virk, Debayan Gupta

    Abstract: Federated learning enables multiple data owners to jointly train a machine learning model without revealing their private datasets. However, a malicious aggregation server might use the model parameters to derive sensitive information about the training dataset used. To address such leakage, differential privacy and cryptographic techniques have been investigated in prior work, but these often res…

    Submitted 15 February, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22), Third AAAI Privacy-Preserving Artificial Intelligence (PPAI-22) Workshop

  38. arXiv:2112.04948  [pdf, other]

    cs.LG

    PARL: Enhancing Diversity of Ensemble Networks to Resist Adversarial Attacks via Pairwise Adversarially Robust Loss Function

    Authors: Manaar Alam, Shubhajit Datta, Debdeep Mukhopadhyay, Arijit Mondal, Partha Pratim Chakrabarti

    Abstract: The security of Deep Learning classifiers is a critical field of study because of the existence of adversarial attacks. Such attacks usually rely on the principle of transferability, where an adversarial example crafted on a surrogate classifier tends to mislead the target classifier trained on the same dataset even if both classifiers have quite different architecture. Ensemble methods against ad…

    Submitted 9 December, 2021; originally announced December 2021.

  39. SCLAiR : Supervised Contrastive Learning for User and Device Independent Airwriting Recognition

    Authors: Ayush Tripathi, Arnab Kumar Mondal, Lalan Kumar, Prathosh A. P

    Abstract: Airwriting Recognition is the problem of identifying letters written in free space with finger movement. It is essentially a specialized case of gesture recognition, wherein the vocabulary of gestures corresponds to letters as in a particular language. With the wide adoption of smart wearables in the general population, airwriting recognition using motion sensors from a smart-band can be used as a…

    Submitted 29 December, 2021; v1 submitted 25 November, 2021; originally announced November 2021.

  40. New Performance Measures for Object Tracking under Complex Environments

    Authors: Ajoy Mondal

    Abstract: Various performance measures based on the ground truth and without ground truth exist to evaluate the quality of a developed tracking algorithm. The existing popular measures - average center location error (ACLE) and average tracking accuracy (ATA) based on ground truth, may sometimes create confusion to quantify the quality of a developed algorithm for tracking an object under some complex envir…

    Submitted 13 November, 2021; originally announced November 2021.

  41. arXiv:2111.07129  [pdf, other]

    cs.CV cs.AI

    Visual Understanding of Complex Table Structures from Document Images

    Authors: Sachin Raja, Ajoy Mondal, C V Jawahar

    Abstract: Table structure recognition is necessary for a comprehensive understanding of documents. Tables in unstructured business documents are tough to parse due to the high diversity of layouts, varying alignments of contents, and the presence of empty cells. The problem is particularly difficult because of challenges in identifying individual cells using visual or linguistic contexts or both. Accurate d…

    Submitted 13 November, 2021; originally announced November 2021.

  42. Deep Neural Networks for Automatic Grain-matrix Segmentation in Plane and Cross-polarized Sandstone Photomicrographs

    Authors: Rajdeep Das, Ajoy Mondal, Tapan Chakraborty, Kuntal Ghosh

    Abstract: Grain segmentation of sandstone, that is, partitioning the grain from its surrounding matrix/cement in the thin section, is the primary step for computer-aided mineral identification and sandstone classification. The microscopic images of sandstone contain many mineral grains and their surrounding matrix/cement. The distinction between adjacent grains and the matrix is often ambiguous, making grain s…

    Submitted 13 November, 2021; originally announced November 2021.

  43. arXiv:2111.06867  [pdf, other]

    cs.CR

    Flatee: Federated Learning Across Trusted Execution Environments

    Authors: Arup Mondal, Yash More, Ruthu Hulikal Rooparaghunath, Debayan Gupta

    Abstract: Federated learning allows us to distributively train a machine learning model where multiple parties share local model parameters without sharing private data. However, parameter exchange may still leak information. Several approaches have been proposed to overcome this, based on multi-party computation, fully homomorphic encryption, etc.; many of these protocols are slow and impractical for real-…

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: IEEE Euro S&P 2021 Poster; see https://www.ieee-security.org/TC/EuroSP2021/posters.html

  44. Moving Object Detection for Event-based vision using Graph Spectral Clustering

    Authors: Anindya Mondal, Shashant R, Jhony H. Giraldo, Thierry Bouwmans, Ananda S. Chowdhury

    Abstract: Moving object detection has been a central topic of discussion in computer vision because of its wide range of applications, such as self-driving cars, video surveillance, security, and enforcement. Neuromorphic Vision Sensors (NVS) are bio-inspired sensors that mimic the working of the human eye. Unlike conventional frame-based cameras, these sensors capture a stream of asynchronous 'events' that pose mu…

    Submitted 2 December, 2021; v1 submitted 30 September, 2021; originally announced September 2021.

    Comments: Ten pages, five figures, Published in 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Montreal, QC, Canada

  45. Moving Object Detection for Event-based Vision using k-means Clustering

    Authors: Anindya Mondal, Mayukhmali Das

    Abstract: Moving object detection is important in computer vision. Event-based cameras are bio-inspired cameras that mimic the working of the human eye. These cameras have multiple advantages over conventional frame-based cameras, such as reduced latency, HDR, reduced motion blur during high motion, low power consumption, etc. In spite of these advantages, event-based cameras are noise-sensitive an…

    Submitted 11 January, 2022; v1 submitted 4 September, 2021; originally announced September 2021.

    Comments: Nine pages, five figures, Published in 2021 IEEE 8th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)

  46. arXiv:2109.00659  [pdf, other]

    cs.SE

    Semantic Slicing of Architectural Change Commits: Towards Semantic Design Review

    Authors: Amit Kumar Mondal, Chanchal K. Roy, Kevin A. Schneider, Banani Roy, Sristy Sumana Nath

    Abstract: Software architectural changes involve more than one module or component and are complex to analyze compared to local code changes. Development teams aiming to review architectural aspects (design) of a change commit consider many essential scenarios such as access rules and restrictions on usage of program entities across modules. Moreover, design review is essential when proper architectural for…

    Submitted 1 September, 2021; originally announced September 2021.

  47. arXiv:2108.07793   

    cs.CY

    Modeling Pedagogical Learning Environment with Hybrid Model based on ICT

    Authors: Al Maruf Hassan, Istiak Ahmed Mondal

    Abstract: Pedagogy is a method that handles the ethos and culture of instruction from educators and the learning of learners. Pedagogy of Information and Communications Technology (ICT) refers to the interactions among the teacher, children, and learning environment based on ICT. It is a discipline that deals with the theory and practice of teaching strategies, teaching actions, teaching judgments, and deci…

    Submitted 27 August, 2021; v1 submitted 9 August, 2021; originally announced August 2021.

    Comments: Problem has been solved related to a big error in the basic concept

  48. arXiv:2107.07709  [pdf, other]

    cs.LG

    ScRAE: Deterministic Regularized Autoencoders with Flexible Priors for Clustering Single-cell Gene Expression Data

    Authors: Arnab Kumar Mondal, Himanshu Asnani, Parag Singla, Prathosh AP

    Abstract: Clustering single-cell RNA sequence (scRNA-seq) data poses statistical and computational challenges due to their high-dimensionality and data-sparsity, also known as 'dropout' events. Recently, Regularized Auto-Encoder (RAE) based deep neural network models have achieved remarkable success in learning robust low-dimensional representations. The basic idea in RAEs is to learn a non-linear mapping f…

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: IEEE/ACM Transactions on Computational Biology and Bioinformatics

  49. arXiv:2105.03237  [pdf, other]

    cs.CV cs.AI

    Mini-batch graphs for robust image classification

    Authors: Arnab Kumar Mondal, Vineet Jain, Kaleem Siddiqi

    Abstract: Current deep learning models for classification tasks in computer vision are trained using mini-batches. In the present article, we take advantage of the relationships between samples in a mini-batch, using graph neural networks to aggregate information from similar images. This helps mitigate the adverse effects of alterations to the input images on classification performance. Diverse experiments…

    Submitted 21 April, 2021; originally announced May 2021.

  50. arXiv:2104.08524  [pdf, other]

    cs.CL

    Multilingual and Cross-Lingual Intent Detection from Spoken Data

    Authors: Daniela Gerz, Pei-Hao Su, Razvan Kusztos, Avishek Mondal, Michał Lis, Eshan Singhal, Nikola Mrkšić, Tsung-Hsien Wen, Ivan Vulić

    Abstract: We present a systematic study on multilingual and cross-lingual intent detection from spoken data. The study leverages a new resource put forth in this work, termed MInDS-14, a first training and evaluation resource for the intent detection task with spoken data. It covers 14 intents extracted from a commercial system in the e-banking domain, associated with spoken examples in 14 diverse language…

    Submitted 17 April, 2021; originally announced April 2021.