Zum Hauptinhalt springen

Showing 1–43 of 43 results for author: Banerjee, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.13945  [pdf, other

    eess.IV cs.CV physics.med-ph

    Personalized Topology-Informed 12-Lead ECG Electrode Localization from Incomplete Cardiac MRIs for Efficient Cardiac Digital Twins

    Authors: Lei Li, Hannah Smith, Yilin Lyu, Julia Camps, Blanca Rodriguez, Abhirup Banerjee, Vicente Grau

    Abstract: Cardiac digital twins (CDTs) offer personalized \textit{in-silico} cardiac representations for the inference of multi-scale properties tied to cardiac mechanisms. The creation of CDTs requires precise information about the electrode position on the torso, especially for the personalized electrocardiogram (ECG) calibration. However, current studies commonly rely on additional acquisition of torso i… ▽ More

    Submitted 25 August, 2024; originally announced August 2024.

    Comments: 12 pages

  2. arXiv:2408.05950  [pdf, other

    cs.NE cs.AI cs.SD eess.AS

    Robust online reconstruction of continuous-time signals from a lean spike train ensemble code

    Authors: Anik Chattopadhyay, Arunava Banerjee

    Abstract: Sensory stimuli in animals are encoded into spike trains by neurons, offering advantages such as sparsity, energy efficiency, and high temporal resolution. This paper presents a signal processing framework that deterministically encodes continuous-time signals into biologically feasible spike trains, and addresses the questions about representable signal classes and reconstruction bounds. The fram… ▽ More

    Submitted 14 August, 2024; v1 submitted 12 August, 2024; originally announced August 2024.

    Comments: 22 pages, including a 9-page appendix, 8 figures. A GitHub link to the project implementation is embedded in the paper

  3. arXiv:2408.01996  [pdf, other

    cs.ET eess.SY

    Configuring Safe Spiking Neural Controllers for Cyber-Physical Systems through Formal Verification

    Authors: Arkaprava Gupta, Sumana Ghosh, Ansuman Banerjee, Swarup Kumar Mohalik

    Abstract: Spiking Neural Networks (SNNs) are a subclass of neuromorphic models that have great potential to be used as controllers in Cyber-Physical Systems (CPSs) due to their energy efficiency. They can benefit from the prevalent approach of first training an Artificial Neural Network (ANN) and then translating to an SNN with subsequent hyperparameter tuning. The tuning is required to ensure that the resu… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

    Comments: This is the complete version of a paper with the same title that appeared at MEMOCODE 2024

  4. arXiv:2407.14616  [pdf, other

    eess.IV cs.CV

    Deep Learning-based 3D Coronary Tree Reconstruction from Two 2D Non-simultaneous X-ray Angiography Projections

    Authors: Yiying Wang, Abhirup Banerjee, Robin P. Choudhury, Vicente Grau

    Abstract: Cardiovascular diseases (CVDs) are the most common cause of death worldwide. Invasive x-ray coronary angiography (ICA) is one of the most important imaging modalities for the diagnosis of CVDs. ICA typically acquires only two 2D projections, which makes the 3D geometry of coronary vessels difficult to interpret, thus requiring 3D coronary tree reconstruction from two projections. State-of-the-art… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: 16 pages, 13 figures, 3 tables

  5. arXiv:2407.06727  [pdf, other

    eess.IV cs.CV

    Towards Physics-informed Cyclic Adversarial Multi-PSF Lensless Imaging

    Authors: Abeer Banerjee, Sanjay Singh

    Abstract: Lensless imaging has emerged as a promising field within inverse imaging, offering compact, cost-effective solutions with the potential to revolutionize the computational camera market. By circumventing traditional optical components like lenses and mirrors, novel approaches like mask-based lensless imaging eliminate the need for conventional hardware. However, advancements in lensless image recon… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  6. arXiv:2405.11458  [pdf, other

    cs.AI eess.SY

    CPS-LLM: Large Language Model based Safe Usage Plan Generator for Human-in-the-Loop Human-in-the-Plant Cyber-Physical System

    Authors: Ayan Banerjee, Aranyak Maity, Payal Kamboj, Sandeep K. S. Gupta

    Abstract: We explore the usage of large language models (LLM) in human-in-the-loop human-in-the-plant cyber-physical systems (CPS) to translate a high-level prompt into a personalized plan of actions, and subsequently convert that plan into a grounded inference of sequential decision-making automated by a real-world CPS controller to achieve a control goal. We show that it is relatively straightforward to c… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: Accepted for publication in AAAI 2024, Planning for Cyber Physical Systems

  7. arXiv:2404.17045  [pdf, other

    eess.SY cs.RO

    Toward Automated Formation of Composite Micro-Structures Using Holographic Optical Tweezers

    Authors: Tommy Zhang, Nicole Werner, Ashis G. Banerjee

    Abstract: Holographic Optical Tweezers (HOT) are powerful tools that can manipulate micro and nano-scale objects with high accuracy and precision. They are most commonly used for biological applications, such as cellular studies, and more recently, micro-structure assemblies. Automation has been of significant interest in the HOT field, since human-run experiments are time-consuming and require skilled oper… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: To appear in the Proceedings of the 2024 International Conference on Manipulation, Automation and Robotics at Small Scales (MARSS)

  8. arXiv:2403.10581  [pdf, other

    q-bio.QM cs.AI cs.CL cs.LG eess.SP

    Large Language Model-informed ECG Dual Attention Network for Heart Failure Risk Prediction

    Authors: Chen Chen, Lei Li, Marcel Beetz, Abhirup Banerjee, Ramneek Gupta, Vicente Grau

    Abstract: Heart failure (HF) poses a significant public health challenge, with a rising global mortality rate. Early detection and prevention of HF could significantly reduce its impact. We introduce a novel methodology for predicting HF risk using 12-lead electrocardiograms (ECGs). We present a novel, lightweight dual-attention ECG network designed to capture complex ECG features essential for early HF ris… ▽ More

    Submitted 22 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: Under journal revision

  9. arXiv:2403.02909  [pdf, other

    cs.CV cs.HC eess.IV

    Gaze-Vector Estimation in the Dark with Temporally Encoded Event-driven Neural Networks

    Authors: Abeer Banerjee, Naval K. Mehta, Shyam S. Prasad, Himanshu, Sumeet Saurav, Sanjay Singh

    Abstract: In this paper, we address the intricate challenge of gaze vector prediction, a pivotal task with applications ranging from human-computer interaction to driver monitoring systems. Our innovative approach is designed for the demanding setting of extremely low-light conditions, leveraging a novel temporal event encoding scheme, and a dedicated neural network architecture. The temporal encoding metho… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  10. arXiv:2401.13345  [pdf

    eess.SY

    FPGA Implementation of an Intelligent Traffic Light Controller (I-TLC) in Verilog

    Authors: Apoorva Banerjee

    Abstract: The objective of this paper is to design and implement an intelligent Traffic Light Controller system for a four way road intersection. The design is carried out using Verilog, and the hardware is implemented on a FPGA. The chosen intersection involves a 'main road' (heavy traffic flow) and a 'side road' (less traffic flow), which is equipped with sensors to detect the presence of traffic or pedes… ▽ More

    Submitted 23 February, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: The nature of the changes in the updated version involves incorporating synthesis work. Additionally, hardware implementation results on the FPGA board have been added. Moderate changes have been made, as they introduce new aspects related to synthesis and provide valuable insights into the hardware implementation, but they do not alter or affect the existing simulation results

  11. arXiv:2312.14844  [pdf, other

    eess.AS cs.SD physics.med-ph

    An Implantable Piezofilm Middle Ear Microphone: Performance in Human Cadaveric Temporal Bones

    Authors: John Z. Zhang, Lukas Graf, Annesya Banerjee, Aaron Yeiser, Christopher I. McHugh, Ioannis Kymissis, Jeffrey H. Lang, Elizabeth S. Olson, Hideko Heidi Nakajima

    Abstract: Purpose: One of the major reasons that totally implantable cochlear microphones are not readily available is the lack of good implantable microphones. An implantable microphone has the potential to provide a range of benefits over external microphones for cochlear implant users including the filtering ability of the outer ear, cosmetics, and usability in all situations. This paper presents results… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  12. arXiv:2312.13976  [pdf

    physics.med-ph cs.AI cs.CG eess.IV q-bio.QM

    Anatomical basis of human sex differences in ECG identified by automated torso-cardiac three-dimensional reconstruction

    Authors: Hannah J. Smith, Blanca Rodriguez, Yuling Sang, Marcel Beetz, Robin Choudhury, Vicente Grau, Abhirup Banerjee

    Abstract: Background and Aims: The electrocardiogram (ECG) is routinely used for diagnosis and risk stratification following myocardial infarction (MI), though its interpretation is confounded by anatomical variability and sex differences. Women have a higher incidence of missed MI diagnosis and poorer outcomes following infarction. Sex differences in ECG biomarkers and torso-ventricular anatomy have not be… ▽ More

    Submitted 17 July, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Paper under revision

  13. arXiv:2312.13752  [pdf

    eess.IV cs.AI cs.CV

    Hunting imaging biomarkers in pulmonary fibrosis: Benchmarks of the AIIB23 challenge

    Authors: Yang Nan, Xiaodan Xing, Shiyi Wang, Zeyu Tang, Federico N Felder, Sheng Zhang, Roberta Eufrasia Ledda, Xiaoliu Ding, Ruiqi Yu, Weiping Liu, Feng Shi, Tianyang Sun, Zehong Cao, Minghui Zhang, Yun Gu, Hanxiao Zhang, Jian Gao, Pingyu Wang, Wen Tang, Pengxin Yu, Han Kang, Junqiang Chen, Xing Lu, Boyu Zhang, Michail Mamalakis , et al. (16 additional authors not shown)

    Abstract: Airway-related quantitative imaging biomarkers are crucial for examination, diagnosis, and prognosis in pulmonary diseases. However, the manual delineation of airway trees remains prohibitively time-consuming. While significant efforts have been made towards enhancing airway modelling, current public-available datasets concentrate on lung diseases with moderate morphological variations. The intric… ▽ More

    Submitted 16 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 19 pages

  14. arXiv:2309.06558  [pdf, other

    eess.SY cs.AI math.DS math.NA

    High Fidelity Fast Simulation of Human in the Loop Human in the Plant (HIL-HIP) Systems

    Authors: Ayan Banerjee, Payal Kamboj, Aranyak Maity, Riya Sudhakar Salian, Sandeep K. S. Gupta

    Abstract: Non-linearities in simulation arise from the time variance in wireless mobile networks when integrated with human in the loop, human in the plant (HIL-HIP) physical systems under dynamic contexts, leading to simulation slowdown. Time variance is handled by deriving a series of piece wise linear time invariant simulations (PLIS) in intervals, which are then concatenated in time domain. In this pape… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: To appear in ACM MSWIM 2023

  15. arXiv:2309.04856  [pdf, other

    cs.LG cs.AI eess.IV

    AmbientFlow: Invertible generative models from incomplete, noisy measurements

    Authors: Varun A. Kelkar, Rucha Deshpande, Arindam Banerjee, Mark A. Anastasio

    Abstract: Generative models have gained popularity for their potential applications in imaging science, such as image reconstruction, posterior sampling and data sharing. Flow-based generative models are particularly attractive due to their ability to tractably provide exact density estimates along with fast, inexpensive and diverse samples. Training such models, however, requires a large, high quality data… ▽ More

    Submitted 13 December, 2023; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: Accepted to Transactions on Machine Learning Research (TMLR). OpenReview: https://openreview.net/forum?id=txpYITR8oa

  16. arXiv:2309.02603  [pdf, other

    cs.AI eess.SY

    Detection of Unknown-Unknowns in Human-in-Plant Human-in-Loop Systems Using Physics Guided Process Models

    Authors: Aranyak Maity, Ayan Banerjee, Sandeep Gupta

    Abstract: Unknown-unknowns are operational scenarios in systems that are not accounted for in the design and test phase. In such scenarios, the operational behavior of the Human-in-loop (HIL) Human-in-Plant (HIP) systems is not guaranteed to meet requirements such as safety and efficacy. We propose a novel framework for analyzing the operational output characteristics of safety-critical HIL-HIP systems that… ▽ More

    Submitted 12 December, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

  17. arXiv:2308.06382  [pdf, other

    cs.SD cs.LG eess.AS

    Phoneme Hallucinator: One-shot Voice Conversion via Set Expansion

    Authors: Siyuan Shan, Yang Li, Amartya Banerjee, Junier B. Oliva

    Abstract: Voice conversion (VC) aims at altering a person's voice to make it sound similar to the voice of another person while preserving linguistic content. Existing methods suffer from a dilemma between content intelligibility and speaker similarity; i.e., methods with higher intelligibility usually have a lower speaker similarity, while methods with higher speaker similarity usually require plenty of ta… ▽ More

    Submitted 30 December, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: AAAI 2024 Demo, Codes: https://phonemehallucinator.github.io/

  18. arXiv:2307.11017  [pdf, other

    cs.CV cs.LG eess.IV

    Multi-objective point cloud autoencoders for explainable myocardial infarction prediction

    Authors: Marcel Beetz, Abhirup Banerjee, Vicente Grau

    Abstract: Myocardial infarction (MI) is one of the most common causes of death in the world. Image-based biomarkers commonly used in the clinic, such as ejection fraction, fail to capture more complex patterns in the heart's 3D anatomy and thus limit diagnostic accuracy. In this work, we present the multi-objective point cloud autoencoder as a novel geometric deep learning approach for explainable infarctio… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

  19. arXiv:2307.10927  [pdf, other

    eess.IV cs.CV cs.LG

    Modeling 3D cardiac contraction and relaxation with point cloud deformation networks

    Authors: Marcel Beetz, Abhirup Banerjee, Vicente Grau

    Abstract: Global single-valued biomarkers of cardiac function typically used in clinical practice, such as ejection fraction, provide limited insight on the true 3D cardiac deformation process and hence, limit the understanding of both healthy and pathological cardiac mechanics. In this work, we propose the Point Cloud Deformation Network (PCD-Net) as a novel geometric deep learning approach to model 3D car… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

  20. arXiv:2307.08535  [pdf, other

    eess.IV cs.CV cs.LG

    Multi-class point cloud completion networks for 3D cardiac anatomy reconstruction from cine magnetic resonance images

    Authors: Marcel Beetz, Abhirup Banerjee, Julius Ossenberg-Engels, Vicente Grau

    Abstract: Cine magnetic resonance imaging (MRI) is the current gold standard for the assessment of cardiac anatomy and function. However, it typically only acquires a set of two-dimensional (2D) slices of the underlying three-dimensional (3D) anatomy of the heart, thus limiting the understanding and analysis of both healthy and pathological cardiac morphology and physiology. In this paper, we propose a nove… ▽ More

    Submitted 18 July, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

  21. arXiv:2307.07298  [pdf, other

    cs.CV cs.LG eess.IV

    3D Shape-Based Myocardial Infarction Prediction Using Point Cloud Classification Networks

    Authors: Marcel Beetz, Yilong Yang, Abhirup Banerjee, Lei Li, Vicente Grau

    Abstract: Myocardial infarction (MI) is one of the most prevalent cardiovascular diseases with associated clinical decision-making typically based on single-valued imaging biomarkers. However, such metrics only approximate the complex 3D structure and physiology of the heart and hence hinder a better understanding and prediction of MI outcomes. In this work, we investigate the utility of complete 3D cardiac… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: Accepted at EMBC 2023

  22. arXiv:2307.04421  [pdf, other

    eess.SP cs.CV eess.IV

    Towards Enabling Cardiac Digital Twins of Myocardial Infarction Using Deep Computational Models for Inverse Inference

    Authors: Lei Li, Julia Camps, Zhinuo, Wang, Abhirup Banerjee, Marcel Beetz, Blanca Rodriguez, Vicente Grau

    Abstract: Cardiac digital twins (CDTs) have the potential to offer individualized evaluation of cardiac function in a non-invasive manner, making them a promising approach for personalized diagnosis and treatment planning of my-ocardial infarction (MI). The inference of accurate myocardial tissue properties is crucial in creating a reliable CDT of MI. In this work, we investigate the feasibility of inferrin… ▽ More

    Submitted 14 February, 2024; v1 submitted 10 July, 2023; originally announced July 2023.

    Comments: Cardiac digital twins; Inverse inference; Myocardial infarction

    MSC Class: N/A

  23. arXiv:2306.09424  [pdf, other

    cs.LG cs.CV eess.IV

    SSL4EO-L: Datasets and Foundation Models for Landsat Imagery

    Authors: Adam J. Stewart, Nils Lehmann, Isaac A. Corley, Yi Wang, Yi-Chia Chang, Nassim Ait Ali Braham, Shradha Sehgal, Caleb Robinson, Arindam Banerjee

    Abstract: The Landsat program is the longest-running Earth observation program in history, with 50+ years of data acquisition by 8 satellites. The multispectral imagery captured by sensors onboard these satellites is critical for a wide range of scientific fields. Despite the increasing popularity of deep learning and remote sensing, the majority of researchers still use decision trees and random forests fo… ▽ More

    Submitted 22 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  24. arXiv:2306.02680  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    BeAts: Bengali Speech Acts Recognition using Multimodal Attention Fusion

    Authors: Ahana Deb, Sayan Nag, Ayan Mahapatra, Soumitri Chattopadhyay, Aritra Marik, Pijush Kanti Gayen, Shankha Sanyal, Archi Banerjee, Samir Karmakar

    Abstract: Spoken languages often utilise intonation, rhythm, intensity, and structure, to communicate intention, which can be interpreted differently depending on the rhythm of speech of their utterance. These speech acts provide the foundation of communication and are unique in expression to the language. Recent advancements in attention-based models, demonstrating their ability to learn powerful represent… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted at INTERSPEECH 2023

  25. arXiv:2211.03209  [pdf, other

    eess.SY math.DS math.OC

    Robust Decentralized Secondary Control Scheme for Inverter-based Power Networks

    Authors: Siddharth Bhela, Abhishek Banerjee, Ulrich Muenz, Joachim Bamberger

    Abstract: Inverter-dominated microgrids are quickly becoming a key building block of future power systems. They rely on centralized controllers that can provide reliability and resiliency in extreme events. Nonetheless, communication failures due to cyber-physical attacks or natural disasters can make autonomous operation of islanded microgrids challenging. This paper examines a unified decentralized second… ▽ More

    Submitted 9 July, 2023; v1 submitted 6 November, 2022; originally announced November 2022.

    Comments: 7 pages, 9 figures

  26. arXiv:2209.06618  [pdf, ps, other

    eess.SY cs.RO

    Safe Autonomous Docking Maneuvers for a Floating Platform based on Input Sharing Control Barrier Functions

    Authors: Akshit Saradagi, Avijit Banerjee, Sumeet Satpute, George Nikolakopoulos

    Abstract: In this article, we present a control strategy for the problem of safe autonomous docking for a planar floating platform (Slider) that emulates the movement of a satellite. Employing the proposed strategy, Slider approaches a docking port with the right orientation, maintaining a safe distance, while always keeping a visual lock on the docking port throughout the docking maneuver. Control barrier… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: 8 Pages, 5 Figures, Accepted for presentation of 61st IEEE Conference on Decision and Control, Dec. 6-9, 2022, in Cancun, Mexico

  27. A Residual Network based Deep Learning Model for Detection of COVID-19 from Cough Sounds

    Authors: Annesya Banerjee, Achal Nilhani

    Abstract: The present work proposes a deep-learning-based approach for the classification of COVID-19 coughs from non-COVID-19 coughs and that can be used as a low-resource-based tool for early detection of the onset of such respiratory diseases. The proposed system uses the ResNet-50 architecture, a popularly known Convolutional Neural Network (CNN) for image recognition tasks, fed with the log-Mel spectru… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

  28. Evaluating Sensor Data Quality in Internet ofThings Smart Agriculture Applications

    Authors: Kaneez Fizza, Prem Prakash Jayaraman, Abhik Banerjee, Dimitrios Georgakopoulos, Rajiv Ranjan

    Abstract: The unprecedented growth of Internet of Things (IoT) and its applications in areas such as Smart Agriculture compels the need to devise newer ways for evaluating the quality of such applications. While existing models for application quality focus on the quality experienced by the end-user (captured using likert scale), IoT applications have minimal human involvement and rely on machine to machine… ▽ More

    Submitted 28 April, 2021; originally announced May 2021.

    Comments: Technical Report under review with IEEE micro

    Report number: 1937-4143

    Journal ref: IEEE Micro 21 December 2021

  29. DenResCov-19: A deep transfer learning network for robust automatic classification of COVID-19, pneumonia, and tuberculosis from X-rays

    Authors: Michail Mamalakis, Andrew J. Swift, Bart Vorselaars, Surajit Ray, Simonne Weeks, Weiping Ding, Richard H. Clayton, Louise S. Mackenzie, Abhirup Banerjee

    Abstract: The global pandemic of COVID-19 is continuing to have a significant effect on the well-being of global population, increasing the demand for rapid testing, diagnosis, and treatment. Along with COVID-19, other etiologies of pneumonia and tuberculosis constitute additional challenges to the medical system. In this regard, the objective of this work is to develop a new deep transfer learning pipeline… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

    Report number: 102008, 0895-6111

    Journal ref: 2021, Computerized Medical Imaging and Graphics

  30. arXiv:2102.06038  [pdf

    cs.SD cs.CL eess.AS

    A Fractal Approach to Characterize Emotions in Audio and Visual Domain: A Study on Cross-Modal Interaction

    Authors: Sayan Nag, Uddalok Sarkar, Shankha Sanyal, Archi Banerjee, Souparno Roy, Samir Karmakar, Ranjan Sengupta, Dipak Ghosh

    Abstract: It is already known that both auditory and visual stimulus is able to convey emotions in human mind to different extent. The strength or intensity of the emotional arousal vary depending on the type of stimulus chosen. In this study, we try to investigate the emotional arousal in a cross-modal scenario involving both auditory and visual stimulus while studying their source characteristics. A robus… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  31. arXiv:2102.06003  [pdf

    cs.SD cs.CL eess.AS

    Language Independent Emotion Quantification using Non linear Modelling of Speech

    Authors: Uddalok Sarkar, Sayan Nag, Chirayata Bhattacharya, Shankha Sanyal, Archi Banerjee, Ranjan Sengupta, Dipak Ghosh

    Abstract: At present emotion extraction from speech is a very important issue due to its diverse applications. Hence, it becomes absolutely necessary to obtain models that take into consideration the speaking styles of a person, vocal tract information, timbral qualities and other congenital information regarding his voice. Our speech production system is a nonlinear system like most other real world system… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  32. arXiv:2102.00616  [pdf

    cs.SD cs.LG cs.MM eess.AS

    Neural Network architectures to classify emotions in Indian Classical Music

    Authors: Uddalok Sarkar, Sayan Nag, Medha Basu, Archi Banerjee, Shankha Sanyal, Ranjan Sengupta, Dipak Ghosh

    Abstract: Music is often considered as the language of emotions. It has long been known to elicit emotions in human being and thus categorizing music based on the type of emotions they induce in human being is a very intriguing topic of research. When the task comes to classify emotions elicited by Indian Classical Music (ICM), it becomes much more challenging because of the inherent ambiguity associated wi… ▽ More

    Submitted 31 January, 2021; originally announced February 2021.

  33. arXiv:2101.06335  [pdf, other

    cs.RO eess.SY math.DS

    Slider: On the Design and Modeling of a 2D Floating Satellite Platform

    Authors: Avijit Banerjee, Jakub Haluska, Sumeet G. Satpute, Dariusz Kominiak, George Nikolakopoulos

    Abstract: In this article, a floating robotic emulation platform for a virtual demonstration of satellite motion in space is presented. The robotic platform design is characterized by its friction-less, levitating, yet planar motion over a hyper-smooth surface. The robotic platform, integrated with sensor and actuator units, is fully designed and manufactured from the Robotics and Artificial Intelligence Te… ▽ More

    Submitted 15 January, 2021; originally announced January 2021.

  34. arXiv:2008.00247  [pdf, other

    cs.CV cs.LG eess.IV

    Meta-DRN: Meta-Learning for 1-Shot Image Segmentation

    Authors: Atmadeep Banerjee

    Abstract: Modern deep learning models have revolutionized the field of computer vision. But, a significant drawback of most of these models is that they require a large number of labelled examples to generalize properly. Recent developments in few-shot learning aim to alleviate this requirement. In this paper, we propose a novel lightweight CNN architecture for 1-shot image segmentation. The proposed model… ▽ More

    Submitted 1 August, 2020; originally announced August 2020.

  35. arXiv:2006.14718  [pdf, other

    cs.LG cs.RO eess.SP stat.ML

    Asynchronous Multi Agent Active Search

    Authors: Ramina Ghods, Arundhati Banerjee, Jeff Schneider

    Abstract: Active search refers to the problem of efficiently locating targets in an unknown environment by actively making data-collection decisions, and has many applications including detecting gas leaks, radiation sources or human survivors of disasters using aerial and/or ground robots (agents). Existing active search methods are in general only amenable to a single agent, or if they extend to multi age… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: Preprint under review

  36. arXiv:2004.08248  [pdf

    eess.AS cs.SD nlin.CD q-bio.NC

    Acoustical classification of different speech acts using nonlinear methods

    Authors: Chirayata Bhattacharyya, Sourya Sengupta, Sayan Nag, Shankha Sanyal, Archi Banerjee, Ranjan Sengupta, Dipak Ghosh

    Abstract: A recitation is a way of combining the words together so that they have a sense of rhythm and thus an emotional content is imbibed within. In this study we envisaged to answer these questions in a scientific manner taking into consideration 5 (five) well known Bengali recitations of different poets conveying a variety of moods ranging from joy to sorrow. The clips were recited as well as read (in… ▽ More

    Submitted 5 August, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

    Comments: 6 pages, 2 figures; Proceedings of WESPAC 2018, New Delhi, India, November 11-15, 2018

  37. arXiv:2004.07820  [pdf

    cs.SD cs.CL eess.AS

    Speaker Recognition in Bengali Language from Nonlinear Features

    Authors: Uddalok Sarkar, Soumyadeep Pal, Sayan Nag, Chirayata Bhattacharya, Shankha Sanyal, Archi Banerjee, Ranjan Sengupta, Dipak Ghosh

    Abstract: At present Automatic Speaker Recognition system is a very important issue due to its diverse applications. Hence, it becomes absolutely necessary to obtain models that take into consideration the speaking style of a person, vocal tract information, timbral qualities of his voice and other congenital information regarding his voice. The study of Bengali speech recognition and speaker identification… ▽ More

    Submitted 15 April, 2020; originally announced April 2020.

    Comments: arXiv admin note: text overlap with arXiv:1612.00171, arXiv:1601.07709

  38. arXiv:2004.07003  [pdf, other

    eess.IV cs.CV

    MXR-U-Nets for Real Time Hyperspectral Reconstruction

    Authors: Atmadeep Banerjee, Akash Palrecha

    Abstract: In recent times, CNNs have made significant contributions to applications in image generation, super-resolution and style transfer. In this paper, we build upon the work of Howard and Gugger, He et al. and Misra, D. and propose a CNN architecture that accurately reconstructs hyperspectral images from their RGB counterparts. We also propose a much shallower version of our best model with a 10% rela… ▽ More

    Submitted 15 April, 2020; originally announced April 2020.

    ACM Class: I.4.5; I.4.10

  39. arXiv:1910.11090  [pdf, other

    cs.CV cs.LG eess.AS stat.ML

    Emotion Generation and Recognition: A StarGAN Approach

    Authors: Aritra Banerjee, Dimitrios Kollias

    Abstract: The main idea of this ISO is to use StarGAN (A type of GAN model) to perform training and testing on an emotion dataset resulting in a emotion recognition which can be generated by the valence arousal score of the 7 basic expressions. We have created an entirely new dataset consisting of 4K videos. This dataset consists of all the basic 7 types of emotions: Happy, Sad, Angry, Surprised, Fear, Disg… ▽ More

    Submitted 12 October, 2019; originally announced October 2019.

  40. arXiv:1907.03898  [pdf

    physics.app-ph eess.SY

    Parametrically Amplified Low-Power MEMS Capacitive Humidity Sensor

    Authors: Rugved Likhite, Aishwaryadev Banerjee, Apratim Majumder, Hanseup Kim and, Carlos H. Mastrangelo

    Abstract: We present the design, fabrication, and response of a polymer-based Laterally Amplified Chemo-Mechanical (LACM) humidity sensor based on mechanical leveraging and parametric amplification. The device consists of a sense cantilever asymmetrically patterned with a polymer and flanked by two stationary electrodes on the sides. When exposed to a humidity change, the polymer swells after absorbing the… ▽ More

    Submitted 8 July, 2019; originally announced July 2019.

  41. arXiv:1907.03576  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Deep Learning-Based Semantic Segmentation of Microscale Objects

    Authors: Ekta U. Samani, Wei Guo, Ashis G. Banerjee

    Abstract: Accurate estimation of the positions and shapes of microscale objects is crucial for automated imaging-guided manipulation using a non-contact technique such as optical tweezers. Perception methods that use traditional computer vision algorithms tend to fail when the manipulation environments are crowded. In this paper, we present a deep learning model for semantic segmentation of the images repre… ▽ More

    Submitted 3 July, 2019; originally announced July 2019.

    Comments: A condensed version of the paper is published in the Proceedings of the 2019 International Conference on Manipulation, Automation and Robotics at Small Scales

  42. arXiv:1805.08865  [pdf

    eess.AS cs.SD

    Speaker Recognition using Deep Belief Networks

    Authors: Adrish Banerjee, Akash Dubey, Abhishek Menon, Shubham Nanda, Gora Chand Nandi

    Abstract: Short time spectral features such as mel frequency cepstral coefficients(MFCCs) have been previously deployed in state of the art speaker recognition systems, however lesser heed has been paid to short term spectral features that can be learned by generative learning models from speech signals. Higher dimensional encoders such as deep belief networks (DBNs) could improve performance in speaker rec… ▽ More

    Submitted 9 May, 2018; originally announced May 2018.

  43. arXiv:1712.08336  [pdf

    q-bio.NC cs.SD eess.AS physics.data-an

    Music of Brain and Music on Brain: A Novel EEG Sonification approach

    Authors: Sayan Nag, Shankha Sanyal, Archi Banerjee, Ranjan Sengupta, Dipak Ghosh

    Abstract: Can we hear the sound of our brain? Is there any technique which can enable us to hear the neuro-electrical impulses originating from the different lobes of brain? The answer to all these questions is YES. In this paper we present a novel method with which we can sonify the Electroencephalogram (EEG) data recorded in rest state as well as under the influence of a simplest acoustical stimuli - a ta… ▽ More

    Submitted 22 December, 2017; originally announced December 2017.

    Comments: 6 pages, 4 figures; Presented in the International Symposium on Frontiers of Research in speech and Music (FRSM)-2017, held at NIT, Rourkela in 15-16 December 2017