Zum Hauptinhalt springen

Showing 1–50 of 100 results for author: Ghosh, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04360  [pdf, other

    cs.SE

    Size biased Multinomial Modelling of detection data in Software testing

    Authors: Pallabi Ghosh, Ashis Kr. Chakraborty, Soumen Dey

    Abstract: Estimation of software reliability often poses a considerable challenge, particularly for critical softwares. Several methods of estimation of reliability of software are already available in the literature. But, so far almost nobody used the concept of size of a bug for estimating software reliability. In this article we make used of the bug size or the eventual bug size which helps us to determi… ▽ More

    Submitted 24 May, 2024; originally announced June 2024.

    Comments: Submitted to OPSEARCH

  2. arXiv:2405.14835  [pdf, other

    cs.DS cs.CC

    Polynomial Pass Semi-Streaming Lower Bounds for K-Cores and Degeneracy

    Authors: Sepehr Assadi, Prantar Ghosh, Bruno Loff, Parth Mittal, Sagnik Mukhopadhyay

    Abstract: The following question arises naturally in the study of graph streaming algorithms: "Is there any graph problem which is "not too hard", in that it can be solved efficiently with total communication (nearly) linear in the number $n$ of vertices, and for which, nonetheless, any streaming algorithm with $\tilde{O}(n)$ space (i.e., a semi-streaming algorithm) needs a polynomial $n^{Ω(1)}$ number of… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted at CCC 2024

  3. arXiv:2405.05952  [pdf, other

    cs.DS

    New Algorithms and Lower Bounds for Streaming Tournaments

    Authors: Prantar Ghosh, Sahil Kuchlous

    Abstract: We study fundamental directed graph (digraph) problems in the streaming model. An initial investigation by Chakrabarti, Ghosh, McGregor, and Vorotnikova [SODA'20] on streaming digraphs showed that while most of these problems are provably hard in general, some of them become tractable when restricted to the well-studied class of tournament graphs where every pair of nodes shares exactly one direct… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  4. Neural network based approach for solving problems in plane wave duct acoustics

    Authors: D. Veerababu, Prasanta K. Ghosh

    Abstract: Neural networks have emerged as a tool for solving differential equations in many branches of engineering and science. But their progress in frequency domain acoustics is limited by the vanishing gradient problem that occurs at higher frequencies. This paper discusses a formulation that can address this issue. The problem of solving the governing differential equation along with the boundary condi… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Published Journal Article

    ACM Class: G.1.6; I.6.4; J.2

    Journal ref: Journal of Sound and Vibration, 585, 2024:118476

  5. arXiv:2404.04530  [pdf, other

    cs.CL

    A Morphology-Based Investigation of Positional Encodings

    Authors: Poulami Ghosh, Shikhar Vashishth, Raj Dabre, Pushpak Bhattacharyya

    Abstract: Contemporary deep learning models effectively handle languages with diverse morphology despite not being directly integrated into them. Morphology and word order are closely linked, with the latter incorporated into transformer-based models through positional encodings. This prompts a fundamental inquiry: Is there a correlation between the morphological complexity of a language and the utilization… ▽ More

    Submitted 30 May, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    Comments: Work in Progress

  6. Multimodal Healthcare AI: Identifying and Designing Clinically Relevant Vision-Language Applications for Radiology

    Authors: Nur Yildirim, Hannah Richardson, Maria T. Wetscherek, Junaid Bajwa, Joseph Jacob, Mark A. Pinnock, Stephen Harris, Daniel Coelho de Castro, Shruthi Bannur, Stephanie L. Hyland, Pratik Ghosh, Mercy Ranjit, Kenza Bouzid, Anton Schwaighofer, Fernando Pérez-García, Harshita Sharma, Ozan Oktay, Matthew Lungren, Javier Alvarez-Valle, Aditya Nori, Anja Thieme

    Abstract: Recent advances in AI combine large language models (LLMs) with vision encoders that bring forward unprecedented technical capabilities to leverage for a wide range of healthcare applications. Focusing on the domain of radiology, vision-language models (VLMs) achieve good performance results for tasks such as generating radiology findings based on a patient's medical image, or answering visual que… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: to appear at CHI 2024

  7. Feature Selection using the concept of Peafowl Mating in IDS

    Authors: Partha Ghosh, Joy Sharma, Nilesh Pandey

    Abstract: Cloud computing has high applicability as an Internet based service that relies on sharing computing resources. Cloud computing provides services that are Infrastructure based, Platform based and Software based. The popularity of this technology is due to its superb performance, high level of computing ability, low cost of services, scalability, availability and flexibility. The obtainability and… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Journal ref: International Journal of Computer Networks & Communications (IJCNC) Vol.16, No.1, January 2024

  8. arXiv:2401.06378  [pdf, ps, other

    cs.DS

    New Lower Bounds in Merlin-Arthur Communication and Graph Streaming Verification

    Authors: Prantar Ghosh, Vihan Shah

    Abstract: We show new lower bounds in the \emph{Merlin-Arthur} (MA) communication model and the related \emph{annotated streaming} or stream verification model. The MA communication model is an enhancement of the classical communication model, where in addition to the usual players Alice and Bob, there is an all-powerful but untrusted player Merlin who knows their inputs and tries to convince them about the… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: To appear in ITCS 2024

  9. arXiv:2401.06035  [pdf, other

    cs.CV cs.LG

    RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-plane Networks

    Authors: Partha Ghosh, Soubhik Sanyal, Cordelia Schmid, Bernhard Schölkopf

    Abstract: We present a novel unconditional video generative model designed to address long-term spatial and temporal dependencies, with attention to computational and dataset efficiency. To capture long spatio-temporal dependencies, our approach incorporates a hybrid explicit-implicit tri-plane representation inspired by 3D-aware generative frameworks developed for three-dimensional object representation an… ▽ More

    Submitted 11 August, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  10. arXiv:2312.01275  [pdf, other

    q-bio.MN cs.LG cs.SI

    A Review of Link Prediction Applications in Network Biology

    Authors: Ahmad F. Al Musawi, Satyaki Roy, Preetam Ghosh

    Abstract: In the domain of network biology, the interactions among heterogeneous genomic and molecular entities are represented through networks. Link prediction (LP) methodologies are instrumental in inferring missing or prospective associations within these biological networks. In this review, we systematically dissect the attributes of local, centrality, and embedding-based LP approaches, applied to stat… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  11. arXiv:2311.05435  [pdf

    cs.LG cs.SD eess.AS

    Parkinson's Disease Detection through Vocal Biomarkers and Advanced Machine Learning Algorithms

    Authors: Md Abu Sayed, Maliha Tayaba, MD Tanvir Islam, Md Eyasin Ul Islam Pavel, Md Tuhin Mia, Eftekhar Hossain Ayon, Nur Nob, Bishnu Padh Ghosh

    Abstract: Parkinson's disease (PD) is a prevalent neurodegenerative disorder known for its impact on motor neurons, causing symptoms like tremors, stiffness, and gait difficulties. This study explores the potential of vocal feature alterations in PD patients as a means of early disease prediction. This research aims to predict the onset of Parkinson's disease. Utilizing a variety of advanced machine-learnin… ▽ More

    Submitted 2 December, 2023; v1 submitted 9 November, 2023; originally announced November 2023.

  12. arXiv:2309.11805  [pdf, other

    cs.AI

    JobRecoGPT -- Explainable job recommendations using LLMs

    Authors: Preetam Ghosh, Vaishali Sadaphal

    Abstract: In today's rapidly evolving job market, finding the right opportunity can be a daunting challenge. With advancements in the field of AI, computers can now recommend suitable jobs to candidates. However, the task of recommending jobs is not same as recommending movies to viewers. Apart from must-have criteria, like skills and experience, there are many subtle aspects to a job which can decide if it… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 10 pages, 29 figures

  13. arXiv:2308.10638  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    SCULPT: Shape-Conditioned Unpaired Learning of Pose-dependent Clothed and Textured Human Meshes

    Authors: Soubhik Sanyal, Partha Ghosh, Jinlong Yang, Michael J. Black, Justus Thies, Timo Bolkart

    Abstract: We present SCULPT, a novel 3D generative model for clothed and textured 3D meshes of humans. Specifically, we devise a deep neural network that learns to represent the geometry and appearance distribution of clothed human bodies. Training such a model is challenging, as datasets of textured 3D meshes for humans are limited in size and accessibility. Our key observation is that there exist medium-s… ▽ More

    Submitted 6 May, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: Updated to camera ready version of CVPR 2024

  14. arXiv:2307.09882  [pdf, other

    cs.LG cs.AI

    Adversarial Likelihood Estimation With One-Way Flows

    Authors: Omri Ben-Dov, Pravir Singh Gupta, Victoria Abrevaya, Michael J. Black, Partha Ghosh

    Abstract: Generative Adversarial Networks (GANs) can produce high-quality samples, but do not provide an estimate of the probability density around the samples. However, it has been noted that maximizing the log-likelihood within an energy-based setting can lead to an adversarial framework where the discriminator provides unnormalized density (often called energy). We further develop this perspective, incor… ▽ More

    Submitted 2 October, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

  15. arXiv:2307.07948  [pdf, ps, other

    eess.AS cs.CL

    Model Adaptation for ASR in low-resource Indian Languages

    Authors: Abhayjeet Singh, Arjun Singh Mehta, Ashish Khuraishi K S, Deekshitha G, Gauri Date, Jai Nanavati, Jesuraja Bandekar, Karnalius Basumatary, Karthika P, Sandhya Badiger, Sathvik Udupa, Saurabh Kumar, Savitha, Prasanta Kumar Ghosh, Prashanthi V, Priyanka Pai, Raoul Nanavati, Rohan Saxena, Sai Praneeth Reddy Mora, Srinivasa Raghavan

    Abstract: Automatic speech recognition (ASR) performance has improved drastically in recent years, mainly enabled by self-supervised learning (SSL) based acoustic models such as wav2vec2 and large-scale multi-lingual training like Whisper. A huge challenge still exists for low-resource languages where the availability of both audio and text is limited. This is further complicated by the presence of multiple… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: ASRU Special session overview paper

  16. arXiv:2304.14807  [pdf, other

    physics.plasm-ph cs.AI cs.LG physics.comp-ph

    Deep Learning assisted microwave-plasma interaction based technique for plasma density estimation

    Authors: Pratik Ghosh, Bhaskar Chaudhury, Shishir Purohit, Vishv Joshi, Ashray Kothari, Devdeep Shetranjiwala

    Abstract: The electron density is a key parameter to characterize any plasma. Most of the plasma applications and research in the area of low-temperature plasmas (LTPs) are based on the accurate estimations of plasma density and plasma temperature. The conventional methods for electron density measurements offer axial and radial profiles for any given linear LTP device. These methods have major disadvantage… ▽ More

    Submitted 28 June, 2023; v1 submitted 28 April, 2023; originally announced April 2023.

  17. arXiv:2304.12285  [pdf, ps, other

    cs.DS

    Low-Memory Algorithms for Online and W-Streaming Edge Coloring

    Authors: Prantar Ghosh, Manuel Stoeckl

    Abstract: For edge coloring, the online and the W-streaming models seem somewhat orthogonal: the former needs edges to be assigned colors immediately after insertion, typically without any space restrictions, while the latter limits memory to sublinear in the input size but allows an edge's color to be announced any time after its insertion. We aim for the best of both worlds by designing small-space online… ▽ More

    Submitted 31 May, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: 36 pages, 1 figure; improvements to Thm 1.8 and minor edits since v1

  18. arXiv:2303.12343  [pdf, other

    cs.CV

    LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation

    Authors: Koutilya Pnvr, Bharat Singh, Pallabi Ghosh, Behjat Siddiquie, David Jacobs

    Abstract: Large-scale pre-training tasks like image classification, captioning, or self-supervised techniques do not incentivize learning the semantic boundaries of objects. However, recent generative foundation models built using text-based latent diffusion techniques may learn semantic boundaries. This is because they have to synthesize intricate details about all objects in an image based on a text descr… ▽ More

    Submitted 23 August, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: Supplementary material is included in the paper following the references section

  19. arXiv:2303.11434  [pdf, other

    cs.LG q-bio.QM

    ResDTA: Predicting Drug-Target Binding Affinity Using Residual Skip Connections

    Authors: Partho Ghosh, Md. Aynal Haque

    Abstract: The discovery of novel drug target (DT) interactions is an important step in the drug development process. The majority of computer techniques for predicting DT interactions have focused on binary classification, with the goal of determining whether or not a DT pair interacts. Protein ligand interactions, on the other hand, assume a continuous range of binding strength values, also known as bindin… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 40 pages, 10 figures, 2 tables. arXiv admin note: substantial text overlap with arXiv:1801.10193, arXiv:1902.04166 by other authors

  20. arXiv:2302.01374  [pdf, other

    cs.LG

    Neural Network Architecture for Database Augmentation Using Shared Features

    Authors: William C. Sleeman IV, Rishabh Kapoor, Preetam Ghosh

    Abstract: The popularity of learning from data with machine learning and neural networks has lead to the creation of many new datasets for almost every problem domain. However, even within a single domain, these datasets are often collected with disparate features, sampled from different sub-populations, and recorded at different time points. Even with the plethora of individual datasets, large data science… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: 22 pages, 8 figures, 4 tables

    ACM Class: I.5.1; I.5.2

  21. arXiv:2212.10641  [pdf, ps, other

    cs.DS

    Coloring in Graph Streams via Deterministic and Adversarially Robust Algorithms

    Authors: Sepehr Assadi, Amit Chakrabarti, Prantar Ghosh, Manuel Stoeckl

    Abstract: In recent years, there has been a growing interest in solving various graph coloring problems in the streaming model. The initial algorithms in this line of work are all crucially randomized, raising natural questions about how important a role randomization plays in streaming graph coloring. A couple of very recent works have made progress on this question: they prove that deterministic or even a… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: 29 pages

  22. arXiv:2212.09284  [pdf, other

    cs.CL

    An Investigation of Indian Native Language Phonemic Influences on L2 English Pronunciations

    Authors: Shelly Jain, Priyanshi Pal, Anil Vuppala, Prasanta Ghosh, Chiranjeevi Yarra

    Abstract: Speech systems are sensitive to accent variations. This is especially challenging in the Indian context, with an abundance of languages but a dearth of linguistic studies characterising pronunciation variations. The growing number of L2 English speakers in India reinforces the need to study accents and L1-L2 interactions. We investigate the accents of Indian English (IE) speakers and report in det… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 9 pages, 1 figure

  23. arXiv:2211.03109  [pdf, other

    eess.IV cs.CV

    A Sequence Agnostic Multimodal Preprocessing for Clogged Blood Vessel Detection in Alzheimer's Diagnosis

    Authors: Partho Ghosh, Md. Abrar Istiak, Mir Sayeed Mohammad, Swapnil Saha, Uday Kamal

    Abstract: Successful identification of blood vessel blockage is a crucial step for Alzheimer's disease diagnosis. These blocks can be identified from the spatial and time-depth variable Two-Photon Excitation Microscopy (TPEF) images of the brain blood vessels using machine learning methods. In this study, we propose several preprocessing schemes to improve the performance of these methods. Our method includ… ▽ More

    Submitted 6 November, 2022; originally announced November 2022.

    Comments: 5 pages, 4 figures

  24. arXiv:2210.16881  [pdf, other

    eess.AS cs.SD

    Real-Time MRI Video synthesis from time aligned phonemes with sequence-to-sequence networks

    Authors: Sathvik Udupa, Prasanta Kumar Ghosh

    Abstract: Real-Time Magnetic resonance imaging (rtMRI) of the midsagittal plane of the mouth is of interest for speech production research. In this work, we focus on estimating utterance level rtMRI video from the spoken phoneme sequence. We obtain time-aligned phonemes from forced alignment, to obtain frame-level phoneme sequences which are aligned with rtMRI frames. We propose a sequence-to-sequence learn… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

    Comments: submitted to ICASSP 2023

  25. arXiv:2210.16871  [pdf, other

    eess.AS cs.SD

    Improved acoustic-to-articulatory inversion using representations from pretrained self-supervised learning models

    Authors: Sathvik Udupa, Siddarth C, Prasanta Kumar Ghosh

    Abstract: In this work, we investigate the effectiveness of pretrained Self-Supervised Learning (SSL) features for learning the mapping for acoustic to articulatory inversion (AAI). Signal processing-based acoustic features such as MFCCs have been predominantly used for the AAI task with deep neural networks. With SSL features working well for various other speech tasks such as speech recognition, emotion c… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

    Comments: submitted to ICASSP 2023

  26. arXiv:2210.12921  [pdf

    cs.CL cs.SD eess.AS

    Investigating self-supervised, weakly supervised and fully supervised training approaches for multi-domain automatic speech recognition: a study on Bangladeshi Bangla

    Authors: Ahnaf Mozib Samin, M. Humayon Kobir, Md. Mushtaq Shahriyar Rafee, M. Firoz Ahmed, Mehedi Hasan, Partha Ghosh, Shafkat Kibria, M. Shahidur Rahman

    Abstract: Despite huge improvements in automatic speech recognition (ASR) employing neural networks, ASR systems still suffer from a lack of robustness and generalizability issues due to domain shifting. This is mainly because principal corpus design criteria are often not identified and examined adequately while compiling ASR datasets. In this study, we investigate the robustness of the state-of-the-art tr… ▽ More

    Submitted 10 May, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

  27. arXiv:2207.08930  [pdf, other

    cs.RO

    Cooperative Infrastructure Perception

    Authors: Fawad Ahmad, Christina Suyong Shin, Weiwu Pang, Branden Leong, Pradipta Ghosh, Ramesh Govindan

    Abstract: Recent works have considered two qualitatively different approaches to overcome line-of-sight limitations of 3D sensors used for perception: cooperative perception and infrastructure-augmented perception. In this paper, motivated by increasing deployments of infrastructure LiDARs, we explore a third approach, cooperative infrastructure perception. This approach generates perception outputs by fusi… ▽ More

    Submitted 26 June, 2024; v1 submitted 18 July, 2022; originally announced July 2022.

  28. An Automated Deployment and Testing Framework for Resilient Distributed Smart Grid Applications

    Authors: Purboday Ghosh, Hao Tu, Timothy Krentz, Gabor Karsai, Srdjan Lukic

    Abstract: Executing distributed cyber-physical software processes on edge devices that maintains the resiliency of the overall system while adhering to resource constraints is quite a challenging trade-off to consider for developers. Current approaches do not solve this problem of deploying software components to devices in a way that satisfies different resilience requirements that can be encoded by develo… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

    Comments: accepted, pending publication

  29. arXiv:2206.12952  [pdf, other

    cs.CV cs.LG

    Nonwatertight Mesh Reconstruction

    Authors: Partha Ghosh

    Abstract: Reconstructing 3D non-watertight mesh from an unoriented point cloud is an unexplored area in computer vision and computer graphics. In this project, we tried to tackle this problem by extending the learning-based watertight mesh reconstruction pipeline presented in the paper 'Shape as Points'. The core of our approach is to cast the problem as a semantic segmentation problem that identifies the r… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: text overlap with arXiv:2106.03452 by other authors

  30. arXiv:2206.11563  [pdf, other

    cs.LG cs.AI

    LED: Latent Variable-based Estimation of Density

    Authors: Omri Ben-Dov, Pravir Singh Gupta, Victoria Fernandez Abrevaya, Michael J. Black, Partha Ghosh

    Abstract: Modern generative models are roughly divided into two main categories: (1) models that can produce high-quality random samples, but cannot estimate the exact density of new data points and (2) those that provide exact density estimation, at the expense of sample quality and compactness of the latent space. In this work we propose LED, a new generative model closely related to GANs, that allows not… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

  31. arXiv:2206.01263  [pdf, other

    physics.comp-ph cs.LG

    Deep Learning Architecture Based Approach For 2D-Simulation of Microwave Plasma Interaction

    Authors: Mihir Desai, Pratik Ghosh, Ahlad Kumar, Bhaskar Chaudhury

    Abstract: This paper presents a convolutional neural network (CNN)-based deep learning model, inspired from UNet with series of encoder and decoder units with skip connections, for the simulation of microwave-plasma interaction. The microwave propagation characteristics in complex plasma medium pertaining to transmission, absorption and reflection primarily depends on the ratio of electromagnetic (EM) wave… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

  32. arXiv:2204.08106  [pdf, other

    cs.DS

    A New Dynamic Algorithm for Densest Subhypergraphs

    Authors: Suman K. Bera, Sayan Bhattacharya, Jayesh Choudhari, Prantar Ghosh

    Abstract: Computing a dense subgraph is a fundamental problem in graph mining, with a diverse set of applications ranging from electronic commerce to community detection in social networks. In many of these applications, the underlying context is better modelled as a weighted hypergraph that keeps evolving with time. This motivates the problem of maintaining the densest subhypergraph of a weighted hypergr… ▽ More

    Submitted 17 April, 2022; originally announced April 2022.

    Comments: Extended abstract appears in TheWebConf (previously WWW) 2022

  33. arXiv:2204.06502  [pdf, ps, other

    cs.CL

    Study of Indian English Pronunciation Variabilities relative to Received Pronunciation

    Authors: Priyanshi Pal, Shelly Jain, Anil Vuppala, Chiranjeevi Yarra, Prasanta Ghosh

    Abstract: Analysis of Indian English (IE) pronunciation variabilities are useful in building systems for Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) synthesis in the Indian context. Typically, these pronunciation variabilities have been explored by comparing IE pronunciation with Received Pronunciation (RP). However, to explore these variabilities, it is required to have labelled pronunciati… ▽ More

    Submitted 9 December, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

  34. Deep Hyperspectral Unmixing using Transformer Network

    Authors: Preetam Ghosh, Swalpa Kumar Roy, Bikram Koirala, Behnood Rasti, Paul Scheunders

    Abstract: Currently, this paper is under review in IEEE. Transformers have intrigued the vision research community with their state-of-the-art performance in natural language processing. With their superior performance, transformers have found their way in the field of hyperspectral image classification and achieved promising results. In this article, we harness the power of transformers to conquer the task… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

    Comments: Currently, this paper is under review in IEEE

  35. arXiv:2203.06004  [pdf, other

    cs.CV eess.AS

    An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production

    Authors: Anwesha Roy, Varun Belagali, Prasanta Kumar Ghosh

    Abstract: The best performance in Air-tissue boundary (ATB) segmentation of real-time Magnetic Resonance Imaging (rtMRI) videos in speech production is known to be achieved by a 3-dimensional convolutional neural network (3D-CNN) model. However, the evaluation of this model, as well as other ATB segmentation techniques reported in the literature, is done using Dynamic Time Warping (DTW) distance between the… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: accepted for ICASSP 2022

  36. arXiv:2202.12440  [pdf, other

    stat.ML cs.LG

    On Learning and Testing of Counterfactual Fairness through Data Preprocessing

    Authors: Haoyu Chen, Wenbin Lu, Rui Song, Pulak Ghosh

    Abstract: Machine learning has become more important in real-life decision-making but people are concerned about the ethical problems it may bring when used improperly. Recent work brings the discussion of machine learning fairness into the causal framework and elaborates on the concept of Counterfactual Fairness. In this paper, we develop the Fair Learning through dAta Preprocessing (FLAP) algorithm to lea… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

  37. arXiv:2202.07448  [pdf, other

    cs.CY cs.NI

    Towards a Unified Pandemic Management Architecture: Survey, Challenges and Future Directions

    Authors: Satyaki Roy, Nirnay Ghosh, Nitish Uplavikar, Preetam Ghosh

    Abstract: The pandemic caused by SARS-CoV-2 has left an unprecedented impact on health, economy and society worldwide. Emerging strains are making pandemic management increasingly challenging. There is an urge to collect epidemiological, clinical, and physiological data to make an informed decision on mitigation measures. Advances in the Internet of Things (IoT) and edge computing provide solutions for pand… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: 30 pages and 10 figures

  38. arXiv:2112.06848  [pdf, ps, other

    cs.DC

    Peer-to-Peer Communication Trade-Offs for Smart Grid Applications

    Authors: Purboday Ghosh, Shashank Shekhar, Yashen Lin, Ulrich Muenz, Gabor Karsai

    Abstract: Virtual topologies in peer-to-peer networks can reduce the traffic consumed by altering the logical connectivity of peers without altering the underlying network. However, such sparsely connected virtual topologies do not focus on the needs for smart grid applications, which is information dissemination throughout the network, and in turn degrade the performance of distributed control algorithms r… ▽ More

    Submitted 4 May, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: 10 pages, 6 figures

  39. arXiv:2112.04598  [pdf, other

    cs.CV cs.LG stat.ML

    InvGAN: Invertible GANs

    Authors: Partha Ghosh, Dominik Zietlow, Michael J. Black, Larry S. Davis, Xiaochen Hu

    Abstract: Generation of photo-realistic images, semantic editing and representation learning are a few of many potential applications of high resolution generative models. Recent progress in GANs have established them as an excellent choice for such tasks. However, since they do not provide an inference model, image editing or downstream tasks such as classification can not be done on real images using the… ▽ More

    Submitted 10 December, 2021; v1 submitted 8 December, 2021; originally announced December 2021.

  40. arXiv:2112.04151  [pdf, ps, other

    eess.AS cs.CL cs.SD

    A study on native American English speech recognition by Indian listeners with varying word familiarity level

    Authors: Abhayjeet Singh, Achuth Rao MV, Rakesh Vaideeswaran, Chiranjeevi Yarra, Prasanta Kumar Ghosh

    Abstract: In this study, listeners of varied Indian nativities are asked to listen and recognize TIMIT utterances spoken by American speakers. We have three kinds of responses from each listener while they recognize an utterance: 1. Sentence difficulty ratings, 2. Speaker difficulty ratings, and 3. Transcription of the utterance. From these transcriptions, word error rate (WER) is calculated and used as a m… ▽ More

    Submitted 8 December, 2021; originally announced December 2021.

    Comments: 6 pages, 5 figues, COCOSDA 2021

  41. arXiv:2109.11130  [pdf, other

    cs.DS

    Adversarially Robust Coloring for Graph Streams

    Authors: Amit Chakrabarti, Prantar Ghosh, Manuel Stoeckl

    Abstract: A streaming algorithm is considered to be adversarially robust if it provides correct outputs with high probability even when the stream updates are chosen by an adversary who may observe and react to the past outputs of the algorithm. We grow the burgeoning body of work on such algorithms in a new direction by studying robust algorithms for the problem of maintaining a valid vertex coloring of an… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

  42. arXiv:2109.09020  [pdf, other

    cs.LG cs.AI

    Multimodal Classification: Current Landscape, Taxonomy and Future Directions

    Authors: William C. Sleeman IV, Rishabh Kapoor, Preetam Ghosh

    Abstract: Multimodal classification research has been gaining popularity in many domains that collect more data from multiple sources including satellite imagery, biometrics, and medicine. However, the lack of consistent terminology and architectural descriptions makes it difficult to compare different existing solutions. We address these challenges by proposing a new taxonomy for describing such systems ba… ▽ More

    Submitted 18 September, 2021; originally announced September 2021.

    Comments: 24 pages, 3 tables, 7 figures

    ACM Class: I.5.2

  43. arXiv:2107.07875  [pdf, other

    stat.ML cs.LG

    A Penalized Shared-parameter Algorithm for Estimating Optimal Dynamic Treatment Regimens

    Authors: Palash Ghosh, Trikay Nalamada, Shruti Agarwal, Maria Jahja, Bibhas Chakraborty

    Abstract: A dynamic treatment regimen (DTR) is a set of decision rules to personalize treatments for an individual using their medical history. The Q-learning based Q-shared algorithm has been used to develop DTRs that involve decision rules shared across multiple stages of intervention. We show that the existing Q-shared algorithm can suffer from non-convergence due to the use of linear models in the Q-lea… ▽ More

    Submitted 26 May, 2022; v1 submitted 13 July, 2021; originally announced July 2021.

  44. arXiv:2106.14292  [pdf, other

    eess.IV cs.CV

    Knee Osteoarthritis Severity Prediction using an Attentive Multi-Scale Deep Convolutional Neural Network

    Authors: Rohit Kumar Jain, Prasen Kumar Sharma, Sibaji Gaj, Arijit Sur, Palash Ghosh

    Abstract: Knee Osteoarthritis (OA) is a destructive joint disease identified by joint stiffness, pain, and functional disability concerning millions of lives across the globe. It is generally assessed by evaluating physical symptoms, medical history, and other joint screening tests like radiographs, Magnetic Resonance Imaging (MRI), and Computed Tomography (CT) scans. Unfortunately, the conventional methods… ▽ More

    Submitted 27 June, 2021; originally announced June 2021.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  45. arXiv:2106.00639  [pdf, other

    eess.AS cs.SD eess.SP

    Multi-modal Point-of-Care Diagnostics for COVID-19 Based On Acoustics and Symptoms

    Authors: Srikanth Raj Chetupalli, Prashant Krishnan, Neeraj Sharma, Ananya Muguli, Rohit Kumar, Viral Nanda, Lancelot Mark Pinto, Prasanta Kumar Ghosh, Sriram Ganapathy

    Abstract: The research direction of identifying acoustic bio-markers of respiratory diseases has received renewed interest following the onset of COVID-19 pandemic. In this paper, we design an approach to COVID-19 diagnostic using crowd-sourced multi-modal data. The data resource, consisting of acoustic signals like cough, breathing, and speech signals, along with the data of symptoms, are recorded using a… ▽ More

    Submitted 5 June, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: The Manuscript is submitted to IEEE-EMBS Journal of Biomedical and Health Informatics on June 1, 2021

  46. arXiv:2105.08215  [pdf, ps, other

    cs.DS

    Vertex Ordering Problems in Directed Graph Streams

    Authors: Amit Chakrabarti, Prantar Ghosh, Andrew McGregor, Sofya Vorotnikova

    Abstract: We consider directed graph algorithms in a streaming setting, focusing on problems concerning orderings of the vertices. This includes such fundamental problems as topological sorting and acyclicity testing. We also study the related problems of finding a minimum feedback arc set (edges whose removal yields an acyclic graph), and finding a sink vertex. We are interested in both adversarially-order… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Appeared in SODA 2020

  47. arXiv:2104.05017  [pdf, other

    eess.AS cs.SD

    Estimating articulatory movements in speech production with transformer networks

    Authors: Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh

    Abstract: We estimate articulatory movements in speech production from different modalities - acoustics and phonemes. Acoustic-to articulatory inversion (AAI) is a sequence-to-sequence task. On the other hand, phoneme to articulatory (PTA) motion estimation faces a key challenge in reliably aligning the text and the articulatory movements. To address this challenge, we explore the use of a transformer archi… ▽ More

    Submitted 12 June, 2021; v1 submitted 11 April, 2021; originally announced April 2021.

    Comments: accepted for oral presentation at INTERSPEECH 2021

  48. Multilingual and code-switching ASR challenges for low resource Indian languages

    Authors: Anuj Diwan, Rakesh Vaideeswaran, Sanket Shah, Ankita Singh, Srinivasa Raghavan, Shreya Khare, Vinit Unni, Saurabh Vyas, Akash Rajpuria, Chiranjeevi Yarra, Ashish Mittal, Prasanta Kumar Ghosh, Preethi Jyothi, Kalika Bali, Vivek Seshadri, Sunayana Sitaram, Samarth Bharadwaj, Jai Nanavati, Raoul Nanavati, Karthik Sankaranarayanan, Tejaswi Seeram, Basil Abraham

    Abstract: Recently, there is increasing interest in multilingual automatic speech recognition (ASR) where a speech recognition system caters to multiple low resource languages by taking advantage of low amounts of labeled corpora in multiple languages. With multilingualism becoming common in today's world, there has been increasing interest in code-switching ASR as well. In code-switching, multiple language… ▽ More

    Submitted 31 March, 2021; originally announced April 2021.

    Comments: 6 pages

  49. arXiv:2103.09148  [pdf, other

    eess.AS cs.SD

    DiCOVA Challenge: Dataset, task, and baseline system for COVID-19 diagnosis using acoustics

    Authors: Ananya Muguli, Lancelot Pinto, Nirmala R., Neeraj Sharma, Prashant Krishnan, Prasanta Kumar Ghosh, Rohit Kumar, Shrirama Bhat, Srikanth Raj Chetupalli, Sriram Ganapathy, Shreyas Ramoji, Viral Nanda

    Abstract: The DiCOVA challenge aims at accelerating research in diagnosing COVID-19 using acoustics (DiCOVA), a topic at the intersection of speech and audio processing, respiratory health diagnosis, and machine learning. This challenge is an open call for researchers to analyze a dataset of sound recordings collected from COVID-19 infected and non-COVID-19 individuals for a two-class classification. These… ▽ More

    Submitted 17 June, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: To appear in Proceedings of Interspeech, 2021

  50. arXiv:2012.11581  [pdf, other

    cs.CV

    Populating 3D Scenes by Learning Human-Scene Interaction

    Authors: Mohamed Hassan, Partha Ghosh, Joachim Tesch, Dimitrios Tzionas, Michael J. Black

    Abstract: Humans live within a 3D space and constantly interact with it to perform tasks. Such interactions involve physical contact between surfaces that is semantically meaningful. Our goal is to learn how humans interact with scenes and leverage this to enable virtual characters to do the same. To that end, we introduce a novel Human-Scene Interaction (HSI) model that encodes proximal relationships, call… ▽ More

    Submitted 5 April, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Journal ref: CVPR2021