Zum Hauptinhalt springen

Showing 1–50 of 65 results for author: Chakraborty, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.02793  [pdf, other

    cs.AR

    Evaluating Large Language Models for Automatic Register Transfer Logic Generation via High-Level Synthesis

    Authors: Sneha Swaroopa, Rijoy Mukherjee, Anushka Debnath, Rajat Subhra Chakraborty

    Abstract: The ever-growing popularity of large language models (LLMs) has resulted in their increasing adoption for hardware design and verification. Prior research has attempted to assess the capability of LLMs to automate digital hardware design by producing superior-quality Register Transfer Logic (RTL) descriptions, particularly in Verilog. However, these tests have revealed that Verilog code production… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  2. arXiv:2406.12444  [pdf, other

    cs.CY cs.SI

    Who Checks the Checkers? Exploring Source Credibility in Twitter's Community Notes

    Authors: Uku Kangur, Roshni Chakraborty, Rajesh Sharma

    Abstract: In recent years, the proliferation of misinformation on social media platforms has become a significant concern. Initially designed for sharing information and fostering social connections, platforms like Twitter (now rebranded as X) have also unfortunately become conduits for spreading misinformation. To mitigate this, these platforms have implemented various mechanisms, including the recent sugg… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.09390  [pdf, other

    cs.CV cs.LG

    LLAVIDAL: Benchmarking Large Language Vision Models for Daily Activities of Living

    Authors: Rajatsubhra Chakraborty, Arkaprava Sinha, Dominick Reilly, Manish Kumar Govind, Pu Wang, Francois Bremond, Srijan Das

    Abstract: Large Language Vision Models (LLVMs) have demonstrated effectiveness in processing internet videos, yet they struggle with the visually perplexing dynamics present in Activities of Daily Living (ADL) due to limited pertinent datasets and models tailored to relevant cues. To this end, we propose a framework for curating ADL multiview datasets to fine-tune LLVMs, resulting in the creation of ADL-X,… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2405.06551  [pdf, other

    cs.CL cs.SI

    ADSumm: Annotated Ground-truth Summary Datasets for Disaster Tweet Summarization

    Authors: Piyush Kumar Garg, Roshni Chakraborty, Sourav Kumar Dandapat

    Abstract: Online social media platforms, such as Twitter, provide valuable information during disaster events. Existing tweet disaster summarization approaches provide a summary of these events to aid government agencies, humanitarian organizations, etc., to ensure effective disaster response. In the literature, there are two types of approaches for disaster summarization, namely, supervised and unsupervise… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  5. arXiv:2405.06541  [pdf, other

    cs.CL cs.SI

    ATSumm: Auxiliary information enhanced approach for abstractive disaster Tweet Summarization with sparse training data

    Authors: Piyush Kumar Garg, Roshni Chakraborty, Sourav Kumar Dandapat

    Abstract: The abundance of situational information on Twitter poses a challenge for users to manually discern vital and relevant information during disasters. A concise and human-interpretable overview of this information helps decision-makers in implementing efficient and quick disaster response. Existing abstractive summarization approaches can be categorized as sentence-based or key-phrase-based approach… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  6. arXiv:2403.13870  [pdf, other

    cs.CV cs.LG

    ExMap: Leveraging Explainability Heatmaps for Unsupervised Group Robustness to Spurious Correlations

    Authors: Rwiddhi Chakraborty, Adrian Sletten, Michael Kampffmeyer

    Abstract: Group robustness strategies aim to mitigate learned biases in deep learning models that arise from spurious correlations present in their training datasets. However, most existing methods rely on the access to the label distribution of the groups, which is time-consuming and expensive to obtain. As a result, unsupervised group robustness strategies are sought. Based on the insight that a trained m… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  7. arXiv:2403.11418  [pdf, other

    cs.LG cs.AI

    Variational Sampling of Temporal Trajectories

    Authors: Jurijs Nazarovs, Zhichun Huang, Xingjian Zhen, Sourav Pal, Rudrasis Chakraborty, Vikas Singh

    Abstract: A deterministic temporal process can be determined by its trajectory, an element in the product space of (a) initial condition $z_0 \in \mathcal{Z}$ and (b) transition function $f: (\mathcal{Z}, \mathcal{T}) \to \mathcal{Z}$ often influenced by the control of the underlying dynamical system. Existing methods often model the transition function as a differential equation or as a recurrent neural ne… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  8. arXiv:2312.04548  [pdf, other

    cs.CV cs.AI cs.LG

    Multiview Aerial Visual Recognition (MAVREC): Can Multi-view Improve Aerial Visual Perception?

    Authors: Aritra Dutta, Srijan Das, Jacob Nielsen, Rajatsubhra Chakraborty, Mubarak Shah

    Abstract: Despite the commercial abundance of UAVs, aerial data acquisition remains challenging, and the existing Asia and North America-centric open-source UAV datasets are small-scale or low-resolution and lack diversity in scene contextuality. Additionally, the color content of the scenes, solar-zenith angle, and population density of different geographies influence the data diversity. These two factors… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    ACM Class: I.4.0; I.4.8; I.5.1; I.5.4; I.2.10

  9. arXiv:2310.18367  [pdf, other

    physics.chem-ph cs.AI cs.CV cs.LG

    Unsupervised Learning of Molecular Embeddings for Enhanced Clustering and Emergent Properties for Chemical Compounds

    Authors: Jaiveer Gill, Ratul Chakraborty, Reetham Gubba, Amy Liu, Shrey Jain, Chirag Iyer, Obaid Khwaja, Saurav Kumar

    Abstract: The detailed analysis of molecular structures and properties holds great potential for drug development discovery through machine learning. Developing an emergent property in the model to understand molecules would broaden the horizons for development with a new computational tool. We introduce various methods to detect and cluster chemical compounds based on their SMILES data. Our first method, a… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  10. arXiv:2309.15670  [pdf

    cs.LG cs.CL

    MONOVAB : An Annotated Corpus for Bangla Multi-label Emotion Detection

    Authors: Sumit Kumar Banshal, Sajal Das, Shumaiya Akter Shammi, Narayan Ranjan Chakraborty

    Abstract: In recent years, Sentiment Analysis (SA) and Emotion Recognition (ER) have been increasingly popular in the Bangla language, which is the seventh most spoken language throughout the entire world. However, the language is structurally complicated, which makes this field arduous to extract emotions in an accurate manner. Several distinct approaches such as the extraction of positive and negative sen… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  11. arXiv:2305.11592  [pdf, ps, other

    cs.CL cs.SI

    IKDSumm: Incorporating Key-phrases into BERT for extractive Disaster Tweet Summarization

    Authors: Piyush Kumar Garg, Roshni Chakraborty, Srishti Gupta, Sourav Kumar Dandapat

    Abstract: Online social media platforms, such as Twitter, are one of the most valuable sources of information during disaster events. Therefore, humanitarian organizations, government agencies, and volunteers rely on a summary of this information, i.e., tweets, for effective disaster management. Although there are several existing supervised and unsupervised approaches for automated tweet summary approaches… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  12. arXiv:2305.11536  [pdf, other

    cs.CL cs.SI

    PORTRAIT: a hybrid aPproach tO cReate extractive ground-TRuth summAry for dIsaster evenT

    Authors: Piyush Kumar Garg, Roshni Chakraborty, Sourav Kumar Dandapat

    Abstract: Disaster summarization approaches provide an overview of the important information posted during disaster events on social media platforms, such as, Twitter. However, the type of information posted significantly varies across disasters depending on several factors like the location, type, severity, etc. Verification of the effectiveness of disaster summarization approaches still suffer due to the… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  13. arXiv:2303.09352  [pdf, other

    cs.CV

    Hubs and Hyperspheres: Reducing Hubness and Improving Transductive Few-shot Learning with Hyperspherical Embeddings

    Authors: Daniel J. Trosten, Rwiddhi Chakraborty, Sigurd Løkse, Kristoffer Knutsen Wickstrøm, Robert Jenssen, Michael C. Kampffmeyer

    Abstract: Distance-based classification is frequently used in transductive few-shot learning (FSL). However, due to the high-dimensionality of image representations, FSL classifiers are prone to suffer from the hubness problem, where a few points (hubs) occur frequently in multiple nearest neighbour lists of other points. Hubness negatively impacts distance-based classification when hubs from one class appe… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  14. arXiv:2210.06354  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    Text-to-Audio Grounding Based Novel Metric for Evaluating Audio Caption Similarity

    Authors: Swapnil Bhosale, Rupayan Chakraborty, Sunil Kumar Kopparapu

    Abstract: Automatic Audio Captioning (AAC) refers to the task of translating an audio sample into a natural language (NL) text that describes the audio events, source of the events and their relationships. Unlike NL text generation tasks, which rely on metrics like BLEU, ROUGE, METEOR based on lexical semantics for evaluation, the AAC evaluation metric requires an ability to map NL text (phrases) that corre… ▽ More

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: 9 pages, 8 figures,

  15. arXiv:2207.09684  [pdf, other

    cs.CV

    On the Versatile Uses of Partial Distance Correlation in Deep Learning

    Authors: Xingjian Zhen, Zihang Meng, Rudrasis Chakraborty, Vikas Singh

    Abstract: Comparing the functional behavior of neural network models, whether it is a single network over time or two (or more networks) during or post-training, is an essential step in understanding what they are learning (and what they are not), and for identifying strategies for regularization or efficiency improvements. Despite recent progress, e.g., comparing vision transformers to CNNs, systematic com… ▽ More

    Submitted 8 November, 2022; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: This paper has been selected as best paper award for ECCV 2022!

    Journal ref: ECCV 2022

  16. arXiv:2203.15234  [pdf, other

    cs.LG cs.AI cs.CV

    Equivariance Allows Handling Multiple Nuisance Variables When Analyzing Pooled Neuroimaging Datasets

    Authors: Vishnu Suresh Lokhande, Rudrasis Chakraborty, Sathya N. Ravi, Vikas Singh

    Abstract: Pooling multiple neuroimaging datasets across institutions often enables improvements in statistical power when evaluating associations (e.g., between risk factors and disease outcomes) that may otherwise be too weak to detect. When there is only a {\em single} source of variability (e.g., different scanners), domain adaptation and matching the distributions of representations may suffice in many… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted at 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

  17. arXiv:2203.13812  [pdf, other

    cs.CV

    Spatially Multi-conditional Image Generation

    Authors: Ritika Chakraborty, Nikola Popovic, Danda Pani Paudel, Thomas Probst, Luc Van Gool

    Abstract: In most scenarios, conditional image generation can be thought of as an inversion of the image understanding process. Since generic image understanding involves solving multiple tasks, it is natural to aim at generating images via multi-conditioning. However, multi-conditional image generation is a very challenging problem due to the heterogeneity and the sparsity of the (in practice) available co… ▽ More

    Submitted 14 July, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

  18. arXiv:2203.01188  [pdf, ps, other

    cs.SI

    EnDSUM: Entropy and Diversity based Disaster Tweet Summarization

    Authors: Piyush Kumar Garg, Roshni Chakraborty, Sourav Kumar Dandapat

    Abstract: The huge amount of information shared in Twitter during disaster events are utilized by government agencies and humanitarian organizations to ensure quick crisis response and provide situational updates. However, the huge number of tweets posted makes manual identification of the relevant tweets impossible. To address the information overload, there is a need to automatically generate summary of a… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

  19. arXiv:2202.09463  [pdf, other

    cs.LG cs.AI stat.AP

    Mixed Effects Neural ODE: A Variational Approximation for Analyzing the Dynamics of Panel Data

    Authors: Jurijs Nazarovs, Rudrasis Chakraborty, Songwong Tasneeyapant, Sathya N. Ravi, Vikas Singh

    Abstract: Panel data involving longitudinal measurements of the same set of participants taken over multiple time points is common in studies to understand childhood development and disease modeling. Deep hybrid models that marry the predictive power of neural networks with physical simulators such as differential equations, are starting to drive advances in such applications. The task of modeling not just… ▽ More

    Submitted 18 February, 2022; originally announced February 2022.

    Journal ref: Proceedings of Machine Learning Research; PMLR 161:107-117, 2021

  20. arXiv:2202.08504  [pdf, other

    cs.SI

    Finding Representative Sampling Subsets in Sensor Graphs using Time Series Similarities

    Authors: Roshni Chakraborty, Josefine Holm, Torben Bach Pedersen, Petar Popovski

    Abstract: With the increasing use of IoT-enabled sensors, it is important to have effective methods for querying the sensors. For example, in a dense network of battery-driven temperature sensors, it is often possible to query (sample) just a subset of the sensors at any given time, since the values of the non-sampled sensors can be estimated from the sampled values. If we can divide the set of sensors into… ▽ More

    Submitted 18 February, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

  21. arXiv:2202.03271  [pdf, ps, other

    eess.SP cs.LG

    Spectro Temporal EEG Biomarkers For Binary Emotion Classification

    Authors: Upasana Tiwari, Rupayan Chakraborty, Sunil Kumar Kopparapu

    Abstract: Electroencephalogram (EEG) is one of the most reliable physiological signal for emotion detection. Being non-stationary in nature, EEGs are better analysed by spectro temporal representations. Standard features like Discrete Wavelet Transformation (DWT) can represent temporal changes in spectral dynamics of an EEG, but is insufficient to extract information other way around, i.e. spectral changes… ▽ More

    Submitted 2 February, 2022; originally announced February 2022.

  22. arXiv:2201.12352  [pdf, other

    cs.SD cs.LG eess.AS

    Automatic Audio Captioning using Attention weighted Event based Embeddings

    Authors: Swapnil Bhosale, Rupayan Chakraborty, Sunil Kumar Kopparapu

    Abstract: Automatic Audio Captioning (AAC) refers to the task of translating audio into a natural language that describes the audio events, source of the events and their relationships. The limited samples in AAC datasets at present, has set up a trend to incorporate transfer learning with Audio Event Detection (AED) as a parent task. Towards this direction, in this paper, we propose an encoder-decoder arch… ▽ More

    Submitted 28 January, 2022; originally announced January 2022.

  23. arXiv:2201.07472  [pdf, other

    cs.SI

    Detecting Stance in Tweets : A Signed Network based Approach

    Authors: Roshni Chakraborty, Maitry Bhavsar, Sourav Kumar Dandapat, Joydeep Chandra

    Abstract: Identifying user stance related to a political event has several applications, like determination of individual stance, shaping of public opinion, identifying popularity of government measures and many others. The huge volume of political discussions on social media platforms, like, Twitter, provide opportunities in developing automated mechanisms to identify individual stance and subsequently, sc… ▽ More

    Submitted 19 January, 2022; originally announced January 2022.

  24. arXiv:2201.06545  [pdf, ps, other

    cs.SI

    OntoDSumm : Ontology based Tweet Summarization for Disaster Events

    Authors: Piyush Kumar Garg, Roshni Chakraborty, Sourav Kumar Dandapat

    Abstract: The huge popularity of social media platforms like Twitter attracts a large fraction of users to share real-time information and short situational messages during disasters. A summary of these tweets is required by the government organizations, agencies, and volunteers for efficient and quick disaster response. However, the huge influx of tweets makes it difficult to manually get a precise overvie… ▽ More

    Submitted 19 November, 2022; v1 submitted 17 January, 2022; originally announced January 2022.

    ACM Class: H.0

  25. arXiv:2201.06437  [pdf, ps, other

    cs.SI cs.LG

    SigGAN : Adversarial Model for Learning Signed Relationships in Networks

    Authors: Roshni Chakraborty, Ritwika Das, Joydeep Chandra

    Abstract: Signed link prediction in graphs is an important problem that has applications in diverse domains. It is a binary classification problem that predicts whether an edge between a pair of nodes is positive or negative. Existing approaches for link prediction in unsigned networks cannot be directly applied for signed link prediction due to their inherent differences. Further, additional structural con… ▽ More

    Submitted 17 January, 2022; originally announced January 2022.

  26. arXiv:2112.00305  [pdf, other

    cs.LG cs.CV stat.ML

    Forward Operator Estimation in Generative Models with Kernel Transfer Operators

    Authors: Zhichun Huang, Rudrasis Chakraborty, Vikas Singh

    Abstract: Generative models which use explicit density modeling (e.g., variational autoencoders, flow-based generative models) involve finding a mapping from a known distribution, e.g. Gaussian, to the unknown input distribution. This often requires searching over a class of non-linear functions (e.g., representable by a deep neural network). While effective in practice, the associated runtime/memory costs… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

  27. arXiv:2109.10213  [pdf, other

    cs.CR cs.CY

    Blockchain-based Covid Vaccination Registration and Monitoring

    Authors: Shirajus Salekin Nabil, Md. Sabbir Alam Pran, Ali Abrar Al Haque, Narayan Ranjan Chakraborty, Mohammad Jabed Morshed Chowdhury, Md Sadek Ferdous

    Abstract: Covid-19 (SARS-CoV-2) has changed almost all the aspects of our living. Governments around the world have imposed lockdown to slow down the transmissions. In the meantime, researchers worked hard to find the vaccine. Fortunately, we have found the vaccine, in fact a good number of them. However, managing the testing and vaccination process of the total population is a mammoth job. There are multip… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: 12 pages

  28. arXiv:2106.15301  [pdf, other

    cs.CV cs.LG

    VolterraNet: A higher order convolutional network with group equivariance for homogeneous manifolds

    Authors: Monami Banerjee, Rudrasis Chakraborty, Jose Bouza, Baba C. Vemuri

    Abstract: Convolutional neural networks have been highly successful in image-based learning tasks due to their translation equivariance property. Recent work has generalized the traditional convolutional layer of a convolutional neural network to non-Euclidean spaces and shown group equivariance of the generalized convolution operation. In this paper, we present a novel higher order Volterra convolutional n… ▽ More

    Submitted 5 June, 2021; originally announced June 2021.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence (2020)

  29. arXiv:2106.07479  [pdf, other

    cs.LG cs.AI stat.ML

    An Online Riemannian PCA for Stochastic Canonical Correlation Analysis

    Authors: Zihang Meng, Rudrasis Chakraborty, Vikas Singh

    Abstract: We present an efficient stochastic algorithm (RSG+) for canonical correlation analysis (CCA) using a reparametrization of the projection matrices. We show how this reparametrization (into structured matrices), simple in hindsight, directly presents an opportunity to repurpose/adjust mature techniques for numerical optimization on Riemannian manifolds. Our developments nicely complement existing me… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

  30. arXiv:2105.06724  [pdf, other

    cs.LG

    RC2020 Report: Learning De-biased Representations with Biased Representations

    Authors: Rwiddhi Chakraborty, Shubhayu Das

    Abstract: As part of the ML Reproducibility Challenge 2020, we investigated the ICML 2020 paper "Learning De-biased Representations with Biased Representations" by Bahng et al., where the authors formalize and attempt to tackle the so called "cross bias generalization" problem with a new approach they introduce called ReBias. This report contains results of our attempts at reproducing the work in the applic… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

    Report number: 3742554

  31. arXiv:2104.05888  [pdf, other

    cs.LG cs.AI cs.CV

    Simpler Certified Radius Maximization by Propagating Covariances

    Authors: Xingjian Zhen, Rudrasis Chakraborty, Vikas Singh

    Abstract: One strategy for adversarially training a robust model is to maximize its certified radius -- the neighborhood around a given training sample for which the model's prediction remains unchanged. The scheme typically involves analyzing a "smoothed" classifier where one estimates the prediction corresponding to Gaussian samples in the neighborhood of each sample in the mini-batch, accomplished in pra… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: This paper has been accepted by CVPR 2021 as an oral presentation. An introduction video can be found: https://youtu.be/m1ya2oNf5iE

  32. arXiv:2103.13823  [pdf, ps, other

    cs.LG cs.AI

    A Novel Adaptive Minority Oversampling Technique for Improved Classification in Data Imbalanced Scenarios

    Authors: Ayush Tripathi, Rupayan Chakraborty, Sunil Kumar Kopparapu

    Abstract: Imbalance in the proportion of training samples belonging to different classes often poses performance degradation of conventional classifiers. This is primarily due to the tendency of the classifier to be biased towards the majority classes in the imbalanced dataset. In this paper, we propose a novel three step technique to address imbalanced data. As a first step we significantly oversample the… ▽ More

    Submitted 26 March, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

    Comments: 8 pages

    Journal ref: ICPR 2020

  33. arXiv:2103.05939  [pdf, other

    cs.LG cs.SE

    A Review and Refinement of Surprise Adequacy

    Authors: Michael Weiss, Rwiddhi Chakraborty, Paolo Tonella

    Abstract: Surprise Adequacy (SA) is one of the emerging and most promising adequacy criteria for Deep Learning (DL) testing. As an adequacy criterion, it has been used to assess the strength of DL test suites. In addition, it has also been used to find inputs to a Deep Neural Network (DNN) which were not sufficiently represented in the training data, or to select samples for DNN retraining. However, computa… ▽ More

    Submitted 10 March, 2021; originally announced March 2021.

    Comments: Accepted at DeepTest 2021 (ICSE Workshop)

  34. arXiv:2102.08074  [pdf, other

    cs.SD cs.LG eess.AS

    Semi Supervised Learning For Few-shot Audio Classification By Episodic Triplet Mining

    Authors: Swapnil Bhosale, Rupayan Chakraborty, Sunil Kumar Kopparapu

    Abstract: Few-shot learning aims to generalize unseen classes that appear during testing but are unavailable during training. Prototypical networks incorporate few-shot metric learning, by constructing a class prototype in the form of a mean vector of the embedded support points within a class. The performance of prototypical networks in extreme few-shot scenarios (like one-shot) degrades drastically, mainl… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

    Comments: 5 pages

  35. arXiv:2102.03902  [pdf, other

    cs.CL cs.LG

    Nyströmformer: A Nyström-Based Algorithm for Approximating Self-Attention

    Authors: Yunyang Xiong, Zhanpeng Zeng, Rudrasis Chakraborty, Mingxing Tan, Glenn Fung, Yin Li, Vikas Singh

    Abstract: Transformers have emerged as a powerful tool for a broad range of natural language processing tasks. A key component that drives the impressive performance of Transformers is the self-attention mechanism that encodes the influence or dependence of other tokens on each specific token. While beneficial, the quadratic complexity of self-attention on the input sequence length has limited its applicati… ▽ More

    Submitted 31 March, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

    Comments: AAAI 2021; Code and supplement available at https://github.com/mlpen/Nystromformer

  36. arXiv:2012.10013  [pdf, other

    cs.CV

    Flow-based Generative Models for Learning Manifold to Manifold Mappings

    Authors: Xingjian Zhen, Rudrasis Chakraborty, Liu Yang, Vikas Singh

    Abstract: Many measurements or observations in computer vision and machine learning manifest as non-Euclidean data. While recent proposals (like spherical CNN) have extended a number of deep neural network architectures to manifold-valued data, and this has often provided strong improvements in performance, the literature on generative models for manifold data is quite sparse. Partly due to this gap, there… ▽ More

    Submitted 1 March, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: This paper has been accepted by AAAI 2021. A video introduction is on YouTube: https://youtu.be/0r96U0vXsCM The official GitHub repo is: https://github.com/zhenxingjian/Dual_Manifold_GLOW

  37. arXiv:2006.12590  [pdf, other

    cs.LG stat.ML

    C-SURE: Shrinkage Estimator and Prototype Classifier for Complex-Valued Deep Learning

    Authors: Yifei Xing, Rudrasis Chakraborty, Minxuan Duan, Stella Yu

    Abstract: The James-Stein (JS) shrinkage estimator is a biased estimator that captures the mean of Gaussian random vectors.While it has a desirable statistical property of dominance over the maximum likelihood estimator (MLE) in terms of mean squared error (MSE), not much progress has been made on extending the estimator onto manifold-valued data. We propose C-SURE, a novel Stein's unbiased risk estimate… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: Submitted to CVPR PBVS workshop

  38. arXiv:2003.13869  [pdf, other

    stat.ML cs.LG

    ManifoldNorm: Extending normalizations on Riemannian Manifolds

    Authors: Rudrasis Chakraborty

    Abstract: Many measurements in computer vision and machine learning manifest as non-Euclidean data samples. Several researchers recently extended a number of deep neural network architectures for manifold valued data samples. Researchers have proposed models for manifold valued spatial data which are common in medical image processing including processing of diffusion tensor imaging (DTI) where images are f… ▽ More

    Submitted 4 April, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

  39. arXiv:2002.12788  [pdf, other

    eess.AS cs.AI

    Identification of Dementia Using Audio Biomarkers

    Authors: Rupayan Chakraborty, Meghna Pandharipande, Chitralekha Bhat, Sunil Kumar Kopparapu

    Abstract: Dementia is a syndrome, generally of a chronic nature characterized by a deterioration in cognitive function, especially in the geriatric population and is severe enough to impact their daily activities. Early diagnosis of dementia is essential to provide timely treatment to alleviate the effects and sometimes to slow the progression of dementia. Speech has been known to provide an indication of a… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: 5 pages, 3 figures

  40. arXiv:1912.11151  [pdf, other

    eess.AS cs.CL cs.SD

    A Cycle-GAN Approach to Model Natural Perturbations in Speech for ASR Applications

    Authors: Sri Harsha Dumpala, Imran Sheikh, Rupayan Chakraborty, Sunil Kumar Kopparapu

    Abstract: Naturally introduced perturbations in audio signal, caused by emotional and physical states of the speaker, can significantly degrade the performance of Automatic Speech Recognition (ASR) systems. In this paper, we propose a front-end based on Cycle-Consistent Generative Adversarial Network (CycleGAN) which transforms naturally perturbed speech into normal speech, and hence improves the robustness… ▽ More

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: 7 pages, 3 figures, ICASSP-2019

  41. arXiv:1911.12207  [pdf, other

    cs.CV

    Orthogonal Convolutional Neural Networks

    Authors: Jiayun Wang, Yubei Chen, Rudrasis Chakraborty, Stella X. Yu

    Abstract: Deep convolutional neural networks are hindered by training instability and feature redundancy towards further performance improvement. A promising solution is to impose orthogonality on convolutional filters. We develop an efficient approach to impose filter orthogonality on a convolutional layer based on the doubly block-Toeplitz matrix representation of the convolutional kernel instead of usi… ▽ More

    Submitted 8 April, 2020; v1 submitted 27 November, 2019; originally announced November 2019.

    Comments: To appear in CVPR 2020, project page: http://pwang.pw/ocnn.html

  42. arXiv:1911.03443  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    An "augmentation-free" rotation invariant classification scheme on point-cloud and its application to neuroimaging

    Authors: Liu Yang, Rudrasis Chakraborty

    Abstract: Recent years have witnessed the emergence and increasing popularity of 3D medical imaging techniques with the development of 3D sensors and technology. However, achieving geometric invariance in the processing of 3D medical images is computationally expensive but nonetheless essential due to the presence of possible errors caused by rigid registration techniques. An alternative way to analyze medi… ▽ More

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: arXiv admin note: text overlap with arXiv:1910.13050 and arXiv:1911.01705

  43. arXiv:1911.01705  [pdf, other

    cs.LG eess.IV stat.ML

    A GMM based algorithm to generate point-cloud and its application to neuroimaging

    Authors: Liu Yang, Rudrasis Chakraborty

    Abstract: Recent years have witnessed the emergence of 3D medical imaging techniques with the development of 3D sensors and technology. Due to the presence of noise in image acquisition, registration researchers focused on an alternative way to represent medical images. An alternative way to analyze medical imaging is by understanding the 3D shapes represented in terms of point-cloud. Though in the medical… ▽ More

    Submitted 5 November, 2019; originally announced November 2019.

  44. arXiv:1910.13050  [pdf, other

    cs.CV

    POIRot: A rotation invariant omni-directional pointnet

    Authors: Liu Yang, Rudrasis Chakraborty, Stella X. Yu

    Abstract: Point-cloud is an efficient way to represent 3D world. Analysis of point-cloud deals with understanding the underlying 3D geometric structure. But due to the lack of smooth topology, and hence the lack of neighborhood structure, standard correlation can not be directly applied on point-cloud. One of the popular approaches to do point correlation is to partition the point-cloud into voxels and extr… ▽ More

    Submitted 29 October, 2019; v1 submitted 28 October, 2019; originally announced October 2019.

  45. arXiv:1910.11334  [pdf, ps, other

    cs.CV

    SurReal: Complex-Valued Learning as Principled Transformations on a Scaling and Rotation Manifold

    Authors: Rudrasis Chakraborty, Yifei Xing, Stella Yu

    Abstract: Complex-valued data is ubiquitous in signal and image processing applications, and complex-valued representations in deep learning have appealing theoretical properties. While these aspects have long been recognized, complex-valued deep learning continues to lag far behind its real-valued counterpart. We propose a principled geometric approach to complex-valued deep learning. Complex-valued data… ▽ More

    Submitted 6 November, 2020; v1 submitted 18 October, 2019; originally announced October 2019.

    Comments: 12 pages, accepted to TNNLS journal

  46. arXiv:1910.02206  [pdf, other

    cs.CV cs.LG

    Dilated Convolutional Neural Networks for Sequential Manifold-valued Data

    Authors: Xingjian Zhen, Rudrasis Chakraborty, Nicholas Vogt, Barbara B. Bendlin, Vikas Singh

    Abstract: Efforts are underway to study ways via which the power of deep neural networks can be extended to non-standard data types such as structured data (e.g., graphs) or manifold-valued data (e.g., unit vectors or special matrices). Often, sizable empirical improvements are possible when the geometry of such data spaces are incorporated into the design of the model, architecture, and the algorithms. Mot… ▽ More

    Submitted 5 October, 2019; originally announced October 2019.

    Journal ref: ICCV 2019

  47. Spatial Transformer for 3D Point Clouds

    Authors: Jiayun Wang, Rudrasis Chakraborty, Stella X. Yu

    Abstract: Deep neural networks are widely used for understanding 3D point clouds. At each point convolution layer, features are computed from local neighborhoods of 3D points and combined for subsequent processing in order to extract semantic information. Existing methods adopt the same individual point neighborhoods throughout the network layers, defined by the same metric on the fixed input point coordina… ▽ More

    Submitted 29 March, 2021; v1 submitted 26 June, 2019; originally announced June 2019.

    Comments: To appear in IEEE Transactions on PAMI, 2021

  48. arXiv:1906.10048  [pdf, ps, other

    cs.CV

    SurReal: Fréchet Mean and Distance Transform for Complex-Valued Deep Learning

    Authors: Rudrasis Chakraborty, Jiayun Wang, Stella X. Yu

    Abstract: We develop a novel deep learning architecture for naturally complex-valued data, which is often subject to complex scaling ambiguity. We treat each sample as a field in the space of complex numbers. With the polar form of a complex-valued number, the general group that acts in this space is the product of planar rotation and non-zero scaling. This perspective allows us to develop not only a novel… ▽ More

    Submitted 24 June, 2019; originally announced June 2019.

    Comments: IEEE Computer Vision and Pattern Recognition Workshop on Perception Beyond the Visible Spectrum, Long Beach, California, 16 June 2019 Best Paper Award

  49. arXiv:1901.09334  [pdf

    cs.SI cs.IR cs.LG

    Predicting Tomorrow's Headline using Today's Twitter Deliberations

    Authors: Roshni Chakraborty, Abhijeet Kharat, Apalak Khatua, Sourav Kumar Dandapat, Joydeep Chandra

    Abstract: Predicting the popularity of news article is a challenging task. Existing literature mostly focused on article contents and polarity to predict popularity. However, existing research has not considered the users' preference towards a particular article. Understanding users' preference is an important aspect for predicting the popularity of news articles. Hence, we consider the social media data, f… ▽ More

    Submitted 27 January, 2019; originally announced January 2019.

    Comments: This paper was accepted in CIKM Workshop on News Recommendation and Analytics (INRA), 2018, Turin, Italy

  50. arXiv:1809.06211  [pdf, other

    cs.CV

    ManifoldNet: A Deep Network Framework for Manifold-valued Data

    Authors: Rudrasis Chakraborty, Jose Bouza, Jonathan Manton, Baba C. Vemuri

    Abstract: Deep neural networks have become the main work horse for many tasks involving learning from data in a variety of applications in Science and Engineering. Traditionally, the input to these networks lie in a vector space and the operations employed within the network are well defined on vector-spaces. In the recent past, due to technological advances in sensing, it has become possible to acquire man… ▽ More

    Submitted 20 September, 2018; v1 submitted 10 September, 2018; originally announced September 2018.