Showing 1–50 of 73 results for author: Dasgupta, A

Searching in archive cs.
  1. arXiv:2407.11973  [pdf, other]

    cs.HC cs.AI cs.CY

    Preliminary Study of the Impact of AI-Based Interventions on Health and Behavioral Outcomes in Maternal Health Programs

    Authors: Arpan Dasgupta, Niclas Boehmer, Neha Madhiwalla, Aparna Hedge, Bryan Wilder, Milind Tambe, Aparna Taneja

    Abstract: Automated voice calls are an effective method of delivering maternal and child health information to mothers in underserved communities. One method to fight dwindling listenership is through an intervention in which health workers make live service calls. Previous work has shown that we can use AI to identify beneficiaries whose listenership gets the greatest boost from an intervention. It has als…

    Submitted 23 May, 2024; originally announced July 2024.

    Comments: Accepted at Autonomous Agents for Social Good (AASG) workshop at AAMAS'24

  2. arXiv:2406.13154  [pdf, other]

    stat.ML cs.AI cs.LG

    Conditional score-based diffusion models for solving inverse problems in mechanics

    Authors: Agnimitra Dasgupta, Harisankar Ramaswamy, Javier Murgoitio Esandi, Ken Foo, Runze Li, Qifa Zhou, Brendan Kennedy, Assad Oberai

    Abstract: We propose a framework to perform Bayesian inference using conditional score-based diffusion models to solve a class of inverse problems in mechanics involving the inference of a specimen's spatially varying material properties from noisy measurements of its mechanical response to loading. Conditional score-based diffusion models are generative models that learn to approximate the score function o…

    Submitted 21 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2405.11738  [pdf, other]

    cs.RO

    Diffusion Models for Generating Ballistic Spacecraft Trajectories

    Authors: Tyler Presser, Agnimitra Dasgupta, Daniel Erwin, Assad Oberai

    Abstract: Generative modeling has drawn much attention in creative and scientific data generation tasks. Score-based Diffusion Models, a type of generative model that iteratively learns to denoise data, have shown state-of-the-art results on tasks such as image generation, multivariate time series forecasting, and robotic trajectory planning. Using score-based diffusion models, this work implements a novel…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: To be presented at the 2024 Astrodynamics Specialist Conference

  4. arXiv:2403.18864  [pdf, other]

    physics.ao-ph cs.AI cs.LG

    Interpretable Machine Learning for Weather and Climate Prediction: A Survey

    Authors: Ruyi Yang, Jingyu Hu, Zihao Li, Jianli Mu, Tingzhao Yu, Jiangjiang Xia, Xuhong Li, Aritra Dasgupta, Haoyi Xiong

    Abstract: Advanced machine learning models have recently achieved high predictive accuracy for weather and climate prediction. However, these complex models often lack inherent transparency and interpretability, acting as "black boxes" that impede user trust and hinder further model improvements. As such, interpretable machine learning techniques have become crucial in enhancing the credibility and utility…

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 26 pages, 5 figures

  5. arXiv:2312.09885  [pdf, other]

    cs.LG cs.AI cs.DS

    Simple Weak Coresets for Non-Decomposable Classification Measures

    Authors: Jayesh Malaviya, Anirban Dasgupta, Rachit Chhaya

    Abstract: While coresets have been growing in terms of their application, barring few exceptions, they have mostly been limited to unsupervised settings. We consider supervised classification problems, and non-decomposable evaluation measures in such settings. We show that stratified uniform sampling based coresets have excellent empirical performance that are backed by theoretical guarantees too. We focus…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI 2024

  6. arXiv:2311.18572  [pdf, other]

    cs.CV

    Overcoming Label Noise for Source-free Unsupervised Video Domain Adaptation

    Authors: Avijit Dasgupta, C. V. Jawahar, Karteek Alahari

    Abstract: Despite the progress seen in classification methods, current approaches for handling videos with distribution shifts in source and target domains remain source-dependent as they require access to the source data during the adaptation stage. In this paper, we present a self-training based source-free video domain adaptation approach to address this challenge by bridging the gap between the source a…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: Extended version of our ICVGIP paper

  7. arXiv:2311.18259  [pdf, other]

    cs.CV cs.AI

    Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

    Authors: Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, Vijay Baiyya, Siddhant Bansal, Bikram Boote, Eugene Byrne, Zach Chavis, Joya Chen, Feng Cheng, Fu-Jen Chu, Sean Crane, Avijit Dasgupta, Jing Dong, Maria Escobar, Cristhian Forigua, Abrham Gebreselasie, Sanjay Haresh, Jing Huang, Md Mohaiminul Islam, Suyog Jain, et al. (76 additional authors not shown)

    Abstract: We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric and exocentric video of skilled human activities (e.g., sports, music, dance, bike repair). 740 participants from 13 cities worldwide performed these activities in 123 different natural scene contexts, yielding long-form captures from…

    Submitted 29 April, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: updated baseline results and dataset statistics to match the released v2 data; added table to appendix comparing stats of Ego-Exo4D alongside other datasets

  8. arXiv:2311.10524  [pdf, ps, other]

    cs.IT quant-ph

    Quantum intersection and union

    Authors: Naqueeb Ahmad Warsi, Ayanava Dasgupta

    Abstract: In information theory, we often use intersection and union of the typical sets to analyze various communication problems. However, in the quantum setting it is not very clear how to construct a measurement which behaves analogous to intersection and union of the typical sets. In this work, we construct a projection operator which behaves very similar to intersection and union of the typical sets.…

    Submitted 17 November, 2023; originally announced November 2023.

  9. arXiv:2311.07584  [pdf]

    cs.CL cs.AI cs.IR cs.IT cs.LG

    Performance Prediction of Data-Driven Knowledge summarization of High Entropy Alloys (HEAs) literature implementing Natural Language Processing algorithms

    Authors: Akshansh Mishra, Vijaykumar S Jatti, Vaishnavi More, Anish Dasgupta, Devarrishi Dixit, Eyob Messele Sefene

    Abstract: The ability to interpret spoken language is connected to natural language processing. It involves teaching the AI how words relate to one another, how they are meant to be used, and in what settings. The goal of natural language processing (NLP) is to get a machine intelligence to process words the same way a human brain does. This enables machine intelligence to interpret, arrange, and comprehend…

    Submitted 6 November, 2023; originally announced November 2023.

  10. arXiv:2311.06413  [pdf, other]

    cs.HC cs.AI cs.LG eess.SP

    Forte: An Interactive Visual Analytic Tool for Trust-Augmented Net Load Forecasting

    Authors: Kaustav Bhattacharjee, Soumya Kundu, Indrasis Chakraborty, Aritra Dasgupta

    Abstract: Accurate net load forecasting is vital for energy planning, aiding decisions on trade and load distribution. However, assessing the performance of forecasting models across diverse input variables, like temperature and humidity, remains challenging, particularly for eliciting a high degree of trust in the model outcomes. In this context, there is a growing need for data-driven technological interv…

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: Accepted for publication in the proceedings of 2024 IEEE Power & Energy Society Innovative Smart Grid Technologies Conference, North America (ISGT NA)

  11. arXiv:2310.14748  [pdf]

    q-fin.PM cs.LG

    A Comparative Study of Portfolio Optimization Methods for the Indian Stock Market

    Authors: Jaydip Sen, Arup Dasgupta, Partha Pratim Sengupta, Sayantani Roy Choudhury

    Abstract: This chapter presents a comparative study of the three portfolio optimization methods, MVP, HRP, and HERC, on the Indian stock market, particularly focusing on the stocks chosen from 15 sectors listed on the National Stock Exchange of India. The top stocks of each cluster are identified based on their free-float market capitalization from the report of the NSE published on July 1, 2022 (NSE Websit…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: This is the draft version of the chapter that has been accepted for publication in the edited volume titled "Data Science: Theory and Practice". The volume is edited by Jaydip Sen and Sayantani Roy Choudhury and will be published by IntechOpen, London, UK. The chapter is 74 pages long and it contains 32 tables and 62 figures

  12. arXiv:2310.09852  [pdf, other]

    cs.LG

    Alpha Elimination: Using Deep Reinforcement Learning to Reduce Fill-In during Sparse Matrix Decomposition

    Authors: Arpan Dasgupta, Pawan Kumar

    Abstract: A large number of computational and scientific methods commonly require decomposing a sparse matrix into triangular factors as LU decomposition. A common problem faced during this decomposition is that even though the given matrix may be very sparse, the decomposition may lead to a denser triangular factors due to fill-in. A significant fill-in may lead to prohibitively larger computational costs…

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: accepted to ECML 2023, Research Track

  13. arXiv:2310.09770  [pdf]

    q-fin.CP cs.CE

    A Portfolio Rebalancing Approach for the Indian Stock Market

    Authors: Jaydip Sen, Arup Dasgupta, Subhasis Dasgupta, Sayantani Roychoudhury

    Abstract: This chapter presents a calendar rebalancing approach to portfolios of stocks in the Indian stock market. Ten important sectors of the Indian economy are first selected. For each of these sectors, the top ten stocks are identified based on their free-float market capitalization values. Using the ten stocks in each sector, a sector-specific portfolio is designed. In this study, the historical stock…

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: This is the draft version of the chapter that will appear in the edited volume titled "Data Science: Theory and Applications" edited by Jaydip Sen and Sayantani Roy Choudhury. The volume will be published by Cambridge Scholars Publishing, Newcastle upon Tyne, UK, in March 2024. The chapter has 80 pages, and it consists of 50 figures and 13 tables

  14. arXiv:2310.05395  [pdf, other]

    cs.MM cs.LG

    Robust Image Watermarking based on Cross-Attention and Invariant Domain Learning

    Authors: Agnibh Dasgupta, Xin Zhong

    Abstract: Image watermarking involves embedding and extracting watermarks within a cover image, with deep learning approaches emerging to bolster generalization and robustness. Predominantly, current methods employ convolution and concatenation for watermark embedding, while also integrating conceivable augmentation in the training process. This paper explores a robust image watermarking methodology by harn…

    Submitted 9 October, 2023; originally announced October 2023.

  15. arXiv:2310.04690  [pdf, other]

    cs.CE

    A dimension-reduced variational approach for solving physics-based inverse problems using generative adversarial network priors and normalizing flows

    Authors: Agnimitra Dasgupta, Dhruv V Patel, Deep Ray, Erik A Johnson, Assad A Oberai

    Abstract: We propose a novel modular inference approach combining two different generative models -- generative adversarial networks (GAN) and normalizing flows -- to approximate the posterior distribution of physics-based Bayesian inverse problems framed in high-dimensional ambient spaces. We dub the proposed framework GAN-Flow. The proposed method leverages the intrinsic dimension reduction and superior s…

    Submitted 7 October, 2023; originally announced October 2023.

  16. arXiv:2308.14622  [pdf, other]

    cs.IR cs.AI cs.HC

    TRIVEA: Transparent Ranking Interpretation using Visual Explanation of Black-Box Algorithmic Rankers

    Authors: Jun Yuan, Kaustav Bhattacharjee, Akm Zahirul Islam, Aritra Dasgupta

    Abstract: Ranking schemes drive many real-world decisions, like, where to study, whom to hire, what to buy, etc. Many of these decisions often come with high consequences. For example, a university can be deemed less prestigious if not featured in a top-k list, and consumers might not even explore products that do not get recommended to buyers. At the heart of most of these decisions are opaque ranking sche…

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: Accepted for publication in SpringerNature's Visual Computer Journal

  17. arXiv:2308.06932  [pdf, other]

    cs.CR cs.AR

    DIVAS: An LLM-based End-to-End Framework for SoC Security Analysis and Policy-based Protection

    Authors: Sudipta Paria, Aritra Dasgupta, Swarup Bhunia

    Abstract: Securing critical assets in a bus-based System-On-Chip (SoC) is imperative to mitigate potential vulnerabilities and prevent unauthorized access, ensuring the integrity, availability, and confidentiality of the system. Ensuring security throughout the SoC design process is a formidable task owing to the inherent intricacies in SoC designs and the dispersion of assets across diverse IPs. Large Lang…

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 15 pages, 7 figures, 8 tables

  18. arXiv:2307.04245  [pdf, other]

    cs.CV cs.AI

    A Novel Pipeline for Improving Optical Character Recognition through Post-processing Using Natural Language Processing

    Authors: Aishik Rakshit, Samyak Mehta, Anirban Dasgupta

    Abstract: Optical Character Recognition (OCR) technology finds applications in digitizing books and unstructured documents, along with applications in other domains such as mobility statistics, law enforcement, traffic, security systems, etc. The state-of-the-art methods work well with the OCR with printed text on license plates, shop names, etc. However, applications such as printed textbooks and handwritt…

    Submitted 9 July, 2023; originally announced July 2023.

    Comments: Accepted in IEEE GCON (IEEE Guwahati Subsection Conference) 2023

  19. arXiv:2306.04895  [pdf, other]

    stat.ML cs.LG

    Solution of physics-based inverse problems using conditional generative adversarial networks with full gradient penalty

    Authors: Deep Ray, Javier Murgoitio-Esandi, Agnimitra Dasgupta, Assad A. Oberai

    Abstract: The solution of probabilistic inverse problems for which the corresponding forward problem is constrained by physical principles is challenging. This is especially true if the dimension of the inferred vector is large and the prior information about it is in the form of a collection of samples. In this work, a novel deep learning based approach is developed and applied to solving these types of pr…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 34 pages, 9 figures, 3 tables, 1 appendix

  20. arXiv:2306.03723  [pdf, other]

    cs.CL cs.AI cs.CE

    Financial Numeric Extreme Labelling: A Dataset and Benchmarking for XBRL Tagging

    Authors: Soumya Sharma, Subhendu Khatuya, Manjunath Hegde, Afreen Shaikh, Koustuv Dasgupta, Pawan Goyal, Niloy Ganguly

    Abstract: The U.S. Securities and Exchange Commission (SEC) mandates all public companies to file periodic financial statements that should contain numerals annotated with a particular label from a taxonomy. In this paper, we formulate the task of automating the assignment of a label to a particular numeral span in a sentence from an extremely large label set. Towards this task, we release a dataset, Financ…

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: Accepted to ACL'23 Findings Paper

  21. arXiv:2305.19659  [pdf, other]

    cs.LG cs.DS

    Improving Expressivity of Graph Neural Networks using Localization

    Authors: Anant Kumar, Shrutimoy Das, Shubhajit Roy, Binita Maity, Anirban Dasgupta

    Abstract: In this paper, we propose localized versions of Weisfeiler-Leman (WL) algorithms in an effort to both increase the expressivity, as well as decrease the computational overhead. We focus on the specific problem of subgraph counting and give localized versions of $k-$WL for any $k$. We analyze the power of Local $k-$WL and prove that it is more expressive than $k-$WL and at most as expressive as…

    Submitted 29 January, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

  22. arXiv:2305.04073  [pdf, other]

    cs.AI cs.LG

    Explaining RL Decisions with Trajectories

    Authors: Shripad Vilasrao Deshmukh, Arpan Dasgupta, Balaji Krishnamurthy, Nan Jiang, Chirag Agarwal, Georgios Theocharous, Jayakumar Subramanian

    Abstract: Explanation is a key component for the adoption of reinforcement learning (RL) in many real-world decision-making problems. In the literature, the explanation is often provided by saliency attribution to the features of the RL agent's state. In this work, we propose a complementary approach to these explanations, particularly for offline RL, where we attribute the policy decisions of a trained RL…

    Submitted 22 January, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: Published at International Conference on Learning Representations (ICLR), 2023

  23. Power to the Data Defenders: Human-Centered Disclosure Risk Calibration of Open Data

    Authors: Kaustav Bhattacharjee, Aritra Dasgupta

    Abstract: The open data ecosystem is susceptible to vulnerabilities due to disclosure risks. Though the datasets are anonymized during release, the prevalence of the release-and-forget model makes the data defenders blind to privacy issues arising after the dataset release. One such issue can be the disclosure risks in the presence of newly released datasets which may compromise the privacy of the data subj…

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: In Proceedings of the Symposium on Usable Security and Privacy (USEC) 2023

    Journal ref: Proceedings of Symposium on Usable Security and Privacy (USEC) 2023

  24. arXiv:2304.11045  [pdf, other]

    cs.LG cs.AI cs.IR

    Light-weight Deep Extreme Multilabel Classification

    Authors: Istasis Mishra, Arpan Dasgupta, Pratik Jawanpuria, Bamdev Mishra, Pawan Kumar

    Abstract: Extreme multi-label (XML) classification refers to the task of supervised multi-label learning that involves a large number of labels. Hence, scalability of the classifier with increasing label dimension is an important consideration. In this paper, we develop a method called LightDXML which modifies the recently developed deep learning based XML framework by using label embeddings instead of feat…

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 9 pages, 2 figures, 5 tables

  25. arXiv:2302.05971  [pdf, other]

    cs.LG

    Review of Extreme Multilabel Classification

    Authors: Arpan Dasgupta, Siddhant Katyan, Shrutimoy Das, Pawan Kumar

    Abstract: Extreme multilabel classification or XML, is an active area of interest in machine learning. Compared to traditional multilabel classification, here the number of labels is extremely large, hence, the name extreme multilabel classification. Using classical one versus all classification won't scale in this case due to large number of labels, same is true for any other classifiers. Embedding of label…

    Submitted 26 March, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: 46 pages, 13 figures

  26. arXiv:2211.02932  [pdf, other]

    cs.HC

    Rankers, Rankees, & Rankings: Peeking into the Pandora's Box from a Socio-Technical Perspective

    Authors: Jun Yuan, Julia Stoyanovich, Aritra Dasgupta

    Abstract: Algorithmic rankers have a profound impact on our increasingly data-driven society. From leisurely activities like the movies that we watch, the restaurants that we patronize; to highly consequential decisions, like making educational and occupational choices or getting hired by companies -- these are all driven by sophisticated yet mostly inaccessible rankers. A small change to how these algorith…

    Submitted 5 November, 2022; originally announced November 2022.

    Comments: Accepted for Interrogating Human-Centered Data Science workshop at CHI'22

  27. Machine Learning for Optical Motion Capture-driven Musculoskeletal Modelling from Inertial Motion Capture Data

    Authors: Abhishek Dasgupta, Rahul Sharma, Challenger Mishra, Vikranth H. Nagaraja

    Abstract: Marker-based Optical Motion Capture (OMC) systems and associated musculoskeletal (MSK) modelling predictions offer non-invasively obtainable insights into in vivo joint and muscle loading, aiding clinical decision-making. However, an OMC system is lab-based, expensive, and requires a line of sight. Inertial Motion Capture (IMC) systems are widely-used alternatives, which are portable, user-friendl…

    Submitted 11 February, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: 23 pages, 12 figures, 5 tables

    Journal ref: Bioengineering 2023, 10(5), 510

  28. arXiv:2209.07404  [pdf]

    cs.NE cs.AI cs.LG

    Self-Organizing Map Neural Network Algorithm for the Determination of Fracture Location in Solid-State Process joined Dissimilar Alloys

    Authors: Akshansh Mishra, Anish Dasgupta

    Abstract: The subject area known as computational neuroscience involves the investigation of brain function using mathematical techniques and theories. In order to comprehend how the brain processes information, it can also include various methods from signal processing, computer science, and physics. In the present work, for the first time a neurobiological based unsupervised machine learning algorithm i.e…

    Submitted 14 August, 2022; originally announced September 2022.

  29. arXiv:2208.09135  [pdf]

    physics.geo-ph cs.CV cs.LG physics.ao-ph

    Towards Daily High-resolution Inundation Observations using Deep Learning and EO

    Authors: Antara Dasgupta, Lasse Hybbeneth, Björn Waske

    Abstract: Satellite remote sensing presents a cost-effective solution for synoptic flood monitoring, and satellite-derived flood maps provide a computationally efficient alternative to numerical flood inundation models traditionally used. While satellites do offer timely inundation information when they happen to cover an ongoing flood event, they are limited by their spatiotemporal resolution in terms of t…

    Submitted 2 September, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

  30. PRIVEE: A Visual Analytic Workflow for Proactive Privacy Risk Inspection of Open Data

    Authors: Kaustav Bhattacharjee, Akm Islam, Jaideep Vaidya, Aritra Dasgupta

    Abstract: Open data sets that contain personal information are susceptible to adversarial attacks even when anonymized. By performing low-cost joins on multiple datasets with shared attributes, malicious users of open data portals might get access to information that violates individuals' privacy. However, open data sets are primarily published using a release-and-forget model, whereby data owners and custo…

    Submitted 12 August, 2022; originally announced August 2022.

    Comments: Accepted for IEEE Symposium on Visualization in Cyber Security, 2022

  31. arXiv:2202.06481  [pdf, other]

    cs.LG cs.AI

    A Survey on Machine Learning Approaches for Modelling Intuitive Physics

    Authors: Jiafei Duan, Arijit Dasgupta, Jason Fischer, Cheston Tan

    Abstract: Research in cognitive science has provided extensive evidence of human cognitive ability in performing physical reasoning of objects from noisy perceptual inputs. Such a cognitive ability is commonly known as intuitive physics. With advancements in deep learning, there is an increasing interest in building intelligent systems that are capable of performing physical reasoning from a given scene for…

    Submitted 27 April, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

    Comments: Paper accepted at IJCAI 2022 (Survey Track)

  32. arXiv:2201.05706  [pdf, other]

    cs.CV cs.LG

    Perspective Transformation Layer

    Authors: Nishan Khatri, Agnibh Dasgupta, Yucong Shen, Xin Zhong, Frank Y. Shih

    Abstract: Incorporating geometric transformations that reflect the relative position changes between an observer and an object into computer vision and deep learning models has attracted much attention in recent years. However, the existing proposals mainly focus on the affine transformation that is insufficient to reflect such geometric position changes. Furthermore, current solutions often apply a neural…

    Submitted 30 October, 2022; v1 submitted 14 January, 2022; originally announced January 2022.

    Comments: This paper has been accepted for publication by the 2022 International Conference on Computational Science & Computational Intelligence (CSCI'22), Research Track on Signal & Image Processing, Computer Vision & Pattern Recognition

  33. arXiv:2111.09083  [pdf, other]

    cs.RO eess.SY

    Trajectory Prediction & Path Planning for an Object Intercepting UAV with a Mounted Depth Camera

    Authors: Jasper Tan, Arijit Dasgupta, Arjun Agrawal, Sutthiphong Srigrarom

    Abstract: A novel control & software architecture using ROS C++ is introduced for object interception by a UAV with a mounted depth camera and no external aid. Existing work in trajectory prediction focused on the use of off-board tools like motion capture rooms to intercept thrown objects. The present study designs the UAV architecture to be completely on-board capable of object interception with the use o…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: Accepted at the 21st International Conference on Control, Automation and Systems 2021 (ICCAS 2021)

  34. arXiv:2111.09061  [pdf, other]

    cs.LG cs.NI

    Exploring Unsupervised Learning Methods for Automated Protocol Analysis

    Authors: Arijit Dasgupta, Yi-Xue Yan, Clarence Ong, Jenn-Yue Teo, Chia-Wei Lim

    Abstract: The ability to analyse and differentiate network protocol traffic is crucial for network resource management to provide differentiated services by Telcos. Automated Protocol Analysis (APA) is crucial to significantly improve efficiency and reduce reliance on human experts. There are numerous automated state-of-the-art unsupervised methods for clustering unknown protocols in APA. However, many such…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: Accepted to the IEEE Symposium Series on Computational Intelligence (IEEE SSCI 2021)

  35. arXiv:2111.08826  [pdf, other]

    cs.CV cs.AI

    A Benchmark for Modeling Violation-of-Expectation in Physical Reasoning Across Event Categories

    Authors: Arijit Dasgupta, Jiafei Duan, Marcelo H. Ang Jr, Yi Lin, Su-hua Wang, Renée Baillargeon, Cheston Tan

    Abstract: Recent work in computer vision and cognitive reasoning has given rise to an increasing adoption of the Violation-of-Expectation (VoE) paradigm in synthetic datasets. Inspired by infant psychology, researchers are now evaluating a model's ability to label scenes as either expected or surprising with knowledge of only expected scenes. However, existing VoE-based 3D datasets in physical reasoning pro…

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: arXiv admin note: text overlap with arXiv:2110.05836

    ACM Class: I.2.10

  36. arXiv:2111.00745  [pdf, other]

    stat.ML cs.LG

    Uncertainty quantification for ptychography using normalizing flows

    Authors: Agnimitra Dasgupta, Zichao Wendy Di

    Abstract: Ptychography, as an essential tool for high-resolution and nondestructive material characterization, presents a challenging large-scale nonlinear and non-convex inverse problem; however, its intrinsic photon statistics create clear opportunities for statistical-based deep learning approaches to tackle these challenges, which has been underexplored. In this work, we explore normalizing flows to obt…

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: Accepted at the Fourth Workshop on Machine Learning for Physical Sciences, NeurIPS 2021

  37. arXiv:2110.05836  [pdf, other]

    cs.CV cs.AI

    AVoE: A Synthetic 3D Dataset on Understanding Violation of Expectation for Artificial Cognition

    Authors: Arijit Dasgupta, Jiafei Duan, Marcelo H. Ang Jr, Cheston Tan

    Abstract: Recent work in cognitive reasoning and computer vision has engendered an increasing popularity for the Violation-of-Expectation (VoE) paradigm in synthetic datasets. Inspired by work in infant psychology, researchers have started evaluating a model's ability to discriminate between expected and surprising scenes as a sign of its reasoning ability. Existing VoE-based 3D datasets in physical reasoni…

    Submitted 16 November, 2021; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: Accepted at the NeurIPS Workshop for Physical Reasoning and Inductive Biases for the Real World

    ACM Class: I.2.10

  38. arXiv:2103.00147  [pdf, other]

    cs.LG

    Statistical Measures For Defining Curriculum Scoring Function

    Authors: Vinu Sankar Sadasivan, Anirban Dasgupta

    Abstract: Curriculum learning is a training strategy that sorts the training examples by some measure of their difficulty and gradually exposes them to the learner to improve the network performance. Motivated by our insights from implicit curriculum ordering, we first introduce a simple curriculum learning strategy that uses statistical measures such as standard deviation and entropy values to score the di…

    Submitted 27 July, 2021; v1 submitted 27 February, 2021; originally announced March 2021.

    Comments: Revision 1: Corrected minor typos, added link to open-sourced codes, fixed Figures 7 as per reviews

  39. arXiv:2012.06522  [pdf, other]

    cs.DS cs.LG

    Online Coresets for Clustering with Bregman Divergences

    Authors: Rachit Chhaya, Jayesh Choudhari, Anirban Dasgupta, Supratim Shit

    Abstract: We present algorithms that create coresets in an online setting for clustering problems according to a wide subset of Bregman divergences. Notably, our coresets have a small additive error, similar in magnitude to the lightweight coresets Bachem et. al. 2018, and take update time $O(d)$ for every incoming point where $d$ is dimension of the point. Our first algorithm gives online coresets of size…

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: Work in Progress

  40. arXiv:2011.01456  [pdf, other]

    cs.LG

    Frequency-compensated PINNs for Fluid-dynamic Design Problems

    Authors: Tongtao Zhang, Biswadip Dey, Pratik Kakkar, Arindam Dasgupta, Amit Chakraborty

    Abstract: Incompressible fluid flow around a cylinder is one of the classical problems in fluid dynamics, with strong relevance to many real-world engineering problems, for example, the design of offshore structures or of a pin-fin heat exchanger. Thus learning a high-accuracy surrogate for this problem can demonstrate the efficacy of a novel machine learning approach. In this work, we propose a physics…

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: Machine Learning for Engineering Modeling, Simulation, and Design (ML4Eng) Workshop, NeurIPS 2020

  41. arXiv:2010.02912  [pdf, ps, other]

    cs.DS cs.DM cs.LG math.CO

    On Additive Approximate Submodularity

    Authors: Flavio Chierichetti, Anirban Dasgupta, Ravi Kumar

    Abstract: A real-valued set function is (additively) approximately submodular if it satisfies the submodularity conditions with an additive error. Approximate submodularity arises in many settings, especially in machine learning, where the function evaluation might not be exact. In this paper we study how close such approximately submodular functions are to truly submodular functions. We show that an appr…

    Submitted 7 October, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

    ACM Class: F.2.2

  42. arXiv:2008.10828  [pdf, other]

    cs.DS

    Efficient Hierarchical Clustering for Classification and Anomaly Detection

    Authors: Ishita Doshi, Sreekalyan Sajjalla, Jayesh Choudhari, Rushi Bhatt, Anirban Dasgupta

    Abstract: We address the problem of large-scale real-time classification of content posted on social networks, along with the need to rapidly identify novel spam types. Obtaining manual labels for user-generated content using editorial labeling and taxonomy development lags behind the rate at which new content types need to be classified. We propose a class of hierarchical clustering algorithms that ca…

    Submitted 25 August, 2020; originally announced August 2020.

    Comments: 19 pages, 2 figures, 9 tables

    ACM Class: H.3.3; I.5.3; I.7.0; E.1

  43. arXiv:2007.14820  [pdf, other]

    cs.SI physics.soc-ph stat.ME

    Scalable Estimation of Epidemic Thresholds via Node Sampling

    Authors: Anirban Dasgupta, Srijan Sengupta

    Abstract: Infectious or contagious diseases can be transmitted from one person to another through social contact networks. In today's interconnected global society, such contagion processes can cause global public health hazards, as exemplified by the ongoing Covid-19 pandemic. It is therefore of great practical relevance to investigate the network transmission of contagious diseases from the perspective o…

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: 22 pages

  44. arXiv:2006.05440  [pdf, other]

    cs.LG cs.DS stat.ML

    On Coresets For Regularized Regression

    Authors: Rachit Chhaya, Anirban Dasgupta, Supratim Shit

    Abstract: We study the effect of norm-based regularization on the size of coresets for regression problems. Specifically, given a matrix $\mathbf{A} \in \mathbb{R}^{n \times d}$ with $n \gg d$, a vector $\mathbf{b} \in \mathbb{R}^n$, and $λ > 0$, we analyze the size of coresets for regularized versions of regression of the form $\|\mathbf{Ax}-\mathbf{b}\|_p^r + λ\|\mathbf{x}\|_q^s$. Prior work has…

    Submitted 30 June, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: Accepted at ICML 2020. Acknowledgements added. Minor errors fixed

  45. arXiv:2006.01225  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Streaming Coresets for Symmetric Tensor Factorization

    Authors: Rachit Chhaya, Jayesh Choudhari, Anirban Dasgupta, Supratim Shit

    Abstract: Factorizing tensors has recently become an important optimization module in a number of machine learning pipelines, especially in latent variable models. We show how to do this efficiently in the streaming setting. Given a set of $n$ vectors, each in $\mathbb{R}^d$, we present algorithms to select a sublinear number of these vectors as a coreset, while guaranteeing that the CP decomposition of the…

    Submitted 13 July, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: Accepted at ICML 2020. Included algorithm with improved update time and fixed minor bugs

  46. arXiv:1910.02133  [pdf, other]

    eess.IV cond-mat.mtrl-sci cs.LG stat.ML

    A Conditional Generative Model for Predicting Material Microstructures from Processing Methods

    Authors: Akshay Iyer, Biswadip Dey, Arindam Dasgupta, Wei Chen, Amit Chakraborty

    Abstract: Microstructures of a material form the bridge linking processing conditions, which can be controlled, to the material property, which is the primary interest in engineering applications. Thus a critical task in material design is establishing the processing-structure relationship, which requires domain expertise and techniques that can model the high-dimensional material microstructure. This wor…

    Submitted 4 October, 2019; originally announced October 2019.

  47. arXiv:1907.05943  [pdf]

    cs.LG stat.ML

    A Study and Analysis of a Feature Subset Selection Technique using Penguin Search Optimization Algorithm (FS-PeSOA)

    Authors: Agnip Dasgupta, Ardhendu Banerjee, Aniket Ghosh Dastidar, Antara Barman, Sanjay Chakraborty

    Abstract: In today's world of enormous amounts of data, it is very important to extract useful knowledge from it. This can be accomplished by feature subset selection. Feature subset selection is a method of selecting a minimum number of features with the help of which our machine can learn and predict which class a particular data belongs to. We will introduce a new adaptive algorithm called Feature selectio…

    Submitted 13 July, 2019; originally announced July 2019.

  48. arXiv:1809.04487  [pdf, other]

    cs.LG stat.ML

    Discovering Topical Interactions in Text-based Cascades using Hidden Markov Hawkes Processes

    Authors: Srikanta Bedathur, Indrajit Bhattacharya, Jayesh Choudhari, Anirban Dasgupta

    Abstract: Social media conversations unfold based on complex interactions between users, topics and time. While recent models have been proposed to capture network strengths between users, users' topical preferences and temporal patterns between posting and response times, interaction patterns between topics have not been studied. We propose the Hidden Markov Hawkes Process (HMHP) that incorporates topical M…

    Submitted 12 September, 2018; originally announced September 2018.

    Comments: Accepted as a short paper at ICDM-2018

  49. arXiv:1804.05051  [pdf, other]

    astro-ph.IM cs.LG

    Machine Learning in Astronomy: A Case Study in Quasar-Star Classification

    Authors: Mohammed Viquar, Suryoday Basak, Ariruna Dasgupta, Surbhi Agrawal, Snehanshu Saha

    Abstract: We present the results of various automated classification methods, based on machine learning (ML), of objects from data releases 6 and 7 (DR6 and DR7) of the Sloan Digital Sky Survey (SDSS), primarily distinguishing stars from quasars. We provide a careful scrutiny of approaches available in the literature and have highlighted the pitfalls in those approaches based on the nature of data used for…

    Submitted 13 April, 2018; originally announced April 2018.

    Comments: 10 pages, 8 figures

  50. arXiv:1711.11527  [pdf, other]

    stat.ML cs.LG

    Improved Linear Embeddings via Lagrange Duality

    Authors: Kshiteej Sheth, Dinesh Garg, Anirban Dasgupta

    Abstract: Near-isometric orthogonal embeddings to lower dimensions are a fundamental tool in data science and machine learning. In this paper, we present the construction of such embeddings that minimizes the maximum distortion for a given set of points. We formulate the problem as a non-convex constrained optimization problem. We first construct a primal relaxation and then use the theory of Lagrange duali…

    Submitted 14 December, 2017; v1 submitted 30 November, 2017; originally announced November 2017.

    Comments: 20 pages