Zum Hauptinhalt springen

Showing 1–50 of 132 results for author: Singh, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.17011  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Disease Classification and Impact of Pretrained Deep Convolution Neural Networks on Diverse Medical Imaging Datasets across Imaging Modalities

    Authors: Jutika Borah, Kumaresh Sarmah, Hidam Kumarjit Singh

    Abstract: Imaging techniques such as Chest X-rays, whole slide images, and optical coherence tomography serve as the initial screening and detection for a wide variety of medical pulmonary and ophthalmic conditions respectively. This paper investigates the intricacies of using pretrained deep convolutional neural networks with transfer learning across diverse medical imaging datasets with varying modalities… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: 15 pages, 3 figures, 4 tables

  2. arXiv:2408.15714  [pdf, other

    cs.CV cs.LG

    Pixels to Prose: Understanding the art of Image Captioning

    Authors: Hrishikesh Singh, Aarti Sharma, Millie Pant

    Abstract: In the era of evolving artificial intelligence, machines are increasingly emulating human-like capabilities, including visual perception and linguistic expression. Image captioning stands at the intersection of these domains, enabling machines to interpret visual content and generate descriptive text. This paper provides a thorough review of image captioning techniques, catering to individuals ent… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

  3. arXiv:2408.10161  [pdf, other

    cs.CV cs.AI cs.RO

    NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge Devices

    Authors: Zhiyong Zhang, Aniket Gupta, Huaizu Jiang, Hanumant Singh

    Abstract: Real-time high-accuracy optical flow estimation is crucial for various real-world applications. While recent learning-based optical flow methods have achieved high accuracy, they often come with significant computational costs. In this paper, we propose a highly efficient optical flow method that balances high accuracy with reduced computational demands. Building upon NeuFlow v1, we introduce new… ▽ More

    Submitted 21 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

  4. arXiv:2408.04490  [pdf, ps, other

    cs.CR math.GR

    Symmetric Encryption Scheme Based on Quasigroup Using Chained Mode of Operation

    Authors: Satish Kumar, Harshdeep Singh, Indivar Gupta, Ashok Ji Gupta

    Abstract: In this paper, we propose a novel construction for a symmetric encryption scheme, referred as SEBQ which is based on the structure of quasigroup. We utilize concepts of chaining like mode of operation and present a block cipher with in-built properties. We prove that SEBQ shows resistance against chosen plaintext attack (CPA) and by applying unbalanced Feistel transformation [19], it achieves secu… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    MSC Class: 20N05; 05B15; 94A60; 68W20

  5. arXiv:2407.10275  [pdf, other

    cs.CL cs.AI

    Cross-Lingual Multi-Hop Knowledge Editing -- Benchmarks, Analysis and a Simple Contrastive Learning based Approach

    Authors: Aditi Khandelwal, Harman Singh, Hengrui Gu, Tianlong Chen, Kaixiong Zhou

    Abstract: Large language models are often expected to constantly adapt to new sources of knowledge and knowledge editing techniques aim to efficiently patch the outdated model knowledge, with minimal modification. Most prior works focus on monolingual knowledge editing in English, even though new information can emerge in any language from any part of the world. We propose the Cross-Lingual Multi-Hop Knowle… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: Paper on Cross-Lingual Multi-Hop Knowledge Editing

  6. arXiv:2407.04708  [pdf, other

    cs.CV cs.LG

    QMViT: A Mushroom is worth 16x16 Words

    Authors: Siddhant Dutta, Hemant Singh, Kalpita Shankhdhar, Sridhar Iyer

    Abstract: Consuming poisonous mushrooms can have severe health consequences, even resulting in fatality and accurately distinguishing edible from toxic mushroom varieties remains a significant challenge in ensuring food safety. So, it's crucial to distinguish between edible and poisonous mushrooms within the existing species. This is essential due to the significant demand for mushrooms in people's daily me… ▽ More

    Submitted 10 May, 2024; originally announced July 2024.

  7. arXiv:2407.03454  [pdf, other

    cs.NE math.OC

    Decomposition of Difficulties in Complex Optimization Problems Using a Bilevel Approach

    Authors: Ankur Sinha, Dhaval Pujara, Hemant Kumar Singh

    Abstract: Practical optimization problems may contain different kinds of difficulties that are often not tractable if one relies on a particular optimization method. Different optimization approaches offer different strengths that are good at tackling one or more difficulty in an optimization problem. For instance, evolutionary algorithms have a niche in handling complexities like discontinuity, non-differe… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 9 pages

    MSC Class: 90C30 ACM Class: G.0

  8. arXiv:2406.19668  [pdf, other

    cs.CV

    PopAlign: Population-Level Alignment for Fair Text-to-Image Generation

    Authors: Shufan Li, Harkanwar Singh, Aditya Grover

    Abstract: Text-to-image (T2I) models achieve high-fidelity generation through extensive training on large datasets. However, these models may unintentionally pick up undesirable biases of their training data, such as over-representation of particular identities in gender or ethnicity neutral prompts. Existing alignment methods such as Reinforcement Learning from Human Feedback (RLHF) and Direct Preference O… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 18 pages, 10 figures

  9. arXiv:2406.02554  [pdf, other

    eess.AS cs.AI cs.CL cs.CV cs.LG cs.MM

    Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition

    Authors: Shijian Deng, Erin E. Kosloski, Siddhi Patel, Zeke A. Barnett, Yiyang Nan, Alexander Kaplan, Sisira Aarukapalli, William T. Doan, Matthew Wang, Harsh Singh, Pamela R. Rollins, Yapeng Tian

    Abstract: In this article, we introduce a novel problem of audio-visual autism behavior recognition, which includes social behavior recognition, an essential aspect previously omitted in AI-assisted autism screening research. We define the task at hand as one that is audio-visual autism behavior recognition, which uses audio and visual cues, including any speech present in the audio, to recognize autism-rel… ▽ More

    Submitted 22 March, 2024; originally announced June 2024.

  10. arXiv:2405.18948  [pdf, other

    cs.RO cs.LG

    Learning to Recover from Plan Execution Errors during Robot Manipulation: A Neuro-symbolic Approach

    Authors: Namasivayam Kalithasan, Arnav Tuli, Vishal Bindal, Himanshu Gaurav Singh, Parag Singla, Rohan Paul

    Abstract: Automatically detecting and recovering from failures is an important but challenging problem for autonomous robots. Most of the recent work on learning to plan from demonstrations lacks the ability to detect and recover from errors in the absence of an explicit state representation and/or a (sub-) goal check function. We propose an approach (blending learning with symbolic search) for automated er… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  11. arXiv:2404.16816  [pdf, other

    cs.CL

    IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages

    Authors: Harman Singh, Nitish Gupta, Shikhar Bharadwaj, Dinesh Tewari, Partha Talukdar

    Abstract: As large language models (LLMs) see increasing adoption across the globe, it is imperative for LLMs to be representative of the linguistic diversity of the world. India is a linguistically diverse country of 1.4 Billion people. To facilitate research on multilingual LLM evaluation, we release IndicGenBench - the largest benchmark for evaluating LLMs on user-facing generation tasks across a diverse… ▽ More

    Submitted 7 August, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: ACL 2024

  12. arXiv:2404.15549  [pdf, other

    cs.CL cs.AI

    PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models

    Authors: Shashi Kant Gupta, Aditya Basu, Mauro Nievas, Jerrin Thomas, Nathan Wolfrath, Adhitya Ramamurthi, Bradley Taylor, Anai N. Kothari, Regina Schwind, Therica M. Miller, Sorena Nadaf-Rahrov, Yanshan Wang, Hrituraj Singh

    Abstract: Clinical trial matching is the task of identifying trials for which patients may be potentially eligible. Typically, this task is labor-intensive and requires detailed verification of patient electronic health records (EHRs) against the stringent inclusion and exclusion criteria of clinical trials. This process is manual, time-intensive, and challenging to scale up, resulting in many patients miss… ▽ More

    Submitted 26 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: 30 Pages, 8 Figures, Supplementary Work Attached

  13. arXiv:2404.08085  [pdf, ps, other

    cs.DS

    Matrix Multiplication Reductions

    Authors: Ashish Gola, Igor Shinkar, Harsimran Singh

    Abstract: In this paper we study a worst case to average case reduction for the problem of matrix multiplication over finite fields. Suppose we have an efficient average case algorithm, that given two random matrices $A,B$ outputs a matrix that has a non-trivial correlation with their product $A \cdot B$. Can we transform it into a worst case algorithm, that outputs the correct answer for all inputs without… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  14. arXiv:2404.07774  [pdf, other

    cs.LG cs.RO

    Sketch-Plan-Generalize: Continual Few-Shot Learning of Inductively Generalizable Spatial Concepts

    Authors: Namasivayam Kalithasan, Sachit Sachdeva, Himanshu Gaurav Singh, Vishal Bindal, Arnav Tuli, Gurarmaan Singh Panjeta, Divyanshu Aggarwal, Rohan Paul, Parag Singla

    Abstract: Our goal is to enable embodied agents to learn inductively generalizable spatial concepts, e.g., learning staircase as an inductive composition of towers of increasing height. Given a human demonstration, we seek a learning architecture that infers a succinct ${program}$ representation that explains the observed instance. Additionally, the approach should generalize inductively to novel structures… ▽ More

    Submitted 29 May, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  15. arXiv:2404.06680  [pdf, other

    cs.CL

    Onco-Retriever: Generative Classifier for Retrieval of EHR Records in Oncology

    Authors: Shashi Kant Gupta, Aditya Basu, Bradley Taylor, Anai Kothari, Hrituraj Singh

    Abstract: Retrieving information from EHR systems is essential for answering specific questions about patient journeys and improving the delivery of clinical care. Despite this fact, most EHR systems still rely on keyword-based searches. With the advent of generative large language models (LLMs), retrieving information can lead to better search and summarization capabilities. Such retrievers can also feed R… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 18 pages

  16. arXiv:2404.04714  [pdf, other

    cs.LG cs.AI cs.CR

    Data Poisoning Attacks on Off-Policy Policy Evaluation Methods

    Authors: Elita Lobo, Harvineet Singh, Marek Petrik, Cynthia Rudin, Himabindu Lakkaraju

    Abstract: Off-policy Evaluation (OPE) methods are a crucial tool for evaluating policies in high-stakes domains such as healthcare, where exploration is often infeasible, unethical, or expensive. However, the extent to which such methods can be trusted under adversarial threats to data quality is largely unexplored. In this work, we make the first attempt at investigating the sensitivity of OPE methods to m… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Accepted at UAI 2022

  17. arXiv:2403.19885  [pdf, other

    cs.CV cs.RO

    Towards Long Term SLAM on Thermal Imagery

    Authors: Colin Keil, Aniket Gupta, Pushyami Kaveti, Hanumant Singh

    Abstract: Visual SLAM with thermal imagery, and other low contrast visually degraded environments such as underwater, or in areas dominated by snow and ice, remain a difficult problem for many state of the art (SOTA) algorithms. In addition to challenging front-end data association, thermal imagery presents an additional difficulty for long term relocalization and map reuse. The relative temperatures of obj… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 8 pages, 7 figures, Submitted to IROS 2024

  18. arXiv:2403.13170  [pdf, other

    cs.RO

    On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine

    Authors: Jagatpreet Singh Nir, Dennis Giaya, Hanumant Singh

    Abstract: Deep learning techniques have significantly advanced in providing accurate visual odometry solutions by leveraging large datasets. However, generating uncertainty estimates for these methods remains a challenge. Traditional sensor fusion approaches in a Bayesian framework are well-established, but deep learning techniques with millions of parameters lack efficient methods for uncertainty estimatio… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Submitted to IROS 2024

  19. arXiv:2403.10425  [pdf, other

    cs.CV cs.AI cs.RO

    NeuFlow: Real-time, High-accuracy Optical Flow Estimation on Robots Using Edge Devices

    Authors: Zhiyong Zhang, Huaizu Jiang, Hanumant Singh

    Abstract: Real-time high-accuracy optical flow estimation is a crucial component in various applications, including localization and mapping in robotics, object tracking, and activity recognition in computer vision. While recent learning-based optical flow methods have achieved high accuracy, they often come with heavy computation costs. In this paper, we propose a highly efficient optical flow architecture… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  20. arXiv:2403.01628  [pdf, ps, other

    cs.LG

    Recent Advances, Applications, and Open Challenges in Machine Learning for Health: Reflections from Research Roundtables at ML4H 2023 Symposium

    Authors: Hyewon Jeong, Sarah Jabbour, Yuzhe Yang, Rahul Thapta, Hussein Mozannar, William Jongwon Han, Nikita Mehandru, Michael Wornow, Vladislav Lialin, Xin Liu, Alejandro Lozano, Jiacheng Zhu, Rafal Dariusz Kocielnik, Keith Harrigian, Haoran Zhang, Edward Lee, Milos Vukadinovic, Aparna Balagopalan, Vincent Jeanselme, Katherine Matton, Ilker Demirel, Jason Fries, Parisa Rashidi, Brett Beaulieu-Jones, Xuhai Orson Xu , et al. (18 additional authors not shown)

    Abstract: The third ML4H symposium was held in person on December 10, 2023, in New Orleans, Louisiana, USA. The symposium included research roundtable sessions to foster discussions between participants and senior researchers on timely and relevant topics for the \ac{ML4H} community. Encouraged by the successful virtual roundtables in the previous year, we organized eleven in-person roundtables and four vir… ▽ More

    Submitted 5 April, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: ML4H 2023, Research Roundtables

  21. arXiv:2402.17412  [pdf, other

    cs.CV

    DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models

    Authors: Shyam Marjit, Harshit Singh, Nityanand Mathur, Sayak Paul, Chia-Mu Yu, Pin-Yu Chen

    Abstract: In the realm of subject-driven text-to-image (T2I) generative models, recent developments like DreamBooth and BLIP-Diffusion have led to impressive results yet encounter limitations due to their intensive fine-tuning demands and substantial parameter requirements. While the low-rank adaptation (LoRA) module within DreamBooth offers a reduction in trainable parameters, it introduces a pronounced se… ▽ More

    Submitted 28 February, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Project Page: https://diffusekrona.github.io/

  22. arXiv:2402.14254  [pdf, other

    cs.LG stat.ML

    A hierarchical decomposition for explaining ML performance discrepancies

    Authors: Jean Feng, Harvineet Singh, Fan Xia, Adarsh Subbaswamy, Alexej Gossmann

    Abstract: Machine learning (ML) algorithms can often differ in performance across domains. Understanding $\textit{why}$ their performance differs is crucial for determining what types of interventions (e.g., algorithmic or operational) are most effective at closing the performance gaps. Existing methods focus on $\textit{aggregate decompositions}$ of the total performance gap into the impact of a shift in t… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 11 pages, 5 figures in main body; 14 pages and 2 figures in appendices

  23. arXiv:2402.05892  [pdf, other

    cs.CV

    Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data

    Authors: Shufan Li, Harkanwar Singh, Aditya Grover

    Abstract: In recent years, Transformers have become the de-facto architecture for sequence modeling on text and a variety of multi-dimensional data, such as images and video. However, the use of self-attention layers in a Transformer incurs prohibitive compute and memory complexity that scales quadratically w.r.t. the sequence length. A recent architecture, Mamba, based on state space models has been shown… ▽ More

    Submitted 13 July, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 24 pages, 7 figures

  24. arXiv:2402.04888  [pdf, other

    cs.IT cs.AI cs.HC cs.LG eess.SP

    RSCNet: Dynamic CSI Compression for Cloud-based WiFi Sensing

    Authors: Borna Barahimi, Hakam Singh, Hina Tabassum, Omer Waqar, Mohammad Omer

    Abstract: WiFi-enabled Internet-of-Things (IoT) devices are evolving from mere communication devices to sensing instruments, leveraging Channel State Information (CSI) extraction capabilities. Nevertheless, resource-constrained IoT devices and the intricacies of deep neural networks necessitate transmitting CSI to cloud servers for sensing. Although feasible, this leads to considerable communication overhea… ▽ More

    Submitted 20 May, 2024; v1 submitted 19 January, 2024; originally announced February 2024.

    Comments: The paper has been accepted by IEEE International Conference on Communications (ICC) 2024

  25. arXiv:2402.02656  [pdf, other

    cs.CL q-bio.QM

    RACER: An LLM-powered Methodology for Scalable Analysis of Semi-structured Mental Health Interviews

    Authors: Satpreet Harcharan Singh, Kevin Jiang, Kanchan Bhasin, Ashutosh Sabharwal, Nidal Moukaddam, Ankit B Patel

    Abstract: Semi-structured interviews (SSIs) are a commonly employed data-collection method in healthcare research, offering in-depth qualitative insights into subject experiences. Despite their value, the manual analysis of SSIs is notoriously time-consuming and labor-intensive, in part due to the difficulty of extracting and categorizing emotional responses, and challenges in scaling human evaluation for l… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  26. arXiv:2312.12630  [pdf, other

    math.DS cs.LG math.CV math.FA math.SP

    Data-driven discovery with Limited Data Acquisition for fluid flow across cylinder

    Authors: Himanshu Singh

    Abstract: One of the central challenge for extracting governing principles of dynamical system via Dynamic Mode Decomposition (DMD) is about the limit data availability or formally called as Limited Data Acquisition in the present paper. In the interest of discovering the governing principles for a dynamical system with limited data acquisition, we provide a variant of Kernelized Extended DMD (KeDMD) based… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 52 Pages, 16 Figures, JULIA Coding Result for Dynamic Mode Decomposition, Part of this work selected for 42nd Annual Dynamic Days 2024 Conference (January 8 to 10) at University of California, Davis

    MSC Class: 37N10 76D05 76D25 47N50 47A25 68T01 28A10 28A35

  27. arXiv:2312.10693  [pdf, other

    cs.LG math.FA

    An appointment with Reproducing Kernel Hilbert Space generated by Generalized Gaussian RBF as $L^2-$measure

    Authors: Himanshu Singh

    Abstract: Gaussian Radial Basis Function (RBF) Kernels are the most-often-employed kernels in artificial intelligence and machine learning routines for providing optimally-best results in contrast to their respective counter-parts. However, a little is known about the application of the Generalized Gaussian Radial Basis Function on various machine learning algorithms namely, kernel regression, support vecto… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: 20 pages, MATLAB CODE, 11 figures, Results presented in AMS Spring Eastern Sectional Meeting on April 2023

    MSC Class: NUMBER 68-Computer Science; 68T-Artificial Intelligence and 68T07-Artificial Neural Networks and Deep Learning

  28. A Survey of Classical And Quantum Sequence Models

    Authors: I-Chi Chen, Harshdeep Singh, V L Anukruti, Brian Quanz, Kavitha Yogaraj

    Abstract: Our primary objective is to conduct a brief survey of various classical and quantum neural net sequence models, which includes self-attention and recurrent neural networks, with a focus on recent quantum approaches proposed to work with near-term quantum devices, while exploring some basic enhancements for these quantum models. We re-implement a key representative set of these existing methods, ad… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 6 pages, 10 figures, accepted as a COMSNETS paper

    Journal ref: Conference: 2024 16th International Conference on COMmunication Systems & NETworkS (COMSNETS)

  29. arXiv:2312.09958  [pdf, other

    cs.AI cs.IR

    Distilling Large Language Models for Matching Patients to Clinical Trials

    Authors: Mauro Nievas, Aditya Basu, Yanshan Wang, Hrituraj Singh

    Abstract: The recent success of large language models (LLMs) has paved the way for their adoption in the high-stakes domain of healthcare. Specifically, the application of LLMs in patient-trial matching, which involves assessing patient eligibility against clinical trial's nuanced inclusion and exclusion criteria, has shown promise. Recent research has shown that GPT-3.5, a widely recognized LLM developed b… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  30. arXiv:2312.06738  [pdf, other

    cs.CV

    InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following

    Authors: Shufan Li, Harkanwar Singh, Aditya Grover

    Abstract: The ability to provide fine-grained control for generating and editing visual imagery has profound implications for computer vision and its applications. Previous works have explored extending controllability in two directions: instruction tuning with text-based prompts and multi-modal conditioning. However, these works make one or more unnatural assumptions on the number and/or type of modality i… ▽ More

    Submitted 26 April, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: 29 pages, 14 figures

  31. arXiv:2312.04745  [pdf, other

    stat.AP cs.LG

    A Brief Tutorial on Sample Size Calculations for Fairness Audits

    Authors: Harvineet Singh, Fan Xia, Mi-Ok Kim, Romain Pirracchio, Rumi Chunara, Jean Feng

    Abstract: In fairness audits, a standard objective is to detect whether a given algorithm performs substantially differently between subgroups. Properly powering the statistical analysis of such audits is crucial for obtaining informative fairness assessments, as it ensures a high probability of detecting unfairness when it exists. However, limited guidance is available on the amount of data necessary for a… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 4 pages, 1 figure, 1 table, Workshop on Regulatable Machine Learning at the 37th Conference on Neural Information Processing Systems

  32. arXiv:2312.00655   

    cs.LG

    Machine Learning for Health symposium 2023 -- Findings track

    Authors: Stefan Hegselmann, Antonio Parziale, Divya Shanmugam, Shengpu Tang, Mercy Nyamewaa Asiedu, Serina Chang, Thomas Hartvigsen, Harvineet Singh

    Abstract: A collection of the accepted Findings papers that were presented at the 3rd Machine Learning for Health symposium (ML4H 2023), which was held on December 10, 2023, in New Orleans, Louisiana, USA. ML4H 2023 invited high-quality submissions on relevant problems in a variety of health-related disciplines including healthcare, biomedicine, and public health. Two submission tracks were offered: the arc… ▽ More

    Submitted 15 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

    MSC Class: 68Txx ACM Class: I.2; J.3; I.6; I.4

  33. arXiv:2311.11463  [pdf, other

    cs.LG stat.ML

    Designing monitoring strategies for deployed machine learning algorithms: navigating performativity through a causal lens

    Authors: Jean Feng, Adarsh Subbaswamy, Alexej Gossmann, Harvineet Singh, Berkman Sahiner, Mi-Ok Kim, Gene Pennello, Nicholas Petrick, Romain Pirracchio, Fan Xia

    Abstract: After a machine learning (ML)-based system is deployed, monitoring its performance is important to ensure the safety and effectiveness of the algorithm over time. When an ML algorithm interacts with its environment, the algorithm can affect the data-generating mechanism and be a major source of bias when evaluating its standalone performance, an issue known as performativity. Although prior work h… ▽ More

    Submitted 26 February, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

  34. Adaptive Search Optimization: Dynamic Algorithm Selection and Caching for Enhanced Database Performance

    Authors: Hakikat Singh

    Abstract: Efficient search operations in databases are paramount for timely retrieval of information various applications. This research introduces a novel approach, combining dynamicalgorithm1 selection and caching2 strategies, to optimize search performance. The proposed dynamic search algorithm intelligently switches between Binary3 and Interpolation 4 Search based on dataset characteristics, significant… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  35. arXiv:2309.10698  [pdf, other

    cs.RO

    OASIS: Optimal Arrangements for Sensing in SLAM

    Authors: Pushyami Kaveti, Matthew Giamou, Hanumant Singh, David M. Rosen

    Abstract: The number and arrangement of sensors on mobile robot dramatically influence its perception capabilities. Ensuring that sensors are mounted in a manner that enables accurate detection, localization, and mapping is essential for the success of downstream control tasks. However, when designing a new robotic platform, researchers and practitioners alike usually mimic standard configurations or maximi… ▽ More

    Submitted 21 March, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

  36. arXiv:2309.10348  [pdf, other

    cs.LG cs.CR cs.CV

    Language Guided Adversarial Purification

    Authors: Himanshu Singh, A V Subramanyam

    Abstract: Adversarial purification using generative models demonstrates strong adversarial defense performance. These methods are classifier and attack-agnostic, making them versatile but often computationally intensive. Recent strides in diffusion and score networks have improved image generation and, by extension, adversarial purification. Another highly efficient class of adversarial defense methods know… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    MSC Class: 68T45 (Primary); 68T10 (Secondary) ACM Class: I.5.4

  37. arXiv:2307.07863  [pdf, other

    cs.LG cs.AI

    Benchmarking the Effectiveness of Classification Algorithms and SVM Kernels for Dry Beans

    Authors: Anant Mehta, Prajit Sengupta, Divisha Garg, Harpreet Singh, Yosi Shacham Diamand

    Abstract: Plant breeders and agricultural researchers can increase crop productivity by identifying desirable features, disease resistance, and nutritional content by analysing the Dry Bean dataset. This study analyses and compares different Support Vector Machine (SVM) classification algorithms, namely linear, polynomial, and radial basis function (RBF), along with other popular classification algorithms.… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: 6 pages, 5 figures

  38. arXiv:2307.01362  [pdf, other

    cs.CV

    A Strong Baseline for Point Cloud Registration via Direct Superpoints Matching

    Authors: Aniket Gupta, Yiming Xie, Hanumant Singh, Huaizu Jiang

    Abstract: Deep neural networks endow the downsampled superpoints with highly discriminative feature representations. Previous dominant point cloud registration approaches match these feature representations as the first step, e.g., using the Sinkhorn algorithm. A RANSAC-like method is then usually adopted as a post-processing refinement to filter the outliers. Other dominant method is to directly predict th… ▽ More

    Submitted 29 March, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

  39. arXiv:2306.14657  [pdf, other

    cs.RO eess.SY

    A Diversity Analysis of Safety Metrics Comparing Vehicle Performance in the Lead-Vehicle Interaction Regime

    Authors: Harnarayan Singh, Bowen Weng, Sughosh J. Rao, Devin Elsasser

    Abstract: Vehicle performance metrics analyze data sets consisting of subject vehicle's interactions with other road users in a nominal driving environment and provide certain performance measures as outputs. To the best of the authors' knowledge, the vehicle safety performance metrics research dates back to at least 1967. To date, there still does not exist a community-wide accepted metric or a set of metr… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: A modified manuscript of this preprint has been accepted to be published as a regular paper at IEEE Transactions on Intelligent Transportation Systems

  40. arXiv:2306.08522  [pdf, other

    cs.RO

    Challenges of Indoor SLAM: A multi-modal multi-floor dataset for SLAM evaluation

    Authors: Pushyami Kaveti, Aniket Gupta, Dennis Giaya, Madeline Karp, Colin Keil, Jagatpreet Nir, Zhiyong Zhang, Hanumant Singh

    Abstract: Robustness in Simultaneous Localization and Mapping (SLAM) remains one of the key challenges for the real-world deployment of autonomous systems. SLAM research has seen significant progress in the last two and a half decades, yet many state-of-the-art (SOTA) algorithms still struggle to perform reliably in real-world environments. There is a general consensus in the research community that we need… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

  41. arXiv:2306.02631  [pdf, other

    cs.RO

    Bridging the Domain Gap between Synthetic and Real-World Data for Autonomous Driving

    Authors: Xiangyu Bai, Yedi Luo, Le Jiang, Aniket Gupta, Pushyami Kaveti, Hanumant Singh, Sarah Ostadabbas

    Abstract: Modern autonomous systems require extensive testing to ensure reliability and build trust in ground vehicles. However, testing these systems in the real-world is challenging due to the lack of large and diverse datasets, especially in edge cases. Therefore, simulations are necessary for their development and evaluation. However, existing open-source simulators often exhibit a significant gap betwe… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  42. arXiv:2306.01704  [pdf, other

    cs.RO

    Temporal-controlled Frame Swap for Generating High-Fidelity Stereo Driving Data for Autonomy Analysis

    Authors: Yedi Luo, Xiangyu Bai, Le Jiang, Aniket Gupta, Eric Mortin, Hanumant Singh, Sarah Ostadabbas

    Abstract: This paper presents a novel approach, TeFS (Temporal-controlled Frame Swap), to generate synthetic stereo driving data for visual simultaneous localization and mapping (vSLAM) tasks. TeFS is designed to overcome the lack of native stereo vision support in commercial driving simulators, and we demonstrate its effectiveness using Grand Theft Auto V (GTA V), a high-budget open-world video game engine… ▽ More

    Submitted 25 December, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

  43. arXiv:2305.17611  [pdf, other

    cs.CV

    Bayesian Decision Making to Localize Visual Queries in 2D

    Authors: Syed Asjad, Aniket Gupta, Hanumant Singh

    Abstract: This report describes our approach for the EGO4D 2023 Visual Query 2D Localization Challenge. Our method aims to reduce the number of False Positives (FP) that occur because of high similarity between the visual crop and the proposed bounding boxes from the baseline's Region Proposal Network (RPN). Our method uses a transformer to determine similarity in higher dimensions which is used as our prio… ▽ More

    Submitted 27 May, 2023; originally announced May 2023.

    Comments: Report for the EGO4D 2023 Visual Query 2D Localization Challenge

  44. arXiv:2305.15074  [pdf, other

    cs.CL cs.AI

    Have LLMs Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models

    Authors: Daman Arora, Himanshu Gaurav Singh, Mausam

    Abstract: The performance of large language models (LLMs) on existing reasoning benchmarks has significantly improved over the past years. In response, we present JEEBench, a considerably more challenging benchmark dataset for evaluating the problem solving abilities of LLMs. We curate 515 challenging pre-engineering mathematics, physics and chemistry problems from the highly competitive IIT JEE-Advanced ex… ▽ More

    Submitted 23 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  45. arXiv:2305.14562  [pdf, other

    cs.LG eess.SY

    GiPH: Generalizable Placement Learning for Adaptive Heterogeneous Computing

    Authors: Yi Hu, Chaoran Zhang, Edward Andert, Harshul Singh, Aviral Shrivastava, James Laudon, Yanqi Zhou, Bob Iannucci, Carlee Joe-Wong

    Abstract: Careful placement of a computational application within a target device cluster is critical for achieving low application completion time. The problem is challenging due to its NP-hardness and combinatorial nature. In recent years, learning-based approaches have been proposed to learn a placement policy that can be applied to unseen applications, motivated by the problem of placing a neural networ… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: to be published in Proceedings of Machine Learning and Systems 5 (MLSys 2023)

  46. arXiv:2305.14410  [pdf, other

    cs.CV cs.AI cs.CL

    Image Manipulation via Multi-Hop Instructions -- A New Dataset and Weakly-Supervised Neuro-Symbolic Approach

    Authors: Harman Singh, Poorva Garg, Mohit Gupta, Kevin Shah, Ashish Goswami, Satyam Modi, Arnab Kumar Mondal, Dinesh Khandelwal, Dinesh Garg, Parag Singla

    Abstract: We are interested in image manipulation via natural language text -- a task that is useful for multiple AI applications but requires complex reasoning over multi-modal spaces. We extend recently proposed Neuro Symbolic Concept Learning (NSCL), which has been quite effective for the task of Visual Question Answering (VQA), for the task of image manipulation. Our system referred to as NeuroSIM can p… ▽ More

    Submitted 24 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 (long paper, main conference)

  47. arXiv:2305.13812  [pdf, other

    cs.CL cs.CV

    Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality

    Authors: Harman Singh, Pengchuan Zhang, Qifan Wang, Mengjiao Wang, Wenhan Xiong, Jingfei Du, Yu Chen

    Abstract: Contrastively trained vision-language models have achieved remarkable progress in vision and language representation learning, leading to state-of-the-art models for various downstream multimodal tasks. However, recent research has highlighted severe limitations of these models in their ability to perform compositional reasoning over objects, attributes, and relations. Scene graphs have emerged as… ▽ More

    Submitted 24 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 (long paper, main conference)

  48. arXiv:2305.07859  [pdf, other

    cs.LG

    HAiVA: Hybrid AI-assisted Visual Analysis Framework to Study the Effects of Cloud Properties on Climate Patterns

    Authors: Subhashis Hazarika, Haruki Hirasawa, Sookyung Kim, Kalai Ramea, Salva R. Cachay, Peetak Mitra, Dipti Hingmire, Hansi Singh, Phil J. Rasch

    Abstract: Clouds have a significant impact on the Earth's climate system. They play a vital role in modulating Earth's radiation budget and driving regional changes in temperature and precipitation. This makes clouds ideal for climate intervention techniques like Marine Cloud Brightening (MCB) which refers to modification in cloud reflectivity, thereby cooling the surrounding region. However, to avoid unint… ▽ More

    Submitted 13 May, 2023; originally announced May 2023.

  49. arXiv:2302.13330  [pdf, ps, other

    math.CO cs.DM

    Power of $k$ Choices in the Semi-Random Graph Process

    Authors: Paweł Prałat, Harjas Singh

    Abstract: The semi-random graph process is a single player game in which the player is initially presented an empty graph on $n$ vertices. In each round, a vertex $u$ is presented to the player independently and uniformly at random. The player then adaptively selects a vertex $v$, and adds the edge $uv$ to the graph. For a fixed monotone graph property, the objective of the player is to force the graph to s… ▽ More

    Submitted 10 July, 2023; v1 submitted 26 February, 2023; originally announced February 2023.

    Comments: 18 pages

  50. arXiv:2302.03258  [pdf, other

    cs.LG

    Climate Intervention Analysis using AI Model Guided by Statistical Physics Principles

    Authors: Soo Kyung Kim, Kalai Ramea, Salva Rühling Cachay, Haruki Hirasawa, Subhashis Hazarika, Dipti Hingmire, Peetak Mitra, Philip J. Rasch, Hansi A. Singh

    Abstract: The availability of training data remains a significant obstacle for the implementation of machine learning in scientific applications. In particular, estimating how a system might respond to external forcings or perturbations requires specialized labeled data or targeted simulations, which may be computationally intensive to generate at scale. In this study, we propose a novel solution to this ch… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.