Zum Hauptinhalt springen

Showing 1–50 of 102 results for author: Shekhar, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.10250  [pdf, ps, other

    cs.IT

    Product and Ratio of Two $α-κ-μ$ Shadowed Random Variables and its Application to Wireless Communication

    Authors: Shashank Shekhar, Sheetal Kalyani

    Abstract: This work studies the product and ratio statistics of independent and non-identically distributed (i.n.i.d) $ α-κ- μ$ shadowed random variables. We derive the series expression for the probability density function (PDF), cumulative distribution function (CDF), and moment generating function (MGF) of the product and ratio of i.n.i.d $ α- κ- μ$ shadowed random variables. We then give the single inte… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2203.15760

  2. arXiv:2407.02536  [pdf, other

    cs.LG cs.IR econ.GN stat.AP

    Reducing False Discoveries in Statistically-Significant Regional-Colocation Mining: A Summary of Results

    Authors: Subhankar Ghosh, Jayant Gupta, Arun Sharma, Shuai An, Shashi Shekhar

    Abstract: Given a set \emph{S} of spatial feature types, its feature instances, a study area, and a neighbor relationship, the goal is to find pairs $<$a region ($r_{g}$), a subset \emph{C} of \emph{S}$>$ such that \emph{C} is a statistically significant regional-colocation pattern in $r_{g}$. This problem is important for applications in various domains including ecology, economics, and sociology. The prob… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    ACM Class: E.m; F.2; E.1; H.3; I.5; J.0

  3. arXiv:2407.00890  [pdf, other

    econ.EM cs.CL cs.LG

    Macroeconomic Forecasting with Large Language Models

    Authors: Andrea Carriero, Davide Pettenuzzo, Shubhranshu Shekhar

    Abstract: This paper presents a comparative analysis evaluating the accuracy of Large Language Models (LLMs) against traditional macro time series forecasting approaches. In recent times, LLMs have surged in popularity for forecasting due to their ability to capture intricate patterns in data and quickly adapt across very different domains. However, their effectiveness in forecasting macroeconomic time seri… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  4. arXiv:2407.00317  [pdf, other

    cs.IR stat.AP

    Towards Statistically Significant Taxonomy Aware Co-location Pattern Detection

    Authors: Subhankar Ghosh, Arun Sharma, Jayant Gupta, Shashi Shekhar

    Abstract: Given a collection of Boolean spatial feature types, their instances, a neighborhood relation (e.g., proximity), and a hierarchical taxonomy of the feature types, the goal is to find the subsets of feature types or their parents whose spatial interaction is statistically significant. This problem is for taxonomy-reliant applications such as ecology (e.g., finding new symbiotic relationships across… ▽ More

    Submitted 4 July, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

    Comments: Accepted in The 16th Conference on Spatial Information Theory (COSIT) 2024

    ACM Class: E.m; H.3.3; I.5; J.4; J.4

  5. arXiv:2406.04886  [pdf, other

    cs.CV cs.AI cs.CL

    Seeing the Unseen: Visual Metaphor Captioning for Videos

    Authors: Abisek Rajakumar Kalarani, Pushpak Bhattacharyya, Sumit Shekhar

    Abstract: Metaphors are a common communication tool used in our day-to-day life. The detection and generation of metaphors in textual form have been studied extensively but metaphors in other forms have been under-explored. Recent studies have shown that Vision-Language (VL) models cannot understand visual metaphors in memes and adverts. As of now, no probing studies have been done that involve complex lang… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  6. arXiv:2405.17839  [pdf, other

    cs.DC cs.AI

    PeerFL: A Simulator for Peer-to-Peer Federated Learning at Scale

    Authors: Alka Luqman, Shivanshu Shekhar, Anupam Chattopadhyay

    Abstract: This work integrates peer-to-peer federated learning tools with NS3, a widely used network simulator, to create a novel simulator designed to allow heterogeneous device experiments in federated learning. This cross-platform adaptability addresses a critical gap in existing simulation tools, enhancing the overall utility and user experience. NS3 is leveraged to simulate WiFi dynamics to facilitate… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  7. arXiv:2403.06268  [pdf, other

    cs.CV cs.AI cs.CG cs.DB cs.LG

    Physics-Guided Abnormal Trajectory Gap Detection

    Authors: Arun Sharma, Shashi Shekhar

    Abstract: Given trajectories with gaps (i.e., missing data), we investigate algorithms to identify abnormal gaps in trajectories which occur when a given moving object did not report its location, but other moving objects in the same geographic region periodically did. The problem is important due to its societal applications, such as improving maritime safety and regulatory enforcement for global security… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  8. arXiv:2402.14974  [pdf, other

    eess.IV cs.AI cs.LG

    Towards Spatially-Lucid AI Classification in Non-Euclidean Space: An Application for MxIF Oncology Data

    Authors: Majid Farhadloo, Arun Sharma, Jayant Gupta, Alexey Leontovich, Svetomir N. Markovic, Shashi Shekhar

    Abstract: Given multi-category point sets from different place-types, our goal is to develop a spatially-lucid classifier that can distinguish between two classes based on the arrangements of their points. This problem is important for many applications, such as oncology, for analyzing immune-tumor relationships and designing new immunotherapies. It is challenging due to spatial variability and interpretabi… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: SIAM International Conference on Data Mining (SDM24)

  9. arXiv:2402.10971  [pdf, other

    cs.ET

    Enabling data-driven and bidirectional model development in Verilog-A for photonic devices

    Authors: Dias Azhigulov, Zeqin Lu, James Pond, Lukas Chrostowski, Sudip Shekhar

    Abstract: We present a method to model photonic components in Verilog-A by introducing bidirectional signaling through a single port. To achieve this, the concept of power waves and scattering parameters from electromagnetism are employed. As a consequence, one can simultaneously transmit forward and backward propagating waves on a single wire while also capturing realistic, measurement-backed response of p… ▽ More

    Submitted 3 July, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  10. arXiv:2402.01742  [pdf, other

    cs.CL cs.AI cs.LG

    Towards Optimizing the Costs of LLM Usage

    Authors: Shivanshu Shekhar, Tanishq Dubey, Koyel Mukherjee, Apoorv Saxena, Atharv Tyagi, Nishanth Kotla

    Abstract: Generative AI and LLMs in particular are heavily used nowadays for various document processing tasks such as question answering and summarization. However, different LLMs come with different capabilities for different tasks as well as with different costs, tokenization, and latency. In fact, enterprises are already incurring huge costs of operating or using LLMs for their respective use cases. I… ▽ More

    Submitted 29 January, 2024; originally announced February 2024.

    Comments: 8 pages + Appendix, Total 12 pages

  11. arXiv:2401.16515  [pdf, other

    cs.ET eess.SP eess.SY physics.optics

    Dynamic Electro-Optic Analog Memory for Neuromorphic Photonic Computing

    Authors: Sean Lam, Ahmed Khaled, Simon Bilodeau, Bicky A. Marquez, Paul R. Prucnal, Lukas Chrostowski, Bhavin J. Shastri, Sudip Shekhar

    Abstract: Artificial intelligence (AI) has seen remarkable advancements across various domains, including natural language processing, computer vision, autonomous vehicles, and biology. However, the rapid expansion of AI technologies has escalated the demand for more powerful computing resources. As digital computing approaches fundamental limits, neuromorphic photonics emerges as a promising platform to co… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 22 pages, 10 figures

  12. arXiv:2311.16896  [pdf, other

    physics.optics cs.ET physics.app-ph

    65 GOPS/neuron Photonic Tensor Core with Thin-film Lithium Niobate Photonics

    Authors: Zhongjin Lin, Bhavin J. Shastri, Shangxuan Yu, Jingxiang Song, Yuntao Zhu, Arman Safarnejadian, Wangning Cai, Yanmei Lin, Wei Ke, Mustafa Hammood, Tianye Wang, Mengyue Xu, Zibo Zheng, Mohammed Al-Qadasi, Omid Esmaeeli, Mohamed Rahim, Grzegorz Pakulski, Jens Schmid, Pedro Barrios, Weihong Jiang, Hugh Morison, Matthew Mitchell, Xiaogang Qiang, Xun Guan, Nicolas A. F. Jaeger , et al. (6 additional authors not shown)

    Abstract: Photonics offers a transformative approach to artificial intelligence (AI) and neuromorphic computing by providing low latency, high bandwidth, and energy-efficient computations. Here, we introduce a photonic tensor core processor enabled by time-multiplexed inputs and charge-integrated outputs. This fully integrated processor, comprising only two thin-film lithium niobate (TFLN) modulators, a III… ▽ More

    Submitted 30 November, 2023; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: 19 pages, 6 figures

    MSC Class: 78A05

  13. arXiv:2311.12825  [pdf, ps, other

    cs.AI cs.LG stat.ME

    A PSO Based Method to Generate Actionable Counterfactuals for High Dimensional Data

    Authors: Shashank Shekhar, Asif Salim, Adesh Bansode, Vivaswan Jinturkar, Anirudha Nayak

    Abstract: Counterfactual explanations (CFE) are methods that explain a machine learning model by giving an alternate class prediction of a data point with some minimal changes in its features. It helps the users to identify their data attributes that caused an undesirable prediction like a loan or credit card rejection. We describe an efficient and an actionable counterfactual (CF) generation method based o… ▽ More

    Submitted 30 November, 2023; v1 submitted 30 September, 2023; originally announced November 2023.

    Comments: Accepted in IEEE CSDE 2023

  14. arXiv:2310.19384  [pdf, other

    stat.ML cs.LG

    Deep anytime-valid hypothesis testing

    Authors: Teodora Pandeva, Patrick Forré, Aaditya Ramdas, Shubhanshu Shekhar

    Abstract: We propose a general framework for constructing powerful, sequential hypothesis tests for a large class of nonparametric testing problems. The null hypothesis for these problems is defined in an abstract form using the action of two known operators on the data distribution. This abstraction allows for a unified treatment of several classical tasks, such as two-sample testing, independence testing,… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  15. arXiv:2310.15179  [pdf, other

    physics.ao-ph cs.AI cs.LG math.DS stat.OT

    Reducing Uncertainty in Sea-level Rise Prediction: A Spatial-variability-aware Approach

    Authors: Subhankar Ghosh, Shuai An, Arun Sharma, Jayant Gupta, Shashi Shekhar, Aneesh Subramanian

    Abstract: Given multi-model ensemble climate projections, the goal is to accurately and reliably predict future sea-level rise while lowering the uncertainty. This problem is important because sea-level rise affects millions of people in coastal communities and beyond due to climate change's impacts on polar ice sheets and the ocean. This problem is challenging due to spatial variability and unknowns such a… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 6 pages, 5 figures, I-GUIDE 2023 conference

    ACM Class: J.2; I.2.m; I.2.6; I.2.1; I.2

  16. arXiv:2310.01547  [pdf, other

    math.ST cs.IT cs.LG stat.AP stat.ML

    On the near-optimality of betting confidence sets for bounded means

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: Constructing nonasymptotic confidence intervals (CIs) for the mean of a univariate distribution from independent and identically distributed (i.i.d.) observations is a fundamental task in statistics. For bounded observations, a classical nonparametric approach proceeds by inverting standard concentration bounds, such as Hoeffding's or Bernstein's inequalities. Recently, an alternative betting-base… ▽ More

    Submitted 24 November, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 53 pages, 2 figures

  17. arXiv:2309.09111  [pdf, ps, other

    math.ST cs.LG stat.ME stat.ML

    Reducing sequential change detection to sequential estimation

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: We consider the problem of sequential change detection, where the goal is to design a scheme for detecting any changes in a parameter or functional $θ$ of the data stream distribution that has small detection delay, but guarantees control on the frequency of false alarms in the absence of changes. In this paper, we describe a simple reduction from sequential change detection to sequential estimati… ▽ More

    Submitted 24 November, 2023; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: 11 pages

  18. arXiv:2308.03977  [pdf, other

    cs.CV cs.LG

    PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation Learning

    Authors: Florian Bordes, Shashank Shekhar, Mark Ibrahim, Diane Bouchacourt, Pascal Vincent, Ari S. Morcos

    Abstract: Synthetic image datasets offer unmatched advantages for designing and evaluating deep neural networks: they make it possible to (i) render as many data samples as needed, (ii) precisely control each scene and yield granular ground truth labels (and captions), (iii) precisely control distribution shifts between training and testing to isolate variables of interest for sound experimentation. Despite… ▽ More

    Submitted 12 December, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  19. arXiv:2308.03360  [pdf

    cs.CL

    Coupling Symbolic Reasoning with Language Modeling for Efficient Longitudinal Understanding of Unstructured Electronic Medical Records

    Authors: Shivani Shekhar, Simran Tiwari, T. C. Rensink, Ramy Eskander, Wael Salloum

    Abstract: The application of Artificial Intelligence (AI) in healthcare has been revolutionary, especially with the recent advancements in transformer-based Large Language Models (LLMs). However, the task of understanding unstructured electronic medical records remains a challenge given the nature of the records (e.g., disorganization, inconsistency, and redundancy) and the inability of LLMs to derive reaso… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  20. arXiv:2307.01401  [pdf, other

    cs.CL

    Multi-Task Learning Improves Performance In Deep Argument Mining Models

    Authors: Amirhossein Farzam, Shashank Shekhar, Isaac Mehlhaff, Marco Morucci

    Abstract: The successful analysis of argumentative techniques from user-generated text is central to many downstream tasks such as political and market analysis. Recent argument mining tools use state-of-the-art deep learning methods to extract and annotate argumentative techniques from various online text corpora, however each task is treated as separate and different bespoke models are fine-tuned for each… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

  21. arXiv:2306.00931  [pdf, other

    cs.CV cs.CL

    "Let's not Quote out of Context": Unified Vision-Language Pretraining for Context Assisted Image Captioning

    Authors: Abisek Rajakumar Kalarani, Pushpak Bhattacharyya, Niyati Chhaya, Sumit Shekhar

    Abstract: Well-formed context aware image captions and tags in enterprise content such as marketing material are critical to ensure their brand presence and content recall. Manual creation and updates to ensure the same is non trivial given the scale and the tedium towards this task. We propose a new unified Vision-Language (VL) model based on the One For All (OFA) model, with a focus on context-assisted im… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  22. arXiv:2305.09675  [pdf, other

    cs.CY

    Spatial Computing Opportunities in Biomedical Decision Support: The Atlas-EHR Vision

    Authors: Majid Farhadloo, Arun Sharma, Shashi Shekhar, Svetomir N. Markovic

    Abstract: We consider the problem of reducing the time needed by healthcare professionals to understand patient medical history via the next generation of biomedical decision support. This problem is societally important because it has the potential to improve healthcare quality and patient outcomes. However, navigating electronic health records is challenging due to the high patient-doctor ratios, potentia… ▽ More

    Submitted 28 February, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

  23. arXiv:2305.06884  [pdf, ps, other

    stat.ME cs.AI cs.LG math.ST stat.AP stat.ML

    Risk-limiting Financial Audits via Weighted Sampling without Replacement

    Authors: Shubhanshu Shekhar, Ziyu Xu, Zachary C. Lipton, Pierre J. Liang, Aaditya Ramdas

    Abstract: We introduce the notion of a risk-limiting financial auditing (RLFA): given $N$ transactions, the goal is to estimate the total misstated monetary fraction~($m^*$) to a given accuracy $ε$, with confidence $1-δ$. We do this by constructing new confidence sequences (CSs) for the weighted average of $N$ unknown values, based on samples drawn without replacement according to a (randomized) weighted sa… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 23 pages, 8 figures, to appear in the Proceedings of Uncertainty in Artificial Intelligence (UAI) 2023

  24. arXiv:2304.13807  [pdf, other

    cs.NE

    A Survey on Solving and Discovering Differential Equations Using Deep Neural Networks

    Authors: Hyeonjung, Jung, Jayant Gupta, Bharat Jayaprakash, Matthew Eagon, Harish Panneer Selvam, Carl Molnar, William Northrop, Shashi Shekhar

    Abstract: Ordinary and partial differential equations (DE) are used extensively in scientific and mathematical domains to model physical systems. Current literature has focused primarily on deep neural network (DNN) based methods for solving a specific DE or a family of DEs. Research communities with a history of using DE models may view DNN-based differential equation solvers (DNN-DEs) as a faster and tran… ▽ More

    Submitted 19 June, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: Under review for ACM Computing Surveys journal. 29 pages

  25. arXiv:2304.13089  [pdf, other

    cs.LG cs.CV eess.IV

    Objectives Matter: Understanding the Impact of Self-Supervised Objectives on Vision Transformer Representations

    Authors: Shashank Shekhar, Florian Bordes, Pascal Vincent, Ari Morcos

    Abstract: Joint-embedding based learning (e.g., SimCLR, MoCo, DINO) and reconstruction-based learning (e.g., BEiT, SimMIM, MAE) are the two leading paradigms for self-supervised learning of vision transformers, but they differ substantially in their transfer performance. Here, we aim to explain these differences by analyzing the impact of these objectives on the structure and transferability of the learned… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

  26. arXiv:2304.12210  [pdf, other

    cs.LG cs.CV

    A Cookbook of Self-Supervised Learning

    Authors: Randall Balestriero, Mark Ibrahim, Vlad Sobal, Ari Morcos, Shashank Shekhar, Tom Goldstein, Florian Bordes, Adrien Bardes, Gregoire Mialon, Yuandong Tian, Avi Schwarzschild, Andrew Gordon Wilson, Jonas Geiping, Quentin Garrido, Pierre Fernandez, Amir Bar, Hamed Pirsiavash, Yann LeCun, Micah Goldblum

    Abstract: Self-supervised learning, dubbed the dark matter of intelligence, is a promising path to advance machine learning. Yet, much like cooking, training SSL methods is a delicate art with a high barrier to entry. While many components are familiar, successfully training a SSL method involves a dizzying set of choices from the pretext tasks to training hyper-parameters. Our goal is to lower the barrier… ▽ More

    Submitted 28 June, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

  27. arXiv:2302.14757  [pdf, other

    cs.MM cs.IR cs.SD eess.AS

    Audio Retrieval for Multimodal Design Documents: A New Dataset and Algorithms

    Authors: Prachi Singh, Srikrishna Karanam, Sumit Shekhar

    Abstract: We consider and propose a new problem of retrieving audio files relevant to multimodal design document inputs comprising both textual elements and visual imagery, e.g., birthday/greeting cards. In addition to enhancing user experience, integrating audio that matches the theme/style of these inputs also helps improve the accessibility of these documents (e.g., visually impaired people can listen to… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: 5 pages including references

  28. arXiv:2302.02544  [pdf, other

    math.ST cs.IT cs.LG stat.ME stat.ML

    Sequential change detection via backward confidence sequences

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: We present a simple reduction from sequential estimation to sequential changepoint detection (SCD). In short, suppose we are interested in detecting changepoints in some parameter or functional $θ$ of the underlying distribution. We demonstrate that if we can construct a confidence sequence (CS) for $θ$, then we can also successfully perform SCD for $θ$. This is accomplished by checking if two CSs… ▽ More

    Submitted 5 February, 2023; originally announced February 2023.

    Comments: 24 pages, 10 figures

  29. arXiv:2301.05739  [pdf, other

    cs.LG cs.AI

    Eco-PiNN: A Physics-informed Neural Network for Eco-toll Estimation

    Authors: Yan Li, Mingzhou Yang, Matthew Eagon, Majid Farhadloo, Yiqun Xie, William F. Northrop, Shashi Shekhar

    Abstract: The eco-toll estimation problem quantifies the expected environmental cost (e.g., energy consumption, exhaust emissions) for a vehicle to travel along a path. This problem is important for societal applications such as eco-routing, which aims to find paths with the lowest exhaust emissions or energy need. The challenges of this problem are three-fold: (1) the dependence of a vehicle's eco-toll on… ▽ More

    Submitted 18 January, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

    Comments: Full version of the paper accepted for the SDM23 conference; Yan Li and Mingzhou Yang contributed equally to this paper

  30. arXiv:2301.00750  [pdf, other

    cs.GR cs.CV

    Interactive Control over Temporal Consistency while Stylizing Video Streams

    Authors: Sumit Shekhar, Max Reimann, Moritz Hilscher, Amir Semmo, Jürgen Döllner, Matthias Trapp

    Abstract: Image stylization has seen significant advancement and widespread interest over the years, leading to the development of a multitude of techniques. Extending these stylization techniques, such as Neural Style Transfer (NST), to videos is often achieved by applying them on a per-frame basis. However, per-frame stylization usually lacks temporal consistency, expressed by undesirable flickering artif… ▽ More

    Submitted 29 June, 2023; v1 submitted 2 January, 2023; originally announced January 2023.

  31. arXiv:2301.00270  [pdf, other

    cs.SI cs.LG

    NetEffect: Discovery and Exploitation of Generalized Network Effects

    Authors: Meng-Chieh Lee, Shubhranshu Shekhar, Jaemin Yoo, Christos Faloutsos

    Abstract: Given a large graph with few node labels, how can we (a) identify whether there is generalized network-effects (GNE) or not, (b) estimate GNE to explain the interrelations among node classes, and (c) exploit GNE efficiently to improve the performance on downstream tasks? The knowledge of GNE is valuable for various tasks like node classification, and targeted advertising. However, identifying GNE… ▽ More

    Submitted 12 February, 2024; v1 submitted 31 December, 2022; originally announced January 2023.

    Comments: Accepted to PAKDD 2024

  32. arXiv:2212.09108  [pdf, ps, other

    stat.ME cs.LG math.ST stat.ML

    A Permutation-Free Kernel Independence Test

    Authors: Shubhanshu Shekhar, Ilmun Kim, Aaditya Ramdas

    Abstract: In nonparametric independence testing, we observe i.i.d.\ data $\{(X_i,Y_i)\}_{i=1}^n$, where $X \in \mathcal{X}, Y \in \mathcal{Y}$ lie in any general spaces, and we wish to test the null that $X$ is independent of $Y$. Modern test statistics such as the kernel Hilbert-Schmidt Independence Criterion (HSIC) and Distance Covariance (dCov) have intractable null distributions due to the degeneracy of… ▽ More

    Submitted 18 December, 2022; originally announced December 2022.

    Comments: 52 pages, 4 figures

  33. arXiv:2212.04617  [pdf, other

    eess.IV cs.CV cs.LG

    UNet Based Pipeline for Lung Segmentation from Chest X-Ray Images

    Authors: Shashank Shekhar, Ritika Nandi, H Srikanth Kamath

    Abstract: Biomedical image segmentation is one of the fastest growing fields which has seen extensive automation through the use of Artificial Intelligence. This has enabled widespread adoption of accurate techniques to expedite the screening and diagnostic processes which would otherwise take several days to finalize. In this paper, we present an end-to-end pipeline to segment lungs from chest X-ray images… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: 6 Pages

  34. arXiv:2211.14908  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    A Permutation-free Kernel Two-Sample Test

    Authors: Shubhanshu Shekhar, Ilmun Kim, Aaditya Ramdas

    Abstract: The kernel Maximum Mean Discrepancy~(MMD) is a popular multivariate distance metric between distributions that has found utility in two-sample testing. The usual kernel-MMD test statistic is a degenerate U-statistic under the null, and thus it has an intractable limiting distribution. Hence, to design a level-$α$ test, one usually selects the rejection threshold as the $(1-α)$-quantile of the perm… ▽ More

    Submitted 4 February, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: Published at the Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS), with an oral presentation

  35. arXiv:2211.03317  [pdf, ps, other

    cs.IT

    Instantaneous Channel Oblivious Phase Shift Design for an IRS-Assisted SIMO System with Quantized Phase Shift

    Authors: Shashank Shekhar, Athira Subhash, Tejesh Kella, Sheetal Kalyani

    Abstract: We design the phase shifts of an intelligent reflecting surface (IRS)-assisted single-input-multiple-output communication system to minimize the outage probability (OP) and to maximize the ergodic rate. Our phase shifts design uses only statistical channel state information since these depend only on the large-scale fading coefficients; the obtained phase shift design remains valid for a longer ti… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

  36. arXiv:2211.02927  [pdf, other

    cs.CY cs.CR cs.LG

    Unsupervised Machine Learning for Explainable Health Care Fraud Detection

    Authors: Shubhranshu Shekhar, Jetson Leder-Luis, Leman Akoglu

    Abstract: The US federal government spends more than a trillion dollars per year on health care, largely provided by private third parties and reimbursed by the government. A major concern in this system is overbilling, waste and fraud by providers, who face incentives to misreport on their claims in order to receive higher payments. In this paper, we develop novel machine learning tools to identify provide… ▽ More

    Submitted 23 February, 2023; v1 submitted 5 November, 2022; originally announced November 2022.

    Comments: NBER Working paper #30946

  37. arXiv:2210.08879  [pdf, other

    cs.RO cs.AI cs.HC

    Robust Planning for Human-Robot Joint Tasks with Explicit Reasoning on Human Mental State

    Authors: Anthony Favier, Shashank Shekhar, Rachid Alami

    Abstract: We consider the human-aware task planning problem where a human-robot team is given a shared task with a known objective to achieve. Recent approaches tackle it by modeling it as a team of independent, rational agents, where the robot plans for both agents' (shared) tasks. However, the robot knows that humans cannot be administered like artificial agents, so it emulates and predicts the human's de… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: 10 pages, 2 figures, 1 table, AI-HRI AAAI 2022 Fall Symposium Series

    Report number: AIHRI/2022/8188

  38. arXiv:2210.04081  [pdf, other

    cs.LG cs.SI

    Less is More: SlimG for Accurate, Robust, and Interpretable Graph Mining

    Authors: Jaemin Yoo, Meng-Chieh Lee, Shubhranshu Shekhar, Christos Faloutsos

    Abstract: How can we solve semi-supervised node classification in various graphs possibly with noisy features and structures? Graph neural networks (GNNs) have succeeded in many graph mining tasks, but their generalizability to various graph scenarios is limited due to the difficulty of training, hyperparameter tuning, and the selection of a model itself. Einstein said that we should "make everything as sim… ▽ More

    Submitted 16 June, 2023; v1 submitted 8 October, 2022; originally announced October 2022.

    Comments: Accepted to KDD 2023

  39. arXiv:2209.09207  [pdf, other

    cs.CV

    Table Detection in the Wild: A Novel Diverse Table Detection Dataset and Method

    Authors: Mrinal Haloi, Shashank Shekhar, Nikhil Fande, Siddhant Swaroop Dash, Sanjay G

    Abstract: Recent deep learning approaches in table detection achieved outstanding performance and proved to be effective in identifying document layouts. Currently, available table detection benchmarks have many limitations, including the lack of samples diversity, simple table structure, the lack of training cases, and samples quality. In this paper, we introduce a diverse large-scale dataset for table det… ▽ More

    Submitted 30 November, 2023; v1 submitted 31 August, 2022; originally announced September 2022.

    Comments: Open source Table detection dataset and baseline results

    MSC Class: 68T45

  40. arXiv:2208.03664  [pdf, ps, other

    cs.IT

    SINR Analysis of an IRS Assisted MU-MISO System

    Authors: Lakshmi Jayalal, Shashank Shekhar, Athira Subhash, Sheetal Kalyani

    Abstract: In this work, we characterize the outage probability (OP) of an intelligent reflecting surface (IRS) assisted multi-user multiple-input-single-output (MU-MISO) communication system. Using a two-step approximation method, we approximate the signal-to-interference-plus-noise ratio (SINR) for any downlink user by a Log-Normal random variable. The impact of various system parameters is studied using t… ▽ More

    Submitted 7 August, 2022; originally announced August 2022.

  41. arXiv:2207.07219  [pdf, other

    cs.NI

    Software-defined Dynamic 5G Network Slice Management for Industrial Internet of Things

    Authors: Ziran Min, Shashank Shekhar, Charif Mahmoudi, Valerio Formicola, Swapna Gokhale, Aniruddha Gokhale

    Abstract: This paper addresses the challenges of delivering fine-grained Quality of Service (QoS) and communication determinism over 5G wireless networks for real-time and autonomous needs of Industrial Internet of Things (IIoT) applications while effectively sharing network resources. Specifically, this work presents DANSM, a software-defined, dynamic and autonomous network slice management middleware for… ▽ More

    Submitted 11 November, 2022; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: 8 pages, 8 figures, conference

  42. arXiv:2206.14486  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Beyond neural scaling laws: beating power law scaling via data pruning

    Authors: Ben Sorscher, Robert Geirhos, Shashank Shekhar, Surya Ganguli, Ari S. Morcos

    Abstract: Widely observed neural scaling laws, in which error falls off as a power of the training set size, model size, or both, have driven substantial performance improvements in deep learning. However, these improvements through scaling alone require considerable costs in compute and energy. Here we focus on the scaling of error with dataset size and show how in theory we can break beyond power law scal… ▽ More

    Submitted 21 April, 2023; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: Outstanding Paper Award @ NeurIPS 2022. Added github link to metric scores

  43. arXiv:2206.12753  [pdf, other

    cs.DB cs.CV cs.DC cs.LG

    Spatiotemporal Data Mining: A Survey

    Authors: Arun Sharma, Zhe Jiang, Shashi Shekhar

    Abstract: Spatiotemporal data mining aims to discover interesting, useful but non-trivial patterns in big spatial and spatiotemporal data. They are used in various application domains such as public safety, ecology, epidemiology, earth science, etc. This problem is challenging because of the high societal cost of spurious patterns and exorbitant computational cost. Recent surveys of spatiotemporal data mini… ▽ More

    Submitted 25 June, 2022; originally announced June 2022.

  44. arXiv:2205.12840  [pdf, other

    cs.CV

    SALAD: Source-free Active Label-Agnostic Domain Adaptation for Classification, Segmentation and Detection

    Authors: Divya Kothandaraman, Sumit Shekhar, Abhilasha Sancheti, Manoj Ghuhan, Tripti Shukla, Dinesh Manocha

    Abstract: We present a novel method, SALAD, for the challenging vision task of adapting a pre-trained "source" domain network to a "target" domain, with a small budget for annotation in the "target" domain and a shift in the label space. Further, the task assumes that the source data is not available for adaptation, due to privacy concerns or otherwise. We postulate that such systems need to jointly optimiz… ▽ More

    Submitted 22 October, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

  45. arXiv:2203.15760  [pdf, ps, other

    cs.IT

    A New Expression for the Product of Two $κ-μ$ Shadowed Random Variables and its Application to Wireless Communication

    Authors: Shashank Shekhar, Sheetal Kalyani

    Abstract: In this work, the product of two independent and non-identically distributed (i.n.i.d) $κ- μ$ shadowed random variables is studied. We derive the series expression for the probability density function (PDF), cumulative distribution function (CDF), and moment generating function (MGF) of the product of two (i.n.i.d) $κ- μ$ shadowed random variables. The derived formulation in this work is quite gen… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

  46. arXiv:2203.06297  [pdf, other

    cs.LG stat.ML

    Instance-Dependent Regret Analysis of Kernelized Bandits

    Authors: Shubhanshu Shekhar, Tara Javidi

    Abstract: We study the kernelized bandit problem, that involves designing an adaptive strategy for querying a noisy zeroth-order-oracle to efficiently learn about the optimizer of an unknown function $f$ with a norm bounded by $M<\infty$ in a Reproducing Kernel Hilbert Space~(RKHS) associated with a positive definite kernel $K$. Prior results, working in a \emph{minimax framework}, have characterized the wo… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 26 pages, 1 figure

  47. arXiv:2203.04889  [pdf, other

    cs.CV cs.GR

    Low-light Image and Video Enhancement via Selective Manipulation of Chromaticity

    Authors: Sumit Shekhar, Max Reimann, Amir Semmo, Sebastian Pasewaldt, Jürgen Döllner, Matthias Trapp

    Abstract: Image acquisition in low-light conditions suffers from poor quality and significant degradation in visual aesthetics. This affects the visual perception of the acquired image and the performance of various computer vision and image processing algorithms applied after acquisition. Especially for videos, the additional temporal domain makes it more challenging, wherein we need to preserve quality in… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

  48. arXiv:2201.08901  [pdf

    cs.CV

    An Ensemble Model for Face Liveness Detection

    Authors: Shashank Shekhar, Avinash Patel, Mrinal Haloi, Asif Salim

    Abstract: In this paper, we present a passive method to detect face presentation attack a.k.a face liveness detection using an ensemble deep learning technique. Face liveness detection is one of the key steps involved in user identity verification of customers during the online onboarding/transaction processes. During identity verification, an unauthenticated user tries to bypass the verification system by… ▽ More

    Submitted 19 January, 2022; originally announced January 2022.

    Comments: Accepted and presented at MLDM 2022. To be published in Lattice journal

  49. arXiv:2201.06955  [pdf, other

    cs.CY cs.DB cs.HC

    Understanding COVID-19 Effects on Mobility: A Community-Engaged Approach

    Authors: Arun Sharma, Majid Farhadloo, Yan Li, Aditya Kulkarni, Jayant Gupta, Shashi Shekhar

    Abstract: Given aggregated mobile device data, the goal is to understand the impact of COVID-19 policy interventions on mobility. This problem is vital due to important societal use cases, such as safely reopening the economy. Challenges include understanding and interpreting questions of interest to policymakers, cross-jurisdictional variability in choice and time of interventions, the large data volume, a… ▽ More

    Submitted 10 January, 2022; originally announced January 2022.

  50. arXiv:2201.06433  [pdf, other

    cs.LG

    A Comparative study of Hyper-Parameter Optimization Tools

    Authors: Shashank Shekhar, Adesh Bansode, Asif Salim

    Abstract: Most of the machine learning models have associated hyper-parameters along with their parameters. While the algorithm gives the solution for parameters, its utility for model performance is highly dependent on the choice of hyperparameters. For a robust performance of a model, it is necessary to find out the right hyper-parameter combination. Hyper-parameter optimization (HPO) is a systematic proc… ▽ More

    Submitted 17 January, 2022; originally announced January 2022.

    Comments: Selected and presented at IEEE CSDE 2021. To be published in Proceedings of IEEE CSDE 2021