Zum Hauptinhalt springen

Showing 1–43 of 43 results for author: Shukla, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.10993  [pdf, other

    cs.CV

    Facial Demorphing via Identity Preserving Image Decomposition

    Authors: Nitish Shukla, Arun Ross

    Abstract: A face morph is created by combining the face images usually pertaining to two distinct identities. The goal is to generate an image that can be matched with two identities thereby undermining the security of a face recognition system. To deal with this problem, several morph attack detection techniques have been developed. But these methods do not extract any information about the underlying bona… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  2. arXiv:2407.20192  [pdf, other

    cs.LG eess.SY

    Time series forecasting with high stakes: A field study of the air cargo industry

    Authors: Abhinav Garg, Naman Shukla, Maarten Wormer

    Abstract: Time series forecasting in the air cargo industry presents unique challenges due to volatile market dynamics and the significant impact of accurate forecasts on generated revenue. This paper explores a comprehensive approach to demand forecasting at the origin-destination (O\&D) level, focusing on the development and implementation of machine learning models in decision-making for the air cargo in… ▽ More

    Submitted 13 August, 2024; v1 submitted 29 July, 2024; originally announced July 2024.

    Comments: The 10th Mining and Learning from Time Series Workshop: From Classical Methods to LLMs. SIGKDD, Barcelona, Spain, 6 page

  3. arXiv:2404.07449  [pdf, other

    cs.CV

    Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs

    Authors: Kanchana Ranasinghe, Satya Narayan Shukla, Omid Poursaeed, Michael S. Ryoo, Tsung-Yu Lin

    Abstract: Integration of Large Language Models (LLMs) into visual domain tasks, resulting in visual-LLMs (V-LLMs), has enabled exceptional performance in vision-language tasks, particularly for visual question answering (VQA). However, existing V-LLMs (e.g. BLIP-2, LLaVA) demonstrate weak spatial reasoning and localization awareness. Despite generating highly descriptive and elaborate textual answers, these… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  4. arXiv:2403.06436  [pdf

    cs.ET math.OC physics.app-ph

    Designing a K-state P-bit Engine

    Authors: Mohammad Khairul Bashar, Abir Hasan, Nikhil Shukla

    Abstract: Probabilistic bit (p-bit)-based compute engines utilize the unique capability of a p-bit to probabilistically switch between two states to solve computationally challenging problems. However, when solving problems that require more than two states (e.g., problems such as Max-3-Cut, verifying if a graph is K-partite (K>2) etc.), additional pre-processing steps such as graph reduction are required t… ▽ More

    Submitted 27 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  5. arXiv:2312.16339  [pdf, other

    cs.CV cs.LG

    Universal Pyramid Adversarial Training for Improved ViT Performance

    Authors: Ping-yeh Chiang, Yipin Zhou, Omid Poursaeed, Satya Narayan Shukla, Ashish Shah, Tom Goldstein, Ser-Nam Lim

    Abstract: Recently, Pyramid Adversarial training (Herrmann et al., 2022) has been shown to be very effective for improving clean accuracy and distribution-shift robustness of vision transformers. However, due to the iterative nature of adversarial training, the technique is up to 7 times more expensive than standard training. To make the method more efficient, we propose Universal Pyramid Adversarial traini… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  6. arXiv:2310.18375  [pdf

    cs.AR cs.CR

    CMOS-based Single-Cycle In-Memory XOR/XNOR

    Authors: Shamiul Alam, Jack Hutchins, Nikhil Shukla, Kazi Asifuzzaman, Ahmedullah Aziz

    Abstract: Big data applications are on the rise, and so is the number of data centers. The ever-increasing massive data pool needs to be periodically backed up in a secure environment. Moreover, a massive amount of securely backed-up data is required for training binary convolutional neural networks for image classification. XOR and XNOR operations are essential for large-scale data copy verification, encry… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 12 pages, 6 figures, 1 table

  7. arXiv:2310.09322  [pdf

    math.OC cs.ET math.DS

    A Note on Analyzing the Stability of Oscillator Ising Machines

    Authors: Mohammad Khairul Bashar, Zongli Lin, Nikhil Shukla

    Abstract: The rich non-linear dynamics of the coupled oscillators (under second harmonic injection) can be leveraged to solve computationally hard problems in combinatorial optimization such as finding the ground state of the Ising Hamiltonian. While prior work on the stability of the so-called Oscillator Ising Machines (OIMs) has used the linearization method, in this letter, we present a complementary met… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  8. arXiv:2309.11569  [pdf, other

    cs.CV

    Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding

    Authors: Mohamed Afham, Satya Narayan Shukla, Omid Poursaeed, Pengchuan Zhang, Ashish Shah, Sernam Lim

    Abstract: While most modern video understanding models operate on short-range clips, real-world videos are often several minutes long with semantically consistent segments of variable length. A common approach to process long videos is applying a short-form video model over uniformly sampled clips of fixed temporal length and aggregating the outputs. This approach neglects the underlying nature of long vide… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

  9. arXiv:2308.16884  [pdf, other

    cs.CL cs.AI cs.LG

    The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants

    Authors: Lucas Bandarkar, Davis Liang, Benjamin Muller, Mikel Artetxe, Satya Narayan Shukla, Donald Husa, Naman Goyal, Abhinandan Krishnan, Luke Zettlemoyer, Madian Khabsa

    Abstract: We present Belebele, a multiple-choice machine reading comprehension (MRC) dataset spanning 122 language variants. Significantly expanding the language coverage of natural language understanding (NLU) benchmarks, this dataset enables the evaluation of text models in high-, medium-, and low-resource languages. Each question is based on a short passage from the Flores-200 dataset and has four multip… ▽ More

    Submitted 25 July, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: ACL 2024

    ACM Class: I.2.7

  10. arXiv:2308.11442  [pdf, other

    cs.CV

    SDeMorph: Towards Better Facial De-morphing from Single Morph

    Authors: Nitish Shukla

    Abstract: Face Recognition Systems (FRS) are vulnerable to morph attacks. A face morph is created by combining multiple identities with the intention to fool FRS and making it match the morph with multiple identities. Current Morph Attack Detection (MAD) can detect the morph but are unable to recover the identities used to create the morph with satisfactory outcomes. Existing work in de-morphing is mostly r… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  11. arXiv:2307.01719  [pdf, other

    q-fin.PM cs.AI cs.LG

    MOPO-LSI: A User Guide

    Authors: Yong Zheng, Kumar Neelotpal Shukla, Jasmine Xu, David, Wang, Michael O'Leary

    Abstract: MOPO-LSI is an open-source Multi-Objective Portfolio Optimization Library for Sustainable Investments. This document provides a user guide for MOPO-LSI version 1.0, including problem setup, workflow and the hyper-parameters in configurations.

    Submitted 12 July, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

  12. arXiv:2304.04386  [pdf, ps, other

    cs.LG cs.CR cs.CV

    Generating Adversarial Attacks in the Latent Space

    Authors: Nitish Shukla, Sudipta Banerjee

    Abstract: Adversarial attacks in the input (pixel) space typically incorporate noise margins such as $L_1$ or $L_{\infty}$-norm to produce imperceptibly perturbed data that confound deep learning networks. Such noise margins confine the magnitude of permissible noise. In this work, we propose injecting adversarial perturbations in the latent (feature) space using a generative adversarial network, removing t… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

  13. arXiv:2303.13974   

    cs.LG

    Mixed-Type Wafer Classification For Low Memory Devices Using Knowledge Distillation

    Authors: Nitish Shukla, Anurima Dey, Srivatsan K

    Abstract: Manufacturing wafers is an intricate task involving thousands of steps. Defect Pattern Recognition (DPR) of wafer maps is crucial for determining the root cause of production defects, which may further provide insight for yield improvement in wafer foundry. During manufacturing, various defects may appear standalone in the wafer or may appear as different combinations. Identifying multiple defects… ▽ More

    Submitted 18 October, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: Study is not relevant

  14. arXiv:2303.13827   

    cs.CV cs.LG

    Efficient Mixed-Type Wafer Defect Pattern Recognition Using Compact Deformable Convolutional Transformers

    Authors: Nitish Shukla

    Abstract: Manufacturing wafers is an intricate task involving thousands of steps. Defect Pattern Recognition (DPR) of wafer maps is crucial to find the root cause of the issue and further improving the yield in the wafer foundry. Mixed-type DPR is much more complicated compared to single-type DPR due to varied spatial features, the uncertainty of defects, and the number of defects present. To accurately pre… ▽ More

    Submitted 16 October, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: Study is not relevant

  15. arXiv:2303.11632   

    cs.CV cs.LG eess.IV

    An Embarrassingly Simple Approach for Wafer Feature Extraction and Defect Pattern Recognition

    Authors: Nitish Shukla

    Abstract: Identifying defect patterns in a wafer map during manufacturing is crucial to find the root cause of the underlying issue and provides valuable insights on improving yield in the foundry. Currently used methods use deep neural networks to identify the defects. These methods are generally very huge and have significant inference time. They also require GPU support to efficiently operate. All these… ▽ More

    Submitted 16 October, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: study is not relevant

  16. arXiv:2211.04934  [pdf, other

    cs.CL

    DoSA : A System to Accelerate Annotations on Business Documents with Human-in-the-Loop

    Authors: Neelesh K Shukla, Msp Raja, Raghu Katikeri, Amit Vaid

    Abstract: Business documents come in a variety of structures, formats and information needs which makes information extraction a challenging task. Due to these variations, having a document generic model which can work well across all types of documents and for all the use cases seems far-fetched. For document-specific models, we would need customized document-specific labels. We introduce DoSA (Document Sp… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: Accepted at DaSH@EMNLP2022, 5 pages, 4 figures

  17. arXiv:2206.05907  [pdf

    math.OC cs.CC

    Computational Models based on Synchronized Oscillators for Solving Combinatorial Optimization Problems

    Authors: Antik Mallick, Mohammad Khairul Bashar, Zongli Lin, Nikhil Shukla

    Abstract: The equivalence between the natural minimization of energy in a dynamical system and the minimization of an objective function characterizing a combinatorial optimization problem offers a promising approach to designing dynamical system-inspired computational models and solvers for such problems. For instance, the ground state energy of coupled electronic oscillators, under second harmonic injecti… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: 38 pages, 10 figures

  18. arXiv:2205.14729  [pdf

    physics.app-ph cs.ET

    CMOS-Compatible Ising Machines built using Bistable Latches Coupled through Ferroelectric Transistor Arrays

    Authors: Antik Mallick, Zijian Zhao, Mohammad Khairul Bashar, Shamiul Alam, Md Mazharul Islam, Yi Xiao, Yixin Xu, Ahmedullah Aziz, Vijaykrishnan Narayanan, Kai Ni, Nikhil Shukla

    Abstract: Realizing compact and scalable Ising machines that are compatible with CMOS-process technology is crucial to the effectiveness and practicality of using such hardware platforms for accelerating computationally intractable problems. Besides the need for realizing compact Ising spins, the implementation of the coupling network, which describes the spin interaction, is also a potential bottleneck in… ▽ More

    Submitted 29 May, 2022; originally announced May 2022.

    Comments: 29 pages, 10 figures

  19. arXiv:2203.08267  [pdf

    cs.CV cs.LG

    2-speed network ensemble for efficient classification of incremental land-use/land-cover satellite image chips

    Authors: Michael James Horry, Subrata Chakraborty, Biswajeet Pradhan, Nagesh Shukla, Sanjoy Paul

    Abstract: The ever-growing volume of satellite imagery data presents a challenge for industry and governments making data-driven decisions based on the timely analysis of very large data sets. Commonly used deep learning algorithms for automatic classification of satellite images are time and resource-intensive to train. The cost of retraining in the context of Big Data presents a practical challenge when n… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: 24 pages, 9 figures, 5 tables

  20. arXiv:2111.14938  [pdf, other

    cs.LG cs.AI cs.CE econ.EM

    Distribution Shift in Airline Customer Behavior during COVID-19

    Authors: Abhinav Garg, Naman Shukla, Lavanya Marla, Sriram Somanchi

    Abstract: Traditional AI approaches in customized (personalized) contextual pricing applications assume that the data distribution at the time of online pricing is similar to that observed during training. However, this assumption may be violated in practice because of the dynamic nature of customer buying patterns, particularly due to unanticipated system shocks such as COVID-19. We study the changes in cu… ▽ More

    Submitted 23 December, 2021; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: 6 pages, 5 figures, NeurIPS 2021 Workshop on Distribution Shifts: connecting methods and applications (DistShift)

  21. arXiv:2111.01105  [pdf

    cs.CV cs.LG eess.IV

    FREGAN : an application of generative adversarial networks in enhancing the frame rate of videos

    Authors: Rishik Mishra, Neeraj Gupta, Nitya Shukla

    Abstract: A digital video is a collection of individual frames, while streaming the video the scene utilized the time slice for each frame. High refresh rate and high frame rate is the demand of all high technology applications. The action tracking in videos becomes easier and motion becomes smoother in gaming applications due to the high refresh rate. It provides a faster response because of less time in b… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

    ACM Class: I.2.1

  22. arXiv:2110.13303  [pdf, other

    cs.LG cs.AI cs.CE econ.EM

    Negotiating Networks in Oligopoly Markets for Price-Sensitive Products

    Authors: Naman Shukla, Kartik Yellepeddi

    Abstract: We present a novel framework to learn functions that estimate decisions of sellers and buyers simultaneously in an oligopoly market for a price-sensitive product. In this setting, the aim of the seller network is to come up with a price for a given context such that the expected revenue is maximized by considering the buyer's satisfaction as well. On the other hand, the aim of the buyer network is… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: 10 pages, 4 figures, NeurIPS 2021 Workshop on Learning in Presence of Strategic Behavior

  23. arXiv:2109.09897  [pdf

    cs.ET physics.app-ph

    An Oscillator-based MaxSAT solver

    Authors: Mohammad Khairul Bashar, Jaykumar Vaidya, Antik Mallick, R S Surya Kanthi, Shamiul Alam, Nazmul Amin, Chonghan Lee, Feng Shi, Ahmedullah Aziz, Vijaykrishnan Narayanan, Nikhil Shukla

    Abstract: The quest to solve hard combinatorial optimization problems efficiently -- still a longstanding challenge for traditional digital computers -- has inspired the exploration of many alternate computing models and platforms. As a case in point, oscillator networks offer a potentially promising energy efficient and scalable option. However, prior oscillator-based combinatorial optimization solvers hav… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

  24. arXiv:2108.10499  [pdf

    cs.ET eess.SY

    Creating Electronic Oscillator-based Ising Machines without External Injection Locking

    Authors: Jaykumar Vaidya, R S Surya Kanthi, Nikhil Shukla

    Abstract: Coupled electronic oscillators have recently been explored as a compact, integrated circuit- and room temperature operation- compatible hardware platform to design Ising machines. However, such implementations presently require the injection of an externally generated second-harmonic signal to impose the phase bipartition among the oscillators. In this work, we experimentally demonstrate a new ele… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: 18 pages, 5 figures

  25. arXiv:2107.11350  [pdf, other

    cs.LG cs.AI

    Heteroscedastic Temporal Variational Autoencoder For Irregularly Sampled Time Series

    Authors: Satya Narayan Shukla, Benjamin M. Marlin

    Abstract: Irregularly sampled time series commonly occur in several domains where they present a significant challenge to standard deep learning models. In this paper, we propose a new deep learning framework for probabilistic interpolation of irregularly sampled time series that we call the Heteroscedastic Temporal Variational Autoencoder (HeTVAE). HeTVAE includes a novel input layer to encode information… ▽ More

    Submitted 23 July, 2021; originally announced July 2021.

  26. arXiv:2107.07116  [pdf, other

    cs.NE cs.AI cs.LG

    Transformer-based Machine Learning for Fast SAT Solvers and Logic Synthesis

    Authors: Feng Shi, Chonghan Lee, Mohammad Khairul Bashar, Nikhil Shukla, Song-Chun Zhu, Vijaykrishnan Narayanan

    Abstract: CNF-based SAT and MaxSAT solvers are central to logic synthesis and verification systems. The increasing popularity of these constraint problems in electronic design automation encourages studies on different SAT problems and their properties for further computational efficiency. There has been both theoretical and practical success of modern Conflict-driven clause learning SAT solvers, which allo… ▽ More

    Submitted 15 July, 2021; originally announced July 2021.

  27. arXiv:2101.10318  [pdf, other

    cs.LG cs.AI

    Multi-Time Attention Networks for Irregularly Sampled Time Series

    Authors: Satya Narayan Shukla, Benjamin M. Marlin

    Abstract: Irregular sampling occurs in many time series modeling applications where it presents a significant challenge to standard deep learning models. This work is motivated by the analysis of physiological time series data in electronic health records, which are sparse, irregularly sampled, and multivariate. In this paper, we propose a new deep learning framework for this setting that we call Multi-Time… ▽ More

    Submitted 7 June, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

    Comments: Accepted at International Conference on Learning Representations (ICLR) 2021

  28. arXiv:2012.01355  [pdf

    cs.ET

    Using Noise to Augment Synchronization among Oscillators

    Authors: Jaykumar Vaidya, Mohammad Khairul Bashar, Nikhil Shukla

    Abstract: Noise is expected to play an important role in the dynamics of analog systems such as coupled oscillators which have recently been explored as a hardware platform for application in computing. In this work, we experimentally investigate the effect of noise on the synchronization of relaxation oscillators and their computational properties. Specifically, in contrast to its typically expected advers… ▽ More

    Submitted 10 January, 2021; v1 submitted 2 December, 2020; originally announced December 2020.

  29. arXiv:2012.00168  [pdf, other

    cs.LG stat.ML

    A Survey on Principles, Models and Methods for Learning from Irregularly Sampled Time Series

    Authors: Satya Narayan Shukla, Benjamin M. Marlin

    Abstract: Irregularly sampled time series data arise naturally in many application domains including biology, ecology, climate science, astronomy, and health. Such data represent fundamental challenges to many classical models from machine learning and statistics due to the presence of non-uniform intervals between observations. However, there has been significant progress within the machine learning commun… ▽ More

    Submitted 5 January, 2021; v1 submitted 30 November, 2020; originally announced December 2020.

    Comments: Presented at NeurIPS 2020 Workshop: ML Retrospectives, Surveys & Meta-Analyses (ML-RSA)

  30. arXiv:2010.04205  [pdf, other

    cs.LG

    Gaussian MRF Covariance Modeling for Efficient Black-Box Adversarial Attacks

    Authors: Anit Kumar Sahu, Satya Narayan Shukla, J. Zico Kolter

    Abstract: We study the problem of generating adversarial examples in a black-box setting, where we only have access to a zeroth order oracle, providing us with loss function evaluations. Although this setting has been investigated in previous work, most past approaches using zeroth order optimization implicitly assume that the gradients of the loss function with respect to the input images are \emph{unstruc… ▽ More

    Submitted 8 October, 2020; originally announced October 2020.

  31. arXiv:2008.04305  [pdf

    physics.app-ph cs.ET

    Experimental Demonstration of a Reconfigurable Coupled Oscillator Platform to Solve the Max-Cut Problem

    Authors: Mohammad Khairul Bashar, Antik Mallick, Daniel S Truesdell, Benton H. Calhoun, Siddharth Joshi, Nikhil Shukla

    Abstract: In this work, we experimentally demonstrate an integrated circuit (IC) of 30 relaxation oscillators with reconfigurable capacitive coupling to solve the NP-Hard Maximum Cut (Max-Cut) problem. We show that under the influence of an external second-harmonic injection signal, the oscillator phases exhibit a bi-partition which can be used to calculate a high quality approximate Max-Cut solution. Lever… ▽ More

    Submitted 12 October, 2020; v1 submitted 10 August, 2020; originally announced August 2020.

  32. arXiv:2007.07210  [pdf, other

    cs.LG stat.ML

    Simple and Efficient Hard Label Black-box Adversarial Attacks in Low Query Budget Regimes

    Authors: Satya Narayan Shukla, Anit Kumar Sahu, Devin Willmott, J. Zico Kolter

    Abstract: We focus on the problem of black-box adversarial attacks, where the aim is to generate adversarial examples for deep learning models solely based on information limited to output label~(hard label) to a queried data input. We propose a simple and efficient Bayesian Optimization~(BO) based approach for developing black-box adversarial attacks. Issues with BO's performance in high dimensions are avo… ▽ More

    Submitted 11 June, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: Accepted at KDD 2021. arXiv admin note: substantial text overlap with arXiv:1909.13857

  33. arXiv:2003.11059  [pdf, other

    cs.LG cs.CY stat.ML

    Integrating Physiological Time Series and Clinical Notes with Deep Learning for Improved ICU Mortality Prediction

    Authors: Satya Narayan Shukla, Benjamin M. Marlin

    Abstract: Intensive Care Unit Electronic Health Records (ICU EHRs) store multimodal data about patients including clinical notes, sparse and irregularly sampled physiological time series, lab results, and more. To date, most methods designed to learn predictive models from ICU EHR data have focused on a single modality. In this paper, we leverage the recently proposed interpolation-prediction deep learning… ▽ More

    Submitted 18 March, 2021; v1 submitted 24 March, 2020; originally announced March 2020.

    Comments: Presented at ACM Conference on Health, Inference and Learning (Workshop Track), 2020

  34. arXiv:2002.02842  [pdf, other

    cs.LG stat.ML

    Assessing the Adversarial Robustness of Monte Carlo and Distillation Methods for Deep Bayesian Neural Network Classification

    Authors: Meet P. Vadera, Satya Narayan Shukla, Brian Jalaian, Benjamin M. Marlin

    Abstract: In this paper, we consider the problem of assessing the adversarial robustness of deep neural network models under both Markov chain Monte Carlo (MCMC) and Bayesian Dark Knowledge (BDK) inference approximations. We characterize the robustness of each method to two types of adversarial attacks: the fast gradient sign method (FGSM) and projected gradient descent (PGD). We show that full MCMC-based i… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: Presented at SafeAI Workshop, AAAI 2020

  35. arXiv:1909.13857  [pdf, other

    cs.LG stat.ML

    Black-box Adversarial Attacks with Bayesian Optimization

    Authors: Satya Narayan Shukla, Anit Kumar Sahu, Devin Willmott, J. Zico Kolter

    Abstract: We focus on the problem of black-box adversarial attacks, where the aim is to generate adversarial examples using information limited to loss function evaluations of input-output pairs. We use Bayesian optimization~(BO) to specifically cater to scenarios involving low query budgets to develop query efficient adversarial attacks. We alleviate the issues surrounding BO in regards to optimizing high… ▽ More

    Submitted 30 September, 2019; originally announced September 2019.

  36. arXiv:1909.10662  [pdf, other

    cs.LG

    How to Incorporate Monotonicity in Deep Networks While Preserving Flexibility?

    Authors: Akhil Gupta, Naman Shukla, Lavanya Marla, Arinbjörn Kolbeinsson, Kartik Yellepeddi

    Abstract: The importance of domain knowledge in enhancing model performance and making reliable predictions in the real-world is critical. This has led to an increased focus on specific model properties for interpretability. We focus on incorporating monotonic trends, and propose a novel gradient-based point-wise loss function for enforcing partial monotonicity with deep neural networks. While recent develo… ▽ More

    Submitted 2 December, 2019; v1 submitted 23 September, 2019; originally announced September 2019.

    Comments: 8 pages, 5 figures. NeurIPS 2019 Workshop on Machine Learning with Guarantees

  37. arXiv:1909.07782  [pdf, other

    cs.LG stat.ML

    Interpolation-Prediction Networks for Irregularly Sampled Time Series

    Authors: Satya Narayan Shukla, Benjamin M. Marlin

    Abstract: In this paper, we present a new deep learning architecture for addressing the problem of supervised learning with sparse and irregularly sampled multivariate time series. The architecture is based on the use of a semi-parametric interpolation network followed by the application of a prediction network. The interpolation network allows for information to be shared across multiple dimensions of a mu… ▽ More

    Submitted 13 September, 2019; originally announced September 2019.

    Comments: International Conference on Learning Representations. arXiv admin note: substantial text overlap with arXiv:1812.00531

  38. arXiv:1905.08874  [pdf, other

    cs.LG stat.ML

    Adaptive Model Selection Framework: An Application to Airline Pricing

    Authors: Naman Shukla, Arinbjörn Kolbeinsson, Lavanya Marla, Kartik Yellepeddi

    Abstract: Multiple machine learning and prediction models are often used for the same prediction or recommendation task. In our recent work, where we develop and deploy airline ancillary pricing models in an online setting, we found that among multiple pricing models developed, no one model clearly dominates other models for all incoming customer requests. Thus, as algorithm designers, we face an exploratio… ▽ More

    Submitted 21 May, 2019; originally announced May 2019.

  39. arXiv:1902.02236  [pdf, other

    stat.ML cs.CY cs.LG

    Dynamic Pricing for Airline Ancillaries with Customer Context

    Authors: Naman Shukla, Arinbjörn Kolbeinsson, Ken Otwell, Lavanya Marla, Kartik Yellepeddi

    Abstract: Ancillaries have become a major source of revenue and profitability in the travel industry. Yet, conventional pricing strategies are based on business rules that are poorly optimized and do not respond to changing market conditions. This paper describes the dynamic pricing model developed by Deepair solutions, an AI technology provider for travel suppliers. We present a pricing model that provides… ▽ More

    Submitted 6 February, 2019; originally announced February 2019.

  40. arXiv:1812.00531  [pdf, other

    cs.LG stat.ML

    Modeling Irregularly Sampled Clinical Time Series

    Authors: Satya Narayan Shukla, Benjamin M. Marlin

    Abstract: While the volume of electronic health records (EHR) data continues to grow, it remains rare for hospital systems to capture dense physiological data streams, even in the data-rich intensive care unit setting. Instead, typical EHR records consist of sparse and irregularly observed multivariate time series, which are well understood to present particularly challenging problems for machine learning m… ▽ More

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:cs/0101200

    Report number: ML4H/2018/111

  41. arXiv:1609.02079  [pdf, other

    cs.ET cond-mat.other math.DS

    Vertex coloring of graphs via phase dynamics of coupled oscillatory networks

    Authors: Abhinav Parihar, Nikhil Shukla, Matthew Jerry, Suman Datta, Arijit Raychowdhury

    Abstract: While Boolean logic has been the backbone of digital information processing, there are classes of computationally hard problems wherein this conventional paradigm is fundamentally inefficient. Vertex coloring of graphs, belonging to the class of combinatorial optimization represents such a problem; and is well studied for its wide spectrum of applications in data sciences, life sciences, social sc… ▽ More

    Submitted 16 March, 2017; v1 submitted 7 September, 2016; originally announced September 2016.

    Journal ref: Scientific Reports 7 (2017) 911

  42. arXiv:1608.05648  [pdf, other

    cs.ET cond-mat.mes-hall math.DS

    Computing with Dynamical Systems Based on Insulator-Metal-Transition Oscillators

    Authors: Abhinav Parihar, Nikhil Shukla, Matthew Jerry, Suman Datta, Arijit Raychowdhury

    Abstract: In this paper we review recent work on novel computing paradigms using coupled oscillatory dynamical systems. We explore systems of relaxation oscillators based on linear state transitioning devices, which switch between two discrete states with hysteresis. By harnessing the dynamics of complex, connected systems we embrace the philosophy of "let physics do the computing" and demonstrate how compl… ▽ More

    Submitted 19 August, 2016; originally announced August 2016.

    Comments: Submitted to Journal of Nanophotonics for review

  43. arXiv:1503.05085  [pdf, ps, other

    quant-ph cs.IT math-ph

    Stronger Error Disturbance Relations for Incompatible Quantum Measurements

    Authors: Chiranjib Mukhopadhyay, Namrata Shukla, Arun Kumar Pati

    Abstract: We formulate a new error-disturbance relation, which is free from explicit dependence upon variances in observables. This error-disturbance relation shows improvement over the one provided by the Branciard inequality and the Ozawa inequality for some initial states and for particular class of joint measurements under consideration. We also prove a modified form of Ozawa's error-disturbance relatio… ▽ More

    Submitted 13 December, 2016; v1 submitted 17 March, 2015; originally announced March 2015.

    Comments: 5+pages, 3 figures

    Journal ref: Europhysics Letters 113 50002 (2016)