Zum Hauptinhalt springen

Showing 1–50 of 71 results for author: Vig, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14504  [pdf, other

    cs.CL

    Translating Across Cultures: LLMs for Intralingual Cultural Adaptation

    Authors: Pushpdeep Singh, Mayur Patidar, Lovekesh Vig

    Abstract: LLMs are increasingly being deployed for multilingual applications and have demonstrated impressive translation capabilities between several low and high resource languages. An aspect of translation that often gets overlooked is that of cultural adaptation, or modifying source culture references to suit the target culture. Cultural adaptation has applications across several creative industries and… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2405.12842  [pdf, other

    cs.RO cs.CV

    SmartFlow: Robotic Process Automation using LLMs

    Authors: Arushi Jain, Shubham Paliwal, Monika Sharma, Lovekesh Vig, Gautam Shroff

    Abstract: Robotic Process Automation (RPA) systems face challenges in handling complex processes and diverse screen layouts that require advanced human-like decision-making capabilities. These systems typically rely on pixel-level encoding through drag-and-drop or automation frameworks such as Selenium to create navigation workflows, rather than visual understanding of screen elements. In this context, we p… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 32nd ACM International Conference on Information and Knowledge Management

  3. arXiv:2405.12742  [pdf, other

    cs.CV

    Multi-Subject Personalization

    Authors: Arushi Jain, Shubham Paliwal, Monika Sharma, Vikram Jamwal, Lovekesh Vig

    Abstract: Creative story illustration requires a consistent interplay of multiple characters or objects. However, conventional text-to-image models face significant challenges while producing images featuring multiple personalized subjects. For example, they distort the subject rendering, or the text descriptions fail to render coherent subject interactions. We present Multi-Subject Personalization (MSP) to… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 2023 Conference on Neural Information Processing Systems

  4. arXiv:2405.12531  [pdf, other

    cs.CV cs.LG

    CustomText: Customized Textual Image Generation using Diffusion Models

    Authors: Shubham Paliwal, Arushi Jain, Monika Sharma, Vikram Jamwal, Lovekesh Vig

    Abstract: Textual image generation spans diverse fields like advertising, education, product packaging, social media, information visualization, and branding. Despite recent strides in language-guided image synthesis using diffusion models, current models excel in image generation but struggle with accurate text rendering and offer limited control over font attributes. In this paper, we aim to enhance the s… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: Accepted by AI for Content Creation (AI4CC) workshop at CVPR 2024

  5. arXiv:2403.04382  [pdf, other

    cs.CL cs.AI

    Acceleron: A Tool to Accelerate Research Ideation

    Authors: Harshit Nigam, Manasi Patwardhan, Lovekesh Vig, Gautam Shroff

    Abstract: Several tools have recently been proposed for assisting researchers during various stages of the research life-cycle. However, these primarily concentrate on tasks such as retrieving and recommending relevant literature, reviewing and critiquing the draft, and writing of research manuscripts. Our investigation reveals a significant gap in availability of tools specifically designed to assist resea… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted at AI2ASE Workshop at AAAI'24 Conference. 13 Pages and 4 Figures

  6. arXiv:2311.13885  [pdf, other

    cs.LG cs.AI math.AP

    Can Physics Informed Neural Operators Self Improve?

    Authors: Ritam Majumdar, Amey Varhade, Shirish Karande, Lovekesh Vig

    Abstract: Self-training techniques have shown remarkable value across many deep learning models and tasks. However, such techniques remain largely unexplored when considered in the context of learning fast solvers for systems of partial differential equations (Eg: Neural Operators). In this work, we explore the use of self-training for Fourier Neural Operators (FNO). Neural Operators emerged as a data drive… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: Paper accepted as a Spotlight talk at Symbiosis of Deep Learning and Differential Equations, Neural Information Processing Systems 2023

  7. arXiv:2308.09293  [pdf, other

    cs.LG cs.AI cs.CE math.AP

    How important are specialized transforms in Neural Operators?

    Authors: Ritam Majumdar, Shirish Karande, Lovekesh Vig

    Abstract: Simulating physical systems using Partial Differential Equations (PDEs) has become an indispensible part of modern industrial process optimization. Traditionally, numerical solvers have been used to solve the associated PDEs, however recently Transform-based Neural Operators such as the Fourier Neural Operator and Wavelet Neural Operator have received a lot of attention for their potential to prov… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 8 pages, 3 figures, 4 tables

  8. arXiv:2308.09290  [pdf, other

    cs.LG cs.AI cs.CE math.AP

    HyperLoRA for PDEs

    Authors: Ritam Majumdar, Vishal Jadhav, Anirudh Deodhar, Shirish Karande, Lovekesh Vig, Venkataramana Runkana

    Abstract: Physics-informed neural networks (PINNs) have been widely used to develop neural surrogates for solutions of Partial Differential Equations. A drawback of PINNs is that they have to be retrained with every change in initial-boundary conditions and PDE coefficients. The Hypernetwork, a model-based meta learning technique, takes in a parameterized task embedding as input and predicts the weights of… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 8 pages, 4 figures, 3 Tables

  9. arXiv:2308.02582  [pdf, other

    cs.CL cs.AI cs.LG

    Adapt and Decompose: Efficient Generalization of Text-to-SQL via Domain Adapted Least-To-Most Prompting

    Authors: Aseem Arora, Shabbirhussain Bhaisaheb, Harshit Nigam, Manasi Patwardhan, Lovekesh Vig, Gautam Shroff

    Abstract: Cross-domain and cross-compositional generalization of Text-to-SQL semantic parsing is a challenging task. Existing Large Language Model (LLM) based solutions rely on inference-time retrieval of few-shot exemplars from the training set to synthesize a run-time prompt for each Natural Language (NL) test query. In contrast, we devise an algorithm which performs offline sampling of a minimal set-of f… ▽ More

    Submitted 9 August, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: 22 Pages

  10. arXiv:2304.13350  [pdf, other

    cs.AI cs.SE

    Neuro-symbolic Zero-Shot Code Cloning with Cross-Language Intermediate Representation

    Authors: Krishnam Hasija, Shrishti Pradhan, Manasi Patwardhan, Raveendra Kumar Medicherla, Lovekesh Vig, Ravindra Naik

    Abstract: In this paper, we define a neuro-symbolic approach to address the task of finding semantically similar clones for the codes of the legacy programming language COBOL, without training data. We define a meta-model that is instantiated to have an Intermediate Representation (IR) in the form of Abstract Syntax Trees (ASTs) common across codes in C and COBOL. We linearize the IRs using Structure Based… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: 10 pages, 4 tables, 2 figures

  11. arXiv:2303.14194  [pdf, other

    cs.LG cs.AI q-bio.QM

    DeepEpiSolver: Unravelling Inverse problems in Covid, HIV, Ebola and Disease Transmission

    Authors: Ritam Majumdar, Shirish Karande, Lovekesh Vig

    Abstract: The spread of many infectious diseases is modeled using variants of the SIR compartmental model, which is a coupled differential equation. The coefficients of the SIR model determine the spread trajectories of disease, on whose basis proactive measures can be taken. Hence, the coefficient estimates must be both fast and accurate. Shaier et al. in the paper "Disease Informed Neural Networks" used P… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: Publication accepted at International Conference for Learning Representations 2023: First Workshop in Machine Learning and Global Health

  12. arXiv:2303.09062  [pdf, other

    cs.SE cs.AI cs.LG

    Knowledge Transfer for Pseudo-code Generation from Low Resource Programming Language

    Authors: Ankita Sontakke, Kanika Kalra, Manasi Patwardhan, Lovekesh Vig, Raveendra Kumar Medicherla, Ravindra Naik, Shrishti Pradhan

    Abstract: Generation of pseudo-code descriptions of legacy source code for software maintenance is a manually intensive task. Recent encoder-decoder language models have shown promise for automating pseudo-code generation for high resource programming languages such as C++, but are heavily reliant on the availability of a large code-pseudocode corpus. Soliciting such pseudocode annotations for codes written… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 11 pages, 1 figure, 5 tables

    ACM Class: I.2.2; I.2.7

  13. arXiv:2303.07009  [pdf, other

    cs.LG cs.AI math.AP

    Symbolic Regression for PDEs using Pruned Differentiable Programs

    Authors: Ritam Majumdar, Vishal Jadhav, Anirudh Deodhar, Shirish Karande, Lovekesh Vig, Venkataramana Runkana

    Abstract: Physics-informed Neural Networks (PINNs) have been widely used to obtain accurate neural surrogates for a system of Partial Differential Equations (PDE). One of the major limitations of PINNs is that the neural solutions are challenging to interpret, and are often treated as black-box solvers. While Symbolic Regression (SR) has been studied extensively, very few works exist which generate analytic… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: Publication accepted at International Conference for Learning Representations 2023: Physics for Machine Learning

  14. IKD+: Reliable Low Complexity Deep Models For Retinopathy Classification

    Authors: Shreyas Bhat Brahmavar, Rohit Rajesh, Tirtharaj Dash, Lovekesh Vig, Tanmay Tulsidas Verlekar, Md Mahmudul Hasan, Tariq Khan, Erik Meijering, Ashwin Srinivasan

    Abstract: Deep neural network (DNN) models for retinopathy have estimated predictive accuracies in the mid-to-high 90%. However, the following aspects remain unaddressed: State-of-the-art models are complex and require substantial computational infrastructure to train and deploy; The reliability of predictions can vary widely. In this paper, we focus on these aspects and propose a form of iterative knowledg… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: Submitted to IEEE International Conference on Image Processing (ICIP 2023)

    Journal ref: IEEE International Conference on Image Processing (ICIP 2023)

  15. Domain-Specific Pre-training Improves Confidence in Whole Slide Image Classification

    Authors: Soham Rohit Chitnis, Sidong Liu, Tirtharaj Dash, Tanmay Tulsidas Verlekar, Antonio Di Ieva, Shlomo Berkovsky, Lovekesh Vig, Ashwin Srinivasan

    Abstract: Whole Slide Images (WSIs) or histopathology images are used in digital pathology. WSIs pose great challenges to deep learning models for clinical diagnosis, owing to their size and lack of pixel-level annotations. With the recent advancements in computational pathology, newer multiple-instance learning-based models have been proposed. Multiple-instance learning for WSIs necessitates creating patch… ▽ More

    Submitted 3 May, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: Accepted in EMBC 2023

    Journal ref: Annu Int Conf IEEE Eng Med Biol Soc (EMBC 2023)

  16. arXiv:2302.08996  [pdf, other

    cs.AI cs.LG cs.LO

    Neuro-symbolic Meta Reinforcement Learning for Trading

    Authors: S I Harini, Gautam Shroff, Ashwin Srinivasan, Prayushi Faldu, Lovekesh Vig

    Abstract: We model short-duration (e.g. day) trading in financial markets as a sequential decision-making problem under uncertainty, with the added complication of continual concept-drift. We, therefore, employ meta reinforcement learning via the RL2 algorithm. It is also known that human traders often rely on frequently occurring symbolic patterns in price series. We employ logical program induction to dis… ▽ More

    Submitted 15 January, 2023; originally announced February 2023.

    Comments: To appear in Muffin@AAAI'23

  17. arXiv:2212.10189  [pdf, other

    cs.CL cs.AI

    Do I have the Knowledge to Answer? Investigating Answerability of Knowledge Base Questions

    Authors: Mayur Patidar, Prayushi Faldu, Avinash Singh, Lovekesh Vig, Indrajit Bhattacharya, Mausam

    Abstract: When answering natural language questions over knowledge bases, missing facts, incomplete schema and limited scope naturally lead to many questions being unanswerable. While answerability has been explored in other QA settings, it has not been studied for QA over knowledge bases (KBQA). We create GrailQAbility, a new benchmark KBQA dataset with unanswerability, by first identifying various forms o… ▽ More

    Submitted 24 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  18. arXiv:2212.10032  [pdf, other

    cs.LG cs.AI cs.CE

    Real-time Health Monitoring of Heat Exchangers using Hypernetworks and PINNs

    Authors: Ritam Majumdar, Vishal Jadhav, Anirudh Deodhar, Shirish Karande, Lovekesh Vig, Venkataramana Runkana

    Abstract: We demonstrate a Physics-informed Neural Network (PINN) based model for real-time health monitoring of a heat exchanger, that plays a critical role in improving energy efficiency of thermal power plants. A hypernetwork based approach is used to enable the domain-decomposed PINN learn the thermal behavior of the heat exchanger in response to dynamic boundary conditions, eliminating the need to re-t… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: Neural Information Processing Systems 2022: The Machine Learning and the Physical Sciences workshop

  19. arXiv:2212.10005  [pdf, other

    cs.LG cs.CV

    Calibrating Deep Neural Networks using Explicit Regularisation and Dynamic Data Pruning

    Authors: Ramya Hebbalaguppe, Rishabh Patra, Tirtharaj Dash, Gautam Shroff, Lovekesh Vig

    Abstract: Deep neural networks (DNN) are prone to miscalibrated predictions, often exhibiting a mismatch between the predicted output and the associated confidence scores. Contemporary model calibration techniques mitigate the problem of overconfident predictions by pushing down the confidence of the winning class while increasing the confidence of the remaining classes across all test samples. However, fro… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: The paper is accepted at Winter Conference on applications of Computer Vision (IEEE WACV) in algorithms tracks. 8 pages Main paper; 3 pages supplementary material

  20. arXiv:2211.16047  [pdf, other

    cs.AI cs.LG cs.LO

    Neural Feature-Adaptation for Symbolic Predictions Using Pre-Training and Semantic Loss

    Authors: Vedant Shah, Aditya Agrawal, Lovekesh Vig, Ashwin Srinivasan, Gautam Shroff, Tanmay Verlekar

    Abstract: We are interested in neurosymbolic systems consisting of a high-level symbolic layer for explainable prediction in terms of human-intelligible concepts; and a low-level neural layer for extracting symbols required to generate the symbolic explanation. Real data is often imperfect meaning that even if the symbolic theory remains unchanged, we may still need to address the problem of mapping raw dat… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  21. arXiv:2209.08750  [pdf, other

    cs.AI cs.LG

    Knowledge-based Analogical Reasoning in Neuro-symbolic Latent Spaces

    Authors: Vishwa Shah, Aditya Sharma, Gautam Shroff, Lovekesh Vig, Tirtharaj Dash, Ashwin Srinivasan

    Abstract: Analogical Reasoning problems challenge both connectionist and symbolic AI systems as these entail a combination of background knowledge, reasoning and pattern recognition. While symbolic systems ingest explicit domain knowledge and perform deductive reasoning, they are sensitive to noise and require inputs be mapped to preset symbolic features. Connectionist systems on the other hand can directly… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 13 pages, 4 figures, Accepted at 16th International Workshop on Neural-Symbolic Learning and Reasoning as part of the 2nd International Joint Conference on Learning & Reasoning (IJCLR 2022)

  22. arXiv:2207.06240  [pdf, ps, other

    cs.LG cs.AI math.NA

    Physics Informed Symbolic Networks

    Authors: Ritam Majumdar, Vishal Jadhav, Anirudh Deodhar, Shirish Karande, Lovekesh Vig, Venkataramana Runkana

    Abstract: We introduce Physics Informed Symbolic Networks (PISN) which utilize physics-informed loss to obtain a symbolic solution for a system of Partial Differential Equations (PDE). Given a context-free grammar to describe the language of symbolic expressions, we propose to use weighted sum as continuous approximation for selection of a production rule. We use this approximation to define multilayer symb… ▽ More

    Submitted 20 December, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: Neural Information Processing Systems 2022: The Symbiosis of Deep Learning and Differential Equations Workshop

  23. arXiv:2203.16074  [pdf, other

    cs.CV

    An Efficient Anchor-free Universal Lesion Detection in CT-scans

    Authors: Manu Sheoran, Meghal Dani, Monika Sharma, Lovekesh Vig

    Abstract: Existing universal lesion detection (ULD) methods utilize compute-intensive anchor-based architectures which rely on predefined anchor boxes, resulting in unsatisfactory detection performance, especially in small and mid-sized lesions. Further, these default fixed anchor-sizes and ratios do not generalize well to different datasets. Therefore, we propose a robust one-stage anchor-free lesion detec… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: 4 Pages, 2 figures, 2 tables. Paper accepted at IEEE International Symposium on Biomedical Imaging (ISBI'22)

    Journal ref: IEEE International Symposium on Biomedical Imaging (ISBI) 2022

  24. arXiv:2203.06886  [pdf, other

    cs.CV

    DKMA-ULD: Domain Knowledge augmented Multi-head Attention based Robust Universal Lesion Detection

    Authors: Manu Sheoran, Meghal Dani, Monika Sharma, Lovekesh Vig

    Abstract: Incorporating data-specific domain knowledge in deep networks explicitly can provide important cues beneficial for lesion detection and can mitigate the need for diverse heterogeneous datasets for learning robust detectors. In this paper, we exploit the domain information present in computed tomography (CT) scans and propose a robust universal lesion detection (ULD) network that can detect lesions… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: Main Paper: 13 Pages, 5 Figures, 2 Tables. Supplementary: 4 Pages, 1 Figure, 3 Tables. Paper accepted at The 32nd British Machine Vision Conference (BMVC'21)

    Journal ref: British Machine Vision Conference (BMVC) 2021

  25. arXiv:2203.06873  [pdf, other

    cs.CV

    TSR-DSAW: Table Structure Recognition via Deep Spatial Association of Words

    Authors: Arushi Jain, Shubham Paliwal, Monika Sharma, Lovekesh Vig

    Abstract: Existing methods for Table Structure Recognition (TSR) from camera-captured or scanned documents perform poorly on complex tables consisting of nested rows / columns, multi-line texts and missing cell data. This is because current data-driven methods work by simply training deep models on large volumes of data and fail to generalize when an unseen table structure is encountered. In this paper, we… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: 6 pages, 1 figure, 1 table, ESANN 2021 proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. Online event, 6-8 October 2021, i6doc.com publ., ISBN 978287587082-7

    Journal ref: In ESANN 2021 proceedings, pages 257-262

  26. arXiv:2203.06852  [pdf, other

    cs.LG

    Continual Learning for Multivariate Time Series Tasks with Variable Input Dimensions

    Authors: Vibhor Gupta, Jyoti Narwariya, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

    Abstract: We consider a sequence of related multivariate time series learning tasks, such as predicting failures for different instances of a machine from time series of multi-sensor data, or activity recognition tasks over different individuals from multiple wearable sensors. We focus on two under-explored practical challenges arising in such settings: (i) Each task may have a different subset of sensors,… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: Accepted at ICDM 2021

  27. arXiv:2202.12578  [pdf, other

    cs.LG

    Learning to Liquidate Forex: Optimal Stopping via Adaptive Top-K Regression

    Authors: Diksha Garg, Pankaj Malhotra, Anil Bhatia, Sanjay Bhat, Lovekesh Vig, Gautam Shroff

    Abstract: We consider learning a trading agent acting on behalf of the treasury of a firm earning revenue in a foreign currency (FC) and incurring expenses in the home currency (HC). The goal of the agent is to maximize the expected HC at the end of the trading episode by deciding to hold or sell the FC at each time step in the trading episode. We pose this as an optimization problem, and consider a broad s… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

    Comments: Published at Workshop on AI in Financial Services: Adaptiveness, Resilience & Governance, AAAI-22

  28. arXiv:2202.05517  [pdf, other

    cs.LG

    Electricity Consumption Forecasting for Out-of-distribution Time-of-Use Tariffs

    Authors: Jyoti Narwariya, Chetan Verma, Pankaj Malhotra, Lovekesh Vig, Easwara Subramanian, Sanjay Bhat

    Abstract: In electricity markets, retailers or brokers want to maximize profits by allocating tariff profiles to end consumers. One of the objectives of such demand response management is to incentivize the consumers to adjust their consumption so that the overall electricity procurement in the wholesale markets is minimized, e.g. it is desirable that consumers consume less during peak hours when cost of pr… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: Accepted paper at AAAI workshop AIBSD 2022

  29. arXiv:2201.08137  [pdf, other

    cs.LG cs.AI

    DRTCI: Learning Disentangled Representations for Temporal Causal Inference

    Authors: Garima Gupta, Lovekesh Vig, Gautam Shroff

    Abstract: Medical professionals evaluating alternative treatment plans for a patient often encounter time varying confounders, or covariates that affect both the future treatment assignment and the patient outcome. The recently proposed Counterfactual Recurrent Network (CRN) accounts for time varying confounders by using adversarial training to balance recurrent historical representations of patient data. H… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

    Comments: Accepted in Workshop on "The Neglected Assumptions in Causal Inference" at ICML 2021 (July)

  30. arXiv:2111.10361  [pdf, other

    cs.LG cs.AI

    Solving Visual Analogies Using Neural Algorithmic Reasoning

    Authors: Atharv Sonwane, Gautam Shroff, Lovekesh Vig, Ashwin Srinivasan, Tirtharaj Dash

    Abstract: We consider a class of visual analogical reasoning problems that involve discovering the sequence of transformations by which pairs of input/output images are related, so as to analogously transform future inputs. This program synthesis task can be easily solved via symbolic search. Using a variation of the `neural analogical reasoning' approach of (Velickovic and Blundell 2021), we instead search… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: 20 pages. Contains extended abstract accepted at the AAAI-22 Student Abstract and Poster Program along with relevent supplementary material

  31. arXiv:2111.00506  [pdf, other

    cs.AI

    PnPOOD : Out-Of-Distribution Detection for Text Classification via Plug andPlay Data Augmentation

    Authors: Mrinal Rawat, Ramya Hebbalaguppe, Lovekesh Vig

    Abstract: While Out-of-distribution (OOD) detection has been well explored in computer vision, there have been relatively few prior attempts in OOD detection for NLP classification. In this paper we argue that these prior attempts do not fully address the OOD problem and may suffer from data leakage and poor calibration of the resulting models. We present PnPOOD, a data augmentation technique to perform OOD… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

    Report number: Accepted in Uncertainty in Deep Learning, ICML'21

  32. arXiv:2110.09947  [pdf, other

    cs.LG cs.AI cs.PL

    Using Program Synthesis and Inductive Logic Programming to solve Bongard Problems

    Authors: Atharv Sonwane, Sharad Chitlangia, Tirtharaj Dash, Lovekesh Vig, Gautam Shroff, Ashwin Srinivasan

    Abstract: The ability to recognise and make analogies is often used as a measure or test of human intelligence. The ability to solve Bongard problems is an example of such a test. It has also been postulated that the ability to rapidly construct novel abstractions is critical to being able to solve analogical problems. Given an image, the ability to construct a program that would generate that image is one… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: Equal contribution from first two authors. Accepted at the 10th International Workshop on Approaches and Applications of Inductive Programming as a Work In Progress Report

  33. arXiv:2109.03849  [pdf, other

    cs.CV

    OSSR-PID: One-Shot Symbol Recognition in P&ID Sheets using Path Sampling and GCN

    Authors: Shubham Paliwal, Monika Sharma, Lovekesh Vig

    Abstract: Piping and Instrumentation Diagrams (P&ID) are ubiquitous in several manufacturing, oil and gas enterprises for representing engineering schematics and equipment layout. There is an urgent need to extract and digitize information from P&IDs without the cost of annotating a varying set of symbols for each new use case. A robust one-shot learning approach for symbol recognition i.e., localization fo… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

    Journal ref: International Joint Conference on Neural Network (IJCNN), 2021

  34. Digitize-PID: Automatic Digitization of Piping and Instrumentation Diagrams

    Authors: Shubham Paliwal, Arushi Jain, Monika Sharma, Lovekesh Vig

    Abstract: Digitization of scanned Piping and Instrumentation diagrams(P&ID), widely used in manufacturing or mechanical industries such as oil and gas over several decades, has become a critical bottleneck in dynamic inventory management and creation of smart P&IDs that are compatible with the latest CAD tools. Historically, P&ID sheets have been manually generated at the design stage, before being scanned… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: 13 pages

    Journal ref: Trends and Applications in Knowledge Discovery and Data Mining. 168-180, PAKDD 2021

  35. arXiv:2012.11403  [pdf, other

    cs.LG

    CAMTA: Causal Attention Model for Multi-touch Attribution

    Authors: Sachin Kumar, Garima Gupta, Ranjitha Prasad, Arnab Chatterjee, Lovekesh Vig, Gautam Shroff

    Abstract: Advertising channels have evolved from conventional print media, billboards and radio advertising to online digital advertising (ad), where the users are exposed to a sequence of ad campaigns via social networks, display ads, search etc. While advertisers revisit the design of ad campaigns to concurrently serve the requirements emerging out of new ad channels, it is also critical for advertisers t… ▽ More

    Submitted 16 February, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: Accepted in ICDMW 2020

  36. arXiv:2012.10787  [pdf, other

    eess.IV cs.CV cs.LG

    Constructing and Evaluating an Explainable Model for COVID-19 Diagnosis from Chest X-rays

    Authors: Rishab Khincha, Soundarya Krishnan, Tirtharaj Dash, Lovekesh Vig, Ashwin Srinivasan

    Abstract: In this paper, our focus is on constructing models to assist a clinician in the diagnosis of COVID-19 patients in situations where it is easier and cheaper to obtain X-ray data than to obtain high-quality images like those from CT scans. Deep neural networks have repeatedly been shown to be capable of constructing highly predictive models for disease detection directly from image data. However, th… ▽ More

    Submitted 12 February, 2021; v1 submitted 19 December, 2020; originally announced December 2020.

  37. arXiv:2012.08984  [pdf, other

    cs.LG cs.IR

    Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation

    Authors: Diksha Garg, Priyanka Gupta, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

    Abstract: Most of the existing deep reinforcement learning (RL) approaches for session-based recommendations either rely on costly online interactions with real users, or rely on potentially biased rule-based or data-driven user-behavior models for learning. In this work, we instead focus on learning recommendation policies in the pure batch or offline setting, i.e. learning policies solely from offline his… ▽ More

    Submitted 16 December, 2020; originally announced December 2020.

    Comments: Presented at Offline Reinforcement Learning Workshop at Neural Information Processing Systems, 2020

  38. Incorporating Symbolic Domain Knowledge into Graph Neural Networks

    Authors: Tirtharaj Dash, Ashwin Srinivasan, Lovekesh Vig

    Abstract: Our interest is in scientific problems with the following characteristics: (1) Data are naturally represented as graphs; (2) The amount of data available is typically small; and (3) There is significant domain-knowledge, usually expressed in some symbolic form. These kinds of problems have been addressed effectively in the past by Inductive Logic Programming (ILP), by virtue of 2 important charact… ▽ More

    Submitted 19 February, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted in Machine Learning Journal (MLJ)

    Journal ref: Mach Learn 110, 1609-1636 (2021)

  39. arXiv:2008.09858  [pdf, other

    stat.ME cs.AI cs.LG

    Hi-CI: Deep Causal Inference in High Dimensions

    Authors: Ankit Sharma, Garima Gupta, Ranjitha Prasad, Arnab Chatterjee, Lovekesh Vig, Gautam Shroff

    Abstract: We address the problem of counterfactual regression using causal inference (CI) in observational studies consisting of high dimensional covariates and high cardinality treatments. Confounding bias, which leads to inaccurate treatment effect estimation, is attributed to covariates that affect both treatments and outcome. The presence of high-dimensional co-variates exacerbates the impact of bias as… ▽ More

    Submitted 9 April, 2021; v1 submitted 22 August, 2020; originally announced August 2020.

    Comments: 23 pages, 5 figures, Accepted in Causal Discovery Workshop - KDD 2020

  40. arXiv:2007.00411  [pdf, other

    cs.LG eess.SP stat.ML

    Handling Variable-Dimensional Time Series with Graph Neural Networks

    Authors: Vibhor Gupta, Jyoti Narwariya, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

    Abstract: Several applications of Internet of Things (IoT) technology involve capturing data from multiple sensors resulting in multi-sensor time series. Existing neural networks based approaches for such multi-sensor or multivariate time series modeling assume fixed input dimension or number of sensors. Such approaches can struggle in the practical setting where different instances of the same device or eq… ▽ More

    Submitted 20 July, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: Accepted at AI4IoT@IJCAI'20 workshop

  41. arXiv:2006.16556  [pdf, other

    cs.LG stat.ML

    Graph Neural Networks for Leveraging Industrial Equipment Structure: An application to Remaining Useful Life Estimation

    Authors: Jyoti Narwariya, Pankaj Malhotra, Vishnu TV, Lovekesh Vig, Gautam Shroff

    Abstract: Automated equipment health monitoring from streaming multisensor time-series data can be used to enable condition-based maintenance, avoid sudden catastrophic failures, and ensure high operational availability. We note that most complex machinery has a well-documented and readily accessible underlying structure capturing the inter-dependencies between sub-systems or modules. Deep learning models s… ▽ More

    Submitted 30 June, 2020; originally announced June 2020.

    Comments: Accepted at AAAI workshop DLGMA'20

  42. arXiv:2004.13446  [pdf, ps, other

    stat.ME cs.LG cs.MA

    MultiMBNN: Matched and Balanced Causal Inference with Neural Networks

    Authors: Ankit Sharma, Garima Gupta, Ranjitha Prasad, Arnab Chatterjee, Lovekesh Vig, Gautam Shroff

    Abstract: Causal inference (CI) in observational studies has received a lot of attention in healthcare, education, ad attribution, policy evaluation, etc. Confounding is a typical hazard, where the context affects both, the treatment assignment and response. In a multiple treatment scenario, we propose the neural network based MultiMBNN, where we overcome confounding by employing generalized propensity scor… ▽ More

    Submitted 14 August, 2021; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: 7 pages, 3 figures, Accepted in ESANN 2020

  43. arXiv:2001.01469  [pdf, other

    cs.CV cs.LG eess.IV

    TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images

    Authors: Shubham Paliwal, Vishwanath D, Rohit Rahul, Monika Sharma, Lovekesh Vig

    Abstract: With the widespread use of mobile phones and scanners to photograph and upload documents, the need for extracting the information trapped in unstructured document images such as retail receipts, insurance claim forms and financial invoices is becoming more acute. A major hurdle to this objective is that these images often contain information in the form of tables and extracting data from tabular s… ▽ More

    Submitted 6 January, 2020; originally announced January 2020.

  44. arXiv:1912.03960  [pdf, ps, other

    cs.LG cs.MA stat.ML

    MetaCI: Meta-Learning for Causal Inference in a Heterogeneous Population

    Authors: Ankit Sharma, Garima Gupta, Ranjitha Prasad, Arnab Chatterjee, Lovekesh Vig, Gautam Shroff

    Abstract: Performing inference on data obtained through observational studies is becoming extremely relevant due to the widespread availability of data in fields such as healthcare, education, retail, etc. Furthermore, this data is accrued from multiple homogeneous subgroups of a heterogeneous population, and hence, generalizing the inference mechanism over such data is essential. We propose the MetaCI fram… ▽ More

    Submitted 17 February, 2021; v1 submitted 9 December, 2019; originally announced December 2019.

    Comments: 10 pages, 4 figures, Accepted in CausalML Workshop - NeurIPS 2019

  45. ChartNet: Visual Reasoning over Statistical Charts using MAC-Networks

    Authors: Monika Sharma, Shikha Gupta, Arindam Chowdhury, Lovekesh Vig

    Abstract: Despite the improvements in perception accuracies brought about via deep learning, developing systems combining accurate visual perception with the ability to reason over the visual percepts remains extremely challenging. A particular application area of interest from an accessibility perspective is that of reasoning over statistical charts such as bar and pie charts. To this end, we formulate the… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

    Journal ref: International Joint Conference on Neural Networks (IJCNN) 2019

  46. arXiv:1911.05870  [pdf, other

    cs.CV

    Character Keypoint-based Homography Estimation in Scanned Documents for Efficient Information Extraction

    Authors: Kushagra Mahajan, Monika Sharma, Lovekesh Vig

    Abstract: Precise homography estimation between multiple images is a pre-requisite for many computer vision applications. One application that is particularly relevant in today's digital era is the alignment of scanned or camera-captured document images such as insurance claim forms for information extraction. Traditional learning based approaches perform poorly due to the absence of an appropriate gradient… ▽ More

    Submitted 13 November, 2019; originally announced November 2019.

    Comments: 6 pages, 4 figures

  47. arXiv:1909.07155  [pdf, other

    cs.LG stat.ML

    Meta-Learning for Few-Shot Time Series Classification

    Authors: Jyoti Narwariya, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff, Vishnu Tv

    Abstract: Deep neural networks (DNNs) have achieved state-of-the-art results on time series classification (TSC) tasks. In this work, we focus on leveraging DNNs in the often-encountered practical scenario where access to labeled training data is difficult, and where DNNs would be prone to overfitting. We leverage recent advancements in gradient-based meta-learning, and propose an approach to train a residu… ▽ More

    Submitted 4 March, 2021; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: CoDS COMAD 2020: Proceedings of the 7th ACM IKDD CoDS and 25th COMAD

  48. arXiv:1909.04276  [pdf, other

    cs.IR cs.LG

    NISER: Normalized Item and Session Representations to Handle Popularity Bias

    Authors: Priyanka Gupta, Diksha Garg, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

    Abstract: The goal of session-based recommendation (SR) models is to utilize the information from past actions (e.g. item/product clicks) in a session to recommend items that a user is likely to click next. Recently it has been shown that the sequence of item interactions in a session can be modeled as graph-structured data to better account for complex item transitions. Graph neural networks (GNNs) can lea… ▽ More

    Submitted 4 March, 2021; v1 submitted 10 September, 2019; originally announced September 2019.

    Comments: Presented at 1st International Workshop on Graph Representation Learning and its Applications, CIKM 2019

  49. arXiv:1907.06901  [pdf, other

    cs.LG stat.ML

    Meta-Learning for Black-box Optimization

    Authors: Vishnu TV, Pankaj Malhotra, Jyoti Narwariya, Lovekesh Vig, Gautam Shroff

    Abstract: Recently, neural networks trained as optimizers under the "learning to learn" or meta-learning framework have been shown to be effective for a broad range of optimization tasks including derivative-free black-box function optimization. Recurrent neural networks (RNNs) trained to optimize a diverse set of synthetic non-convex differentiable functions via gradient descent have been effective at opti… ▽ More

    Submitted 2 October, 2019; v1 submitted 16 July, 2019; originally announced July 2019.

    Comments: Accepted at ECML-PKDD 2019 Research Track

  50. arXiv:1906.02427  [pdf, other

    cs.AI cs.LG cs.LO

    One-shot Information Extraction from Document Images using Neuro-Deductive Program Synthesis

    Authors: Vishal Sunder, Ashwin Srinivasan, Lovekesh Vig, Gautam Shroff, Rohit Rahul

    Abstract: Our interest in this paper is in meeting a rapidly growing industrial demand for information extraction from images of documents such as invoices, bills, receipts etc. In practice users are able to provide a very small number of example images labeled with the information that needs to be extracted. We adopt a novel two-level neuro-deductive, approach where (a) we use pre-trained deep neural netwo… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

    Comments: 11 pages, appears in the 13th International Workshop on Neural-Symbolic Learning and Reasoning at IJCAI 2019