Skip to main content

Showing 1–50 of 89 results for author: Agarwal, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00121  [pdf, other

    cs.LG cs.AI cs.CL

    Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks

    Authors: Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal, Sadhana Kumaravel, Matthew Stallone, Rameswar Panda, Yara Rizk, GP Bhargav, Maxwell Crouse, Chulaka Gunasekara, Shajith Ikbal, Sachin Joshi, Hima Karanam, Vineet Kumar, Asim Munawar, Sumit Neelam, Dinesh Raghu, Udit Sharma, Adriana Meza Soria, Dheeraj Sreedhar, Praveen Venkateswaran, Merve Unuvar, David Cox, Salim Roukos, Luis Lastras , et al. (1 additional authors not shown)

    Abstract: Large language models (LLMs) have recently shown tremendous promise in serving as the backbone to agentic systems, as demonstrated by their performance in multi-faceted, challenging benchmarks like SWE-Bench and Agent-Bench. However, to realize the true potential of LLMs as autonomous agents, they must learn to identify, call, and interact with external tools and application program interfaces (AP… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

  2. arXiv:2406.17901  [pdf

    cs.CL

    Script-Agnostic Language Identification

    Authors: Milind Agarwal, Joshua Otten, Antonios Anastasopoulos

    Abstract: Language identification is used as the first step in many data collection and crawling efforts because it allows us to sort online text into language-specific buckets. However, many modern languages, such as Konkani, Kashmiri, Punjabi etc., are synchronically written in several scripts. Moreover, languages with different writing systems do not share significant lexical, semantic, and syntactic pro… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Under Review in ACL Rolling Review

  3. arXiv:2405.04324  [pdf, other

    cs.AI cs.CL cs.SE

    Granite Code Models: A Family of Open Foundation Models for Code Intelligence

    Authors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang, Yikang Shen, Aditya Prasad, Adriana Meza Soria, Michele Merler, Parameswaran Selvam, Saptha Surendran, Shivdeep Singh, Manish Sethi, Xuan-Hong Dang, Pengyuan Li, Kun-Lung Wu, Syed Zawad, Andrew Coleman, Matthew White, Mark Lewis, Raju Pavuluri, Yan Koyfman, Boris Lublinsky, Maximilien de Bayser, Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal , et al. (21 additional authors not shown)

    Abstract: Large Language Models (LLMs) trained on code are revolutionizing the software development process. Increasingly, code LLMs are being integrated into software development environments to improve the productivity of human programmers, and LLM-based agents are beginning to show promise for handling complex tasks autonomously. Realizing the full potential of code LLMs requires a wide range of capabili… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Corresponding Authors: Rameswar Panda, Ruchir Puri; Equal Contributors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang

  4. arXiv:2403.04224  [pdf, other

    cs.CL cs.AI cs.LG

    Aligners: Decoupling LLMs and Alignment

    Authors: Lilian Ngweta, Mayank Agarwal, Subha Maity, Alex Gittens, Yuekai Sun, Mikhail Yurochkin

    Abstract: Large Language Models (LLMs) need to be aligned with human expectations to ensure their safety and utility in most applications. Alignment is challenging, costly, and needs to be repeated for every LLM and alignment criterion. We propose to decouple LLMs and alignment by training aligner models that can be used to align any LLM for a given criteria on an as-needed basis, thus also reducing the pot… ▽ More

    Submitted 16 June, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Tiny Papers at the International Conference on Learning Representations (ICLR) 2024

  5. arXiv:2402.13301  [pdf, other

    cs.SD cs.AI eess.AS

    Structure-informed Positional Encoding for Music Generation

    Authors: Manvi Agarwal, Changhong Wang, Gaël Richard

    Abstract: Music generated by deep learning methods often suffers from a lack of coherence and long-term organization. Yet, multi-scale hierarchical structure is a distinctive feature of music signals. To leverage this information, we propose a structure-informed positional encoding framework for music generation with Transformers. We design three variants in terms of absolute, relative and non-stationary po… ▽ More

    Submitted 28 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2024, Seoul, South Korea

  6. arXiv:2401.10716  [pdf, other

    cs.CL

    Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models

    Authors: Mayank Agarwal, Yikang Shen, Bailin Wang, Yoon Kim, Jie Chen

    Abstract: Current language models tailored for code tasks often adopt the pre-training-then-fine-tuning paradigm from natural language processing, modeling source code as plain text. This approach, however, overlooks the unambiguous structures inherent in programming languages. In this work, we explore data-efficient adaptation of pre-trained code models by further pre-training and fine-tuning them with pro… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  7. arXiv:2311.07070  [pdf, other

    cs.CL

    Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations

    Authors: Zilu Tang, Mayank Agarwal, Alex Shypula, Bailin Wang, Derry Wijaya, Jie Chen, Yoon Kim

    Abstract: This work explores the use of self-generated natural language explanations as an intermediate step for code-to-code translation with language models. Across three types of explanations and 19 programming languages constructed from the MultiPL-E dataset, we find the explanations to be particularly effective in the zero-shot case, improving performance by 12% on average. Improvements with natural la… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: 9 pages, 4 figures, 5 tables, 48 pages total. To be published in EMNLP Findings 2023

  8. arXiv:2310.01583  [pdf, other

    stat.ML cs.LG

    An Investigation of Representation and Allocation Harms in Contrastive Learning

    Authors: Subha Maity, Mayank Agarwal, Mikhail Yurochkin, Yuekai Sun

    Abstract: The effect of underrepresentation on the performance of minority groups is known to be a serious problem in supervised learning settings; however, it has been underexplored so far in the context of self-supervised learning (SSL). In this paper, we demonstrate that contrastive learning (CL), a popular variant of SSL, tends to collapse representations of minority groups with certain majority groups.… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  9. arXiv:2305.14263  [pdf, other

    cs.CL cs.AI

    LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages

    Authors: Milind Agarwal, Md Mahfuz Ibn Alam, Antonios Anastasopoulos

    Abstract: Knowing the language of an input text/audio is a necessary first step for using almost every NLP tool such as taggers, parsers, or translation systems. Language identification is a well-studied problem, sometimes even considered solved; in reality, due to lack of data and computational challenges, current systems cannot accurately identify most of the world's 7000 languages. To tackle this bottlen… ▽ More

    Submitted 6 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: To appear at EMNLP 2023. 24 pages, 2 figures, 12 tables

  10. arXiv:2304.04539  [pdf, other

    cs.CL cs.AI cs.LG

    UATTA-EB: Uncertainty-Aware Test-Time Augmented Ensemble of BERTs for Classifying Common Mental Illnesses on Social Media Posts

    Authors: Pratinav Seth, Mihir Agarwal

    Abstract: Given the current state of the world, because of existing situations around the world, millions of people suffering from mental illnesses feel isolated and unable to receive help in person. Psychological studies have shown that our state of mind can manifest itself in the linguistic features we use to communicate. People have increasingly turned to online platforms to express themselves and seek h… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: Accepted at Tiny Papers @ ICLR 2023

  11. arXiv:2304.01322  [pdf

    cs.CL

    PALI: A Language Identification Benchmark for Perso-Arabic Scripts

    Authors: Sina Ahmadi, Milind Agarwal, Antonios Anastasopoulos

    Abstract: The Perso-Arabic scripts are a family of scripts that are widely adopted and used by various linguistic communities around the globe. Identifying various languages using such scripts is crucial to language technologies and challenging in low-resource setups. As such, this paper sheds light on the challenges of detecting languages using Perso-Arabic scripts, especially in bilingual communities wher… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 13 pages - accepted at VarDial at EACL 2023

  12. Queer In AI: A Case Study in Community-Led Participatory AI

    Authors: Organizers Of QueerInAI, :, Anaelia Ovalle, Arjun Subramonian, Ashwin Singh, Claas Voelcker, Danica J. Sutherland, Davide Locatelli, Eva Breznik, Filip Klubička, Hang Yuan, Hetvi J, Huan Zhang, Jaidev Shriram, Kruno Lehman, Luca Soldaini, Maarten Sap, Marc Peter Deisenroth, Maria Leonor Pacheco, Maria Ryskina, Martin Mundt, Milind Agarwal, Nyx McLean, Pan Xu, A Pranav , et al. (26 additional authors not shown)

    Abstract: We present Queer in AI as a case study for community-led participatory design in AI. We examine how participatory design and intersectional tenets started and shaped this community's programs over the years. We discuss different challenges that emerged in the process, look at ways this organization has fallen short of operationalizing participatory and intersectional principles, and then assess th… ▽ More

    Submitted 8 June, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: To appear at FAccT 2023

    Journal ref: 2023 ACM Conference on Fairness, Accountability, and Transparency

  13. arXiv:2303.00673  [pdf, other

    cs.HC cs.CY cs.LG

    Fairness Evaluation in Text Classification: Machine Learning Practitioner Perspectives of Individual and Group Fairness

    Authors: Zahra Ashktorab, Benjamin Hoover, Mayank Agarwal, Casey Dugan, Werner Geyer, Hao Bang Yang, Mikhail Yurochkin

    Abstract: Mitigating algorithmic bias is a critical task in the development and deployment of machine learning models. While several toolkits exist to aid machine learning practitioners in addressing fairness issues, little is known about the strategies practitioners employ to evaluate model fairness and what factors influence their assessment, particularly in the context of text classification. Two common… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: To appear in Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI '23)

  14. arXiv:2301.04650  [pdf, other

    cs.CV

    Geometry-biased Transformers for Novel View Synthesis

    Authors: Naveen Venkat, Mayank Agarwal, Maneesh Singh, Shubham Tulsiani

    Abstract: We tackle the task of synthesizing novel views of an object given a few input images and associated camera viewpoints. Our work is inspired by recent 'geometry-free' approaches where multi-view images are encoded as a (global) set-latent representation, which is then used to predict the color for arbitrary query rays. While this representation yields (coarsely) accurate images corresponding to nov… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: Project page: https://mayankgrwl97.github.io/gbt

  15. arXiv:2211.07675  [pdf, ps, other

    cs.LG cs.AI

    On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization

    Authors: Mudit Gaur, Vaneet Aggarwal, Mridul Agarwal

    Abstract: Deep Q-learning based algorithms have been applied successfully in many decision making problems, while their theoretical foundations are not as well understood. In this paper, we study a Fitted Q-Iteration with two-layer ReLU neural network parameterization, and find the sample complexity guarantees for the algorithm. Our approach estimates the Q-function in each iteration using a convex optimiza… ▽ More

    Submitted 30 January, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    ACM Class: F.2.1

  16. arXiv:2210.03692  [pdf, other

    cs.CV

    Compressing Video Calls using Synthetic Talking Heads

    Authors: Madhav Agarwal, Anchit Gupta, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C V Jawahar

    Abstract: We leverage the modern advancements in talking head generation to propose an end-to-end system for talking head video compression. Our algorithm transmits pivot frames intermittently while the rest of the talking head video is generated by animating them. We use a state-of-the-art face reenactment network to detect key points in the non-pivot frames and transmit them to the receiver. A dense flow… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: British Machine Vision Conference (BMVC), 2022

  17. arXiv:2210.02755  [pdf, other

    cs.CV

    Audio-Visual Face Reenactment

    Authors: Madhav Agarwal, Rudrabha Mukhopadhyay, Vinay Namboodiri, C V Jawahar

    Abstract: This work proposes a novel method to generate realistic talking head videos using audio and visual streams. We animate a source image by transferring head motion from a driving video using a dense motion field generated using learnable keypoints. We improve the quality of lip sync using audio as an additional input, helping the network to attend to the mouth region. We use additional priors using… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: Winter Conference on Applications of Computer Vision (WACV), 2023

  18. arXiv:2209.14250  [pdf

    cs.LG

    B2B Advertising: Joint Dynamic Scoring of Account and Users

    Authors: Atanu R. Sinha, Gautam Choudhary, Mansi Agarwal, Shivansh Bindal, Abhishek Pande, Camille Girabawe

    Abstract: When a business sells to another business (B2B), the buying business is represented by a group of individuals, termed account, who collectively decide whether to buy. The seller advertises to each individual and interacts with them, mostly by digital means. The sales cycle is long, most often over a few months. There is heterogeneity among individuals belonging to an account in seeking information… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: Published at KDD Workshop: AdKDD 2022

  19. arXiv:2205.08234  [pdf, other

    cs.LG cs.AI stat.ML

    Delaytron: Efficient Learning of Multiclass Classifiers with Delayed Bandit Feedbacks

    Authors: Naresh Manwani, Mudit Agarwal

    Abstract: In this paper, we present online algorithm called {\it Delaytron} for learning multi class classifiers using delayed bandit feedbacks. The sequence of feedback delays $\{d_t\}_{t=1}^T$ is unknown to the algorithm. At the $t$-th round, the algorithm observes an example $\mathbf{x}_t$ and predicts a label $\tilde{y}_t$ and receives the bandit feedback $\mathbb{I}[\tilde{y}_t=y_t]$ only $d_t$ rounds… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

  20. arXiv:2203.10200  [pdf, other

    quant-ph cond-mat.dis-nn cs.LG

    Emulating Quantum Dynamics with Neural Networks via Knowledge Distillation

    Authors: Yu Yao, Chao Cao, Stephan Haas, Mahak Agarwal, Divyam Khanna, Marcin Abram

    Abstract: High-fidelity quantum dynamics emulators can be used to predict the time evolution of complex physical systems. Here, we introduce an efficient training framework for constructing machine learning-based emulators. Our approach is based on the idea of knowledge distillation and uses elements of curriculum learning. It works by constructing a set of simple, but rich-in-physics training examples (a c… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: 11 pages plus methods plus supplementary information

    ACM Class: I.2.6; I.4.m; I.5.4; I.6.0

  21. Better Together? An Evaluation of AI-Supported Code Translation

    Authors: Justin D. Weisz, Michael Muller, Steven I. Ross, Fernando Martinez, Stephanie Houde, Mayank Agarwal, Kartik Talamadupula, John T. Richards

    Abstract: Generative machine learning models have recently been applied to source code, for use cases including translating code between programming languages, creating documentation from code, and auto-completing methods. Yet, state-of-the-art models often produce code that is erroneous or incomplete. In a controlled study with 32 software engineers, we examined whether such imperfect outputs are helpful i… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

    Comments: 35 pages, 3 figures. To be published in IUI 2022

  22. arXiv:2202.04903  [pdf, other

    cs.HC cs.AI cs.SE

    Investigating Explainability of Generative AI for Code through Scenario-based Design

    Authors: Jiao Sun, Q. Vera Liao, Michael Muller, Mayank Agarwal, Stephanie Houde, Kartik Talamadupula, Justin D. Weisz

    Abstract: What does it mean for a generative AI model to be explainable? The emergent discipline of explainable AI (XAI) has made great strides in helping people understand discriminative models. Less attention has been paid to generative models that produce artifacts, rather than decisions, as output. Meanwhile, generative AI (GenAI) technologies are maturing and being applied to application domains such a… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

  23. arXiv:2112.15360  [pdf, ps, other

    cs.AI

    Making AI 'Smart': Bridging AI and Cognitive Science

    Authors: Madhav Agarwal, Siddhant Bansal

    Abstract: The last two decades have seen tremendous advances in Artificial Intelligence. The exponential growth in terms of computation capabilities has given us hope of developing humans like robots. The question is: are we there yet? Maybe not. With the integration of cognitive science, the 'artificial' characteristic of Artificial Intelligence (AI) might soon be replaced with 'smart'. This will help deve… ▽ More

    Submitted 1 February, 2022; v1 submitted 31 December, 2021; originally announced December 2021.

  24. A Comprehensive Study on Various Statistical Techniques for Prediction of Movie Success

    Authors: Manav Agarwal, Shreya Venugopal, Rishab Kashyap, R Bharathi

    Abstract: The film industry is one of the most popular entertainment industries and one of the biggest markets for business. Among the contributing factors to this would be the success of a movie in terms of its popularity as well as its box office performance. Hence, we create a comprehensive comparison between the various machine learning models to predict the rate of success of a movie. The effectiveness… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: 14 pages, 12 figures Conference: 2nd International Conference on Machine Learning Techniques and Data Science (MLDS 2021)

  25. arXiv:2111.05546  [pdf, other

    cs.LG

    Biomarker Gene Identification for Breast Cancer Classification

    Authors: Sheetal Rajpal, Ankit Rajpal, Manoj Agarwal, Naveen Kumar

    Abstract: BACKGROUND: Breast cancer has emerged as one of the most prevalent cancers among women leading to a high mortality rate. Due to the heterogeneous nature of breast cancer, there is a need to identify differentially expressed genes associated with breast cancer subtypes for its timely diagnosis and treatment. OBJECTIVE: To identify a small gene set for each of the four breast cancer subtypes that co… ▽ More

    Submitted 29 November, 2021; v1 submitted 10 November, 2021; originally announced November 2021.

  26. arXiv:2111.03923  [pdf, other

    cs.LG q-bio.QM

    Deep Learning Based Model for Breast Cancer Subtype Classification

    Authors: Sheetal Rajpal, Virendra Kumar, Manoj Agarwal, Naveen Kumar

    Abstract: Breast cancer has long been a prominent cause of mortality among women. Diagnosis, therapy, and prognosis are now possible, thanks to the availability of RNA sequencing tools capable of recording gene expression data. Molecular subtyping being closely related to devising clinical strategy and prognosis, this paper focuses on the use of gene expression data for the classification of breast cancer i… ▽ More

    Submitted 9 November, 2021; v1 submitted 6 November, 2021; originally announced November 2021.

    Comments: Paper has been accepted for publication in ICACET 2021

  27. arXiv:2111.01072  [pdf, other

    cs.RO

    A generalized algorithm and framework for online 3-dimensional bin packing in an automated sorting center

    Authors: Ankush Ojha, Marichi Agarwal, Aniruddha Singhal, Chayan Sarkar, Supratim Ghosh, Rajesh Sinha

    Abstract: Online 3-dimensional bin packing problem (O3D-BPP) is getting renewed prominence due to the industrial automation brought by Industry 4.0. However, due to limited attention in the past and its challenging nature, a good approximate algorithm is in scarcity as compared to 1D or 2D problems. This paper considers real-time O$3$D-BPP of cuboidal boxes with partial information (look-ahead) in an automa… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: Accepted in The Seventh Indian Control Conference (ICC-7) 2021

  28. arXiv:2110.13953  [pdf, other

    cs.LG

    On sensitivity of meta-learning to support data

    Authors: Mayank Agarwal, Mikhail Yurochkin, Yuekai Sun

    Abstract: Meta-learning algorithms are widely used for few-shot learning. For example, image recognition systems that readily adapt to unseen classes after seeing only a few labeled examples. Despite their success, we show that modern meta-learning algorithms are extremely sensitive to the data used for adaptation, i.e. support data. In particular, we demonstrate the existence of (unaltered, in-distribution… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted at NeurIPS 2021

  29. arXiv:2110.08454  [pdf, other

    cs.CL

    Good Examples Make A Faster Learner: Simple Demonstration-based Learning for Low-resource NER

    Authors: Dong-Ho Lee, Akshen Kadakia, Kangmin Tan, Mahak Agarwal, Xinyu Feng, Takashi Shibuya, Ryosuke Mitani, Toshiyuki Sekiya, Jay Pujara, Xiang Ren

    Abstract: Recent advances in prompt-based learning have shown strong results on few-shot text classification by using cloze-style templates. Similar attempts have been made on named entity recognition (NER) which manually design templates to predict entity types for every text span in a sentence. However, such methods may suffer from error propagation induced by entity span detection, high cost due to enume… ▽ More

    Submitted 30 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Accepted to ACL 2022 main conference. 14 pages, 8 figures, 9 tables

  30. arXiv:2110.05423  [pdf, other

    cs.CL

    Using Document Similarity Methods to create Parallel Datasets for Code Translation

    Authors: Mayank Agarwal, Kartik Talamadupula, Fernando Martinez, Stephanie Houde, Michael Muller, John Richards, Steven I Ross, Justin D. Weisz

    Abstract: Translating source code from one programming language to another is a critical, time-consuming task in modernizing legacy applications and codebases. Recent work in this space has drawn inspiration from the software naturalness hypothesis by applying natural language processing techniques towards automating the code translation task. However, due to the paucity of parallel data in this domain, sup… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

  31. arXiv:2110.04640  [pdf, other

    cs.IR

    Lookup or Exploratory: What is Your Search Intent?

    Authors: Manoj K. Agarwal, Tezan Sahu

    Abstract: Search query specificity is broadly divided into two categories - Exploratory or Lookup. If a query specificity can be identified at the run time, it can be used to significantly improve the search results as well as quality of suggestions to alter the query. However, with millions of queries coming every day on a commercial search engine, it is non-trivial to develop a horizontal technique to det… ▽ More

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: 12 pages

    ACM Class: H.3.3

  32. arXiv:2110.02311  [pdf, other

    cs.CL

    COVID-19 India Dataset: Parsing COVID-19 Data in Daily Health Bulletins from States in India

    Authors: Mayank Agarwal, Tathagata Chakraborti, Sachin Grover, Arunima Chaudhary

    Abstract: While India has been one of the hotspots of COVID-19, data about the pandemic from the country has proved to be largely inaccessible at scale. Much of the data exists in unstructured form on the web, and limited aspects of such data are available through public APIs maintained manually through volunteer effort. This has proved to be difficult both in terms of ease of access to detailed data and wi… ▽ More

    Submitted 6 December, 2021; v1 submitted 27 September, 2021; originally announced October 2021.

    Comments: URL: ibm.biz/covid-data-india. Accepted at the Machine Learning in Public Health workshop at NeurIPS 2021

  33. arXiv:2109.06332  [pdf, other

    cs.LG

    Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach

    Authors: Qinbo Bai, Amrit Singh Bedi, Mridul Agarwal, Alec Koppel, Vaneet Aggarwal

    Abstract: Reinforcement learning is widely used in applications where one needs to perform sequential decisions while interacting with the environment. The problem becomes more challenging when the decision requirement includes satisfying some safety constraints. The problem is mathematically formulated as constrained Markov decision process (CMDP). In the literature, various algorithms are available to sol… ▽ More

    Submitted 13 July, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: This paper is the arXiv version with Appendices of the published AAAI paper: "Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach," in Proc. AAAI, Feb 2022. The paper has been further extended with concave utilities and constraints in v2

    Journal ref: AAAI 2022

  34. arXiv:2109.05439  [pdf, other

    cs.LG cs.AI

    Concave Utility Reinforcement Learning with Zero-Constraint Violations

    Authors: Mridul Agarwal, Qinbo Bai, Vaneet Aggarwal

    Abstract: We consider the problem of tabular infinite horizon concave utility reinforcement learning (CURL) with convex constraints. For this, we propose a model-based learning algorithm that also achieves zero constraint violations. Assuming that the concave objective and the convex constraints have a solution interior to the set of feasible occupation measures, we solve a tighter optimization problem to e… ▽ More

    Submitted 16 November, 2023; v1 submitted 12 September, 2021; originally announced September 2021.

    Comments: Transactions on Machine Learning Research, Dec 2022

    Journal ref: Transactions on Machine Learning Research, Dec 2022

  35. arXiv:2109.04024  [pdf, ps, other

    cs.LG cs.AI cs.GT cs.MA

    On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)

    Authors: Washim Uddin Mondal, Mridul Agarwal, Vaneet Aggarwal, Satish V. Ukkusuri

    Abstract: Mean field control (MFC) is an effective way to mitigate the curse of dimensionality of cooperative multi-agent reinforcement learning (MARL) problems. This work considers a collection of $N_{\mathrm{pop}}$ heterogeneous agents that can be segregated into $K$ classes such that the $k$-th class contains $N_k$ homogeneous agents. We aim to prove approximation guarantees of the MARL problem for this… ▽ More

    Submitted 8 May, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: 46 pages

    Journal ref: Journal of Machine Learning Research 23(129): 1-46, 2022

  36. arXiv:2106.06680  [pdf, other

    cs.LG cs.AI eess.SY

    Markov Decision Processes with Long-Term Average Constraints

    Authors: Mridul Agarwal, Qinbo Bai, Vaneet Aggarwal

    Abstract: We consider the problem of constrained Markov Decision Process (CMDP) where an agent interacts with a unichain Markov Decision Process. At every interaction, the agent obtains a reward. Further, there are $K$ cost functions. The agent aims to maximize the long-term average reward while simultaneously keeping the $K$ long-term average costs lower than a certain threshold. In this paper, we propose… ▽ More

    Submitted 20 June, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

  37. arXiv:2105.14125  [pdf, other

    cs.LG cs.AI eess.SY

    Joint Optimization of Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm

    Authors: Qinbo Bai, Mridul Agarwal, Vaneet Aggarwal

    Abstract: Many engineering problems have multiple objectives, and the overall aim is to optimize a non-linear function of these objectives. In this paper, we formulate the problem of maximizing a non-linear concave function of multiple long-term objectives. A policy-gradient based model-free algorithm is proposed for the problem. To compute an estimate of the gradient, a biased estimator is proposed. The pr… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

  38. Perfection Not Required? Human-AI Partnerships in Code Translation

    Authors: Justin D. Weisz, Michael Muller, Stephanie Houde, John Richards, Steven I. Ross, Fernando Martinez, Mayank Agarwal, Kartik Talamadupula

    Abstract: Generative models have become adept at producing artifacts such as images, videos, and prose at human-like levels of proficiency. New generative techniques, such as unsupervised neural machine translation (NMT), have recently been applied to the task of generating source code, translating it from one programming language to another. The artifacts produced in this way may contain imperfections, suc… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: 18 pages, 1 figure. To be published in IUI 2021

  39. arXiv:2103.02523  [pdf, other

    cs.CL

    NeurIPS 2020 NLC2CMD Competition: Translating Natural Language to Bash Commands

    Authors: Mayank Agarwal, Tathagata Chakraborti, Quchen Fu, David Gros, Xi Victoria Lin, Jaron Maene, Kartik Talamadupula, Zhongwei Teng, Jules White

    Abstract: The NLC2CMD Competition hosted at NeurIPS 2020 aimed to bring the power of natural language processing to the command line. Participants were tasked with building models that can transform descriptions of command line tasks in English to their Bash syntax. This is a report on the competition with details of the task, metrics, data, attempted solutions, and lessons learned.

    Submitted 8 August, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: Appears in PMLR Volume 133: NeurIPS 2020 Competition and Demonstration Track. Competition URL: http://ibm.biz/nlc2cmd

  40. arXiv:2102.10740  [pdf, other

    cs.LG cs.AI cs.MA

    Communication Efficient Parallel Reinforcement Learning

    Authors: Mridul Agarwal, Bhargav Ganguly, Vaneet Aggarwal

    Abstract: We consider the problem where $M$ agents interact with $M$ identical and independent environments with $S$ states and $A$ actions using reinforcement learning for $T$ rounds. The agents share their data with a central server to minimize their regret. We aim to find an algorithm that allows the agents to minimize the regret with infrequent communication rounds. We provide \NAM\ which runs at each a… ▽ More

    Submitted 21 February, 2021; originally announced February 2021.

  41. arXiv:2102.08462  [pdf, ps, other

    cs.LG cs.AI cs.MA

    Multi-Agent Multi-Armed Bandits with Limited Communication

    Authors: Mridul Agarwal, Vaneet Aggarwal, Kamyar Azizzadenesheli

    Abstract: We consider the problem where $N$ agents collaboratively interact with an instance of a stochastic $K$ arm bandit problem for $K \gg N$. The agents aim to simultaneously minimize the cumulative regret over all the agents for a total of $T$ time steps, the number of communication rounds, and the number of bits in each communication round. We present Limited Communication Collaboration - Upper Confi… ▽ More

    Submitted 10 February, 2021; originally announced February 2021.

  42. arXiv:2102.01761  [pdf

    physics.optics cs.AI

    Deep Convolutional Neural Networks to Predict Mutual Coupling Effects in Metasurfaces

    Authors: Sensong An, Bowen Zheng, Mikhail Y. Shalaginov, Hong Tang, Hang Li, Li Zhou, Yunxi Dong, Mohammad Haerinia, Anuradha Murthy Agarwal, Clara Rivero-Baleine, Myungkoo Kang, Kathleen A. Richardson, Tian Gu, Juejun Hu, Clayton Fowler, Hualiang Zhang

    Abstract: Metasurfaces have provided a novel and promising platform for the realization of compact and large-scale optical devices. The conventional metasurface design approach assumes periodic boundary conditions for each element, which is inaccurate in most cases since the near-field coupling effects between elements will change when surrounded by non-identical structures. In this paper, we propose a deep… ▽ More

    Submitted 2 February, 2021; originally announced February 2021.

    Comments: 16 pages, 10 figures

  43. arXiv:2101.01508  [pdf

    cs.DL physics.comp-ph physics.data-an

    Looking Through Glass: Knowledge Discovery from Materials Science Literature using Natural Language Processing

    Authors: Vineeth Venugopal, Sourav Sahoo, Mohd Zaki, Manish Agarwal, Nitya Nand Gosvami, N. M. Anoop Krishnan

    Abstract: Most of the knowledge in materials science literature is in the form of unstructured data such as text and images. Here, we present a framework employing natural language processing, which automates text and image comprehension and precision knowledge extraction from inorganic glasses' literature. The abstracts are automatically categorized using latent Dirichlet allocation (LDA), providing a way… ▽ More

    Submitted 5 January, 2021; originally announced January 2021.

    Comments: 17 pages, 5 figures

  44. arXiv:2012.07581  [pdf, other

    cs.SE cs.PL

    Quality Estimation & Interpretability for Code Translation

    Authors: Mayank Agarwal, Kartik Talamadupula, Stephanie Houde, Fernando Martinez, Michael Muller, John Richards, Steven Ross, Justin D. Weisz

    Abstract: Recently, the automated translation of source code from one programming language to another by using automatic approaches inspired by Neural Machine Translation (NMT) methods for natural languages has come under study. However, such approaches suffer from the same problem as previous NMT approaches on natural languages, viz. the lack of an ability to estimate and evaluate the quality of the transl… ▽ More

    Submitted 26 April, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: NeurIPS 2020 Workshop on Computer-Assisted Programming

  45. arXiv:2011.07715  [pdf, ps, other

    cs.LG

    Blind Decision Making: Reinforcement Learning with Delayed Observations

    Authors: Mridul Agarwal, Vaneet Aggarwal

    Abstract: Reinforcement learning typically assumes that the state update from the previous actions happens instantaneously, and thus can be used for making future decisions. However, this may not always be true. When the state update is not available, the decision taken is partly in the blind since it cannot rely on the current state information. This paper proposes an approach, where the delay in the knowl… ▽ More

    Submitted 15 November, 2020; originally announced November 2020.

    Journal ref: in part in ICAPS 2021

  46. arXiv:2011.07687  [pdf, ps, other

    cs.LG stat.ML

    DART: aDaptive Accept RejecT for non-linear top-K subset identification

    Authors: Mridul Agarwal, Vaneet Aggarwal, Christopher J. Quinn, Abhishek Umrawal

    Abstract: We consider the bandit problem of selecting $K$ out of $N$ arms at each time step. The reward can be a non-linear function of the rewards of the selected individual arms. The direct use of a multi-armed bandit algorithm requires choosing among $\binom{N}{K}$ options, making the action space large. To simplify the problem, existing works on combinatorial bandits {typically} assume feedback as a lin… ▽ More

    Submitted 15 November, 2020; originally announced November 2020.

    Journal ref: AAAI 2021

  47. arXiv:2010.12574  [pdf, other

    cs.LG stat.ML

    Online Semi-Supervised Learning with Bandit Feedback

    Authors: Sohini Upadhyay, Mikhail Yurochkin, Mayank Agarwal, Yasaman Khazaeni, DjallelBouneffouf

    Abstract: We formulate a new problem at the intersectionof semi-supervised learning and contextual bandits,motivated by several applications including clini-cal trials and ad recommendations. We demonstratehow Graph Convolutional Network (GCN), a semi-supervised learning approach, can be adjusted tothe new problem formulation. We also propose avariant of the linear contextual bandit with semi-supervised mis… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

  48. arXiv:2008.10831  [pdf, other

    cs.CV

    CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

    Authors: Madhav Agarwal, Ajoy Mondal, C. V. Jawahar

    Abstract: Localizing page elements/objects such as tables, figures, equations, etc. is the primary step in extracting information from document images. We propose a novel end-to-end trainable deep network, (CDeC-Net) for detecting tables present in the documents. The proposed network consists of a multistage extension of Mask R-CNN with a dual backbone having deformable convolution for detecting tables vary… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

    Comments: 12

  49. arXiv:2007.10987  [pdf, other

    cs.LG cs.CR cs.DC

    IBM Federated Learning: an Enterprise Framework White Paper V0.1

    Authors: Heiko Ludwig, Nathalie Baracaldo, Gegi Thomas, Yi Zhou, Ali Anwar, Shashank Rajamoni, Yuya Ong, Jayaram Radhakrishnan, Ashish Verma, Mathieu Sinn, Mark Purcell, Ambrish Rawat, Tran Minh, Naoise Holohan, Supriyo Chakraborty, Shalisha Whitherspoon, Dean Steuer, Laura Wynter, Hifaz Hassan, Sean Laguna, Mikhail Yurochkin, Mayank Agarwal, Ebube Chuba, Annie Abay

    Abstract: Federated Learning (FL) is an approach to conduct machine learning without centralizing training data in a single place, for reasons of privacy, confidentiality or data volume. However, solving federated machine learning problems raises issues above and beyond those of centralized machine learning. These issues include setting up communication infrastructure between parties, coordinating the learn… ▽ More

    Submitted 22 July, 2020; originally announced July 2020.

    Comments: 17 pages

    ACM Class: I.2.6; I.2.11

  50. arXiv:2007.08637  [pdf, other

    eess.IV cs.CV cs.LG

    COV-ELM classifier: An Extreme Learning Machine based identification of COVID-19 using Chest X-Ray Images

    Authors: Sheetal Rajpal, Manoj Agarwal, Ankit Rajpal, Navin Lakhyani, Arpita Saggar, Naveen Kumar

    Abstract: Coronaviruses constitute a family of viruses that gives rise to respiratory diseases. As COVID-19 is highly contagious, early diagnosis of COVID-19 is crucial for an effective treatment strategy. However, the RT-PCR test which is considered to be a gold standard in the diagnosis of COVID-19 suffers from a high false-negative rate. Chest X-ray (CXR) image analysis has emerged as a feasible and effe… ▽ More

    Submitted 28 September, 2021; v1 submitted 16 July, 2020; originally announced July 2020.