Zum Hauptinhalt springen

Showing 1–30 of 30 results for author: Sodhani, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.15372  [pdf, other

    cs.CL cs.AI

    EpiK-Eval: Evaluation for Language Models as Epistemic Models

    Authors: Gabriele Prato, Jerry Huang, Prasannna Parthasarathi, Shagun Sodhani, Sarath Chandar

    Abstract: In the age of artificial intelligence, the role of large language models (LLMs) is becoming increasingly central. Despite their growing prevalence, their capacity to consolidate knowledge from different training documents - a crucial ability in numerous applications - remains unexplored. This paper presents the first study examining the capability of LLMs to effectively combine such information wi… ▽ More

    Submitted 22 February, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  2. arXiv:2310.00166  [pdf, other

    cs.AI cs.LG

    Motif: Intrinsic Motivation from Artificial Intelligence Feedback

    Authors: Martin Klissarov, Pierluca D'Oro, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff

    Abstract: Exploring rich environments and evaluating one's actions without prior knowledge is immensely challenging. In this paper, we propose Motif, a general method to interface such prior knowledge from a Large Language Model (LLM) with an agent. Motif is based on the idea of grounding LLMs for decision-making without requiring them to interact with the environment: it elicits preferences from an LLM ove… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

    Comments: The first two authors equally contributed - order decided by coin flip

  3. arXiv:2306.00577  [pdf, other

    cs.LG cs.AI

    TorchRL: A data-driven decision-making library for PyTorch

    Authors: Albert Bou, Matteo Bettini, Sebastian Dittert, Vikash Kumar, Shagun Sodhani, Xiaomeng Yang, Gianni De Fabritiis, Vincent Moens

    Abstract: PyTorch has ascended as a premier machine learning framework, yet it lacks a native and comprehensive library for decision and control tasks suitable for large development teams dealing with complex real-world data and environments. To address this issue, we propose TorchRL, a generalistic control library for PyTorch that provides well-integrated, yet standalone components. We introduce a new and… ▽ More

    Submitted 27 November, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

  4. arXiv:2305.14550  [pdf, other

    cs.LG cs.AI

    When should we prefer Decision Transformers for Offline Reinforcement Learning?

    Authors: Prajjwal Bhargava, Rohan Chitnis, Alborz Geramifard, Shagun Sodhani, Amy Zhang

    Abstract: Offline reinforcement learning (RL) allows agents to learn effective, return-maximizing policies from a static dataset. Three popular algorithms for offline RL are Conservative Q-Learning (CQL), Behavior Cloning (BC), and Decision Transformer (DT), from the class of Q-Learning, Imitation Learning, and Sequence Modeling respectively. A key open question is: which algorithm is preferred under what c… ▽ More

    Submitted 11 March, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: International Conference on Learning Representations (ICLR) 2024

  5. arXiv:2210.00030  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training

    Authors: Yecheng Jason Ma, Shagun Sodhani, Dinesh Jayaraman, Osbert Bastani, Vikash Kumar, Amy Zhang

    Abstract: Reward and representation learning are two long-standing challenges for learning an expanding set of robot manipulation skills from sensory observations. Given the inherent cost and scarcity of in-domain, task-specific robot data, learning from large, diverse, offline human videos has emerged as a promising path towards acquiring a generally useful visual representation for control; however, how t… ▽ More

    Submitted 6 March, 2023; v1 submitted 30 September, 2022; originally announced October 2022.

    Comments: ICLR 2023, Notable-Top-25% (Spotlight). Project website: https://sites.google.com/view/vip-rl

  6. arXiv:2207.10430  [pdf, other

    cs.LG cs.AI

    The Neural Race Reduction: Dynamics of Abstraction in Gated Networks

    Authors: Andrew M. Saxe, Shagun Sodhani, Sam Lewallen

    Abstract: Our theoretical understanding of deep learning has not kept pace with its empirical success. While network architecture is known to be critical, we do not yet understand its effect on learned representations and network behavior, or how this architecture should reflect task structure.In this work, we begin to address this gap by introducing the Gated Deep Linear Network framework that schematizes… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: ICML 2022; 23 pages; 10 figures

  7. arXiv:2207.04354  [pdf, other

    cs.LG cs.AI

    An Introduction to Lifelong Supervised Learning

    Authors: Shagun Sodhani, Mojtaba Faramarzi, Sanket Vaibhav Mehta, Pranshu Malviya, Mohamed Abdelsalam, Janarthanan Janarthanan, Sarath Chandar

    Abstract: This primer is an attempt to provide a detailed summary of the different facets of lifelong learning. We start with Chapter 2 which provides a high-level overview of lifelong learning systems. In this chapter, we discuss prominent scenarios in lifelong learning (Section 2.4), provide 8 Introduction a high-level organization of different lifelong learning approaches (Section 2.5), enumerate the des… ▽ More

    Submitted 12 July, 2022; v1 submitted 9 July, 2022; originally announced July 2022.

    Comments: Lifelong Learning Primer

  8. arXiv:2202.07013  [pdf, other

    cs.LG cs.AI cs.RO

    Robust Policy Learning over Multiple Uncertainty Sets

    Authors: Annie Xie, Shagun Sodhani, Chelsea Finn, Joelle Pineau, Amy Zhang

    Abstract: Reinforcement learning (RL) agents need to be robust to variations in safety-critical environments. While system identification methods provide a way to infer the variation from online experience, they can fail in settings where fast identification is not possible. Another dominant approach is robust RL which produces a policy that can handle worst-case scenarios, but these methods are generally d… ▽ More

    Submitted 4 March, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Project webpage at https://sites.google.com/view/sirsa-public/home

  9. arXiv:2110.06972  [pdf, other

    cs.LG cs.AI

    Block Contextual MDPs for Continual Learning

    Authors: Shagun Sodhani, Franziska Meier, Joelle Pineau, Amy Zhang

    Abstract: In reinforcement learning (RL), when defining a Markov Decision Process (MDP), the environment dynamics is implicitly assumed to be stationary. This assumption of stationarity, while simplifying, can be unrealistic in many scenarios. In the continual reinforcement learning scenario, the sequence of tasks is another source of nonstationarity. In this work, we propose to examine this continual reinf… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

    Comments: 26pages, Under Review

  10. arXiv:2102.06177  [pdf, other

    cs.LG cs.AI cs.RO

    Multi-Task Reinforcement Learning with Context-based Representations

    Authors: Shagun Sodhani, Amy Zhang, Joelle Pineau

    Abstract: The benefit of multi-task learning over single-task learning relies on the ability to use relations across tasks to improve performance on any single task. While sharing representations is an important mechanism to share information across tasks, its success depends on how well the structure underlying the tasks is captured. In some real-world situations, we have access to metadata, or additional… ▽ More

    Submitted 10 June, 2021; v1 submitted 11 February, 2021; originally announced February 2021.

    Comments: Accepted at the 38th International Conference on Machine Learning (ICML 2021). 17 pages, 4 figures, 20 tables

  11. arXiv:2012.12477  [pdf, other

    cs.CV cs.AI cs.LG

    IIRC: Incremental Implicitly-Refined Classification

    Authors: Mohamed Abdelsalam, Mojtaba Faramarzi, Shagun Sodhani, Sarath Chandar

    Abstract: We introduce the "Incremental Implicitly-Refined Classi-fication (IIRC)" setup, an extension to the class incremental learning setup where the incoming batches of classes have two granularity levels. i.e., each sample could have a high-level (coarse) label like "bear" and a low-level (fine) label like "polar bear". Only one label is provided at a time, and the model has to figure out the other lab… ▽ More

    Submitted 11 January, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

  12. arXiv:2010.02838  [pdf, other

    cs.LG cs.DC math.OC

    A Closer Look at Codistillation for Distributed Training

    Authors: Shagun Sodhani, Olivier Delalleau, Mahmoud Assran, Koustuv Sinha, Nicolas Ballas, Michael Rabbat

    Abstract: Codistillation has been proposed as a mechanism to share knowledge among concurrently trained models by encouraging them to represent the same function through an auxiliary loss. This contrasts with the more commonly used fully-synchronous data-parallel stochastic gradient descent methods, where different model replicas average their gradients (or parameters) at every iteration and thus maintain i… ▽ More

    Submitted 25 July, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: Under review

  13. arXiv:2007.10546  [pdf, ps, other

    cs.CY cs.AI cs.LG

    Ideas for Improving the Field of Machine Learning: Summarizing Discussion from the NeurIPS 2019 Retrospectives Workshop

    Authors: Shagun Sodhani, Mayoore S. Jaiswal, Lauren Baker, Koustuv Sinha, Carl Shneider, Peter Henderson, Joel Lehman, Ryan Lowe

    Abstract: This report documents ideas for improving the field of machine learning, which arose from discussions at the ML Retrospectives workshop at NeurIPS 2019. The goal of the report is to disseminate these ideas more broadly, and in turn encourage continuing discussion about how the field could improve along these axes. We focus on topics that were most discussed at the workshop: incentives for encourag… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

  14. arXiv:2007.07206  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Robust State Abstractions for Hidden-Parameter Block MDPs

    Authors: Amy Zhang, Shagun Sodhani, Khimya Khetarpal, Joelle Pineau

    Abstract: Many control tasks exhibit similar dynamics that can be modeled as having common latent structure. Hidden-Parameter Markov Decision Processes (HiP-MDPs) explicitly model this structure to improve sample efficiency in multi-task settings. However, this setting makes strong assumptions on the observability of the state that limit its application in real-world scenarios with rich observation spaces.… ▽ More

    Submitted 11 February, 2021; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: Accepted at the 9th International Conference on Learning Representations. 22 pages, 14 figures

  15. arXiv:2004.07213  [pdf, ps, other

    cs.CY

    Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims

    Authors: Miles Brundage, Shahar Avin, Jasmine Wang, Haydn Belfield, Gretchen Krueger, Gillian Hadfield, Heidy Khlaaf, Jingying Yang, Helen Toner, Ruth Fong, Tegan Maharaj, Pang Wei Koh, Sara Hooker, Jade Leung, Andrew Trask, Emma Bluemke, Jonathan Lebensold, Cullen O'Keefe, Mark Koren, Théo Ryffel, JB Rubinovitz, Tamay Besiroglu, Federica Carugati, Jack Clark, Peter Eckersley , et al. (34 additional authors not shown)

    Abstract: With the recent wave of progress in artificial intelligence (AI) has come a growing awareness of the large-scale impacts of AI systems, and recognition that existing regulations and norms in industry and academia are insufficient to ensure responsible AI development. In order for AI developers to earn trust from system users, customers, civil society, governments, and other stakeholders that they… ▽ More

    Submitted 20 April, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

  16. arXiv:2003.06560  [pdf, other

    cs.LG stat.ML

    Evaluating Logical Generalization in Graph Neural Networks

    Authors: Koustuv Sinha, Shagun Sodhani, Joelle Pineau, William L. Hamilton

    Abstract: Recent research has highlighted the role of relational inductive biases in building learning agents that can generalize and reason in a compositional manner. However, while relational learning algorithms such as graph neural networks (GNNs) show promise, we do not understand how effectively these approaches can adapt to new tasks. In this work, we study the task of logical generalization using GNN… ▽ More

    Submitted 14 March, 2020; originally announced March 2020.

  17. arXiv:2003.06016  [pdf, other

    cs.LG cs.AI stat.ML

    Invariant Causal Prediction for Block MDPs

    Authors: Amy Zhang, Clare Lyle, Shagun Sodhani, Angelos Filos, Marta Kwiatkowska, Joelle Pineau, Yarin Gal, Doina Precup

    Abstract: Generalization across environments is critical to the successful application of reinforcement learning algorithms to real-world challenges. In this paper, we consider the problem of learning abstractions that generalize in block MDPs, families of environments with a shared latent state space and dynamics structure over that latent space, but varying observations. We leverage tools from causal infe… ▽ More

    Submitted 11 June, 2020; v1 submitted 12 March, 2020; originally announced March 2020.

    Comments: Accepted to ICML 2020. 16 pages, 8 figures

  18. arXiv:1909.10893  [pdf, other

    cs.LG cs.AI stat.ML

    Recurrent Independent Mechanisms

    Authors: Anirudh Goyal, Alex Lamb, Jordan Hoffmann, Shagun Sodhani, Sergey Levine, Yoshua Bengio, Bernhard Schölkopf

    Abstract: Learning modular structures which reflect the dynamics of the environment can lead to better generalization and robustness to changes which only affect a few of the underlying causes. We propose Recurrent Independent Mechanisms (RIMs), a new recurrent architecture in which multiple groups of recurrent cells operate with nearly independent transition dynamics, communicate only sparingly through the… ▽ More

    Submitted 17 November, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

  19. arXiv:1908.06177  [pdf, other

    cs.LG cs.CL cs.LO stat.ML

    CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text

    Authors: Koustuv Sinha, Shagun Sodhani, Jin Dong, Joelle Pineau, William L. Hamilton

    Abstract: The recent success of natural language understanding (NLU) systems has been troubled by results highlighting the failure of these models to generalize in a systematic and robust way. In this work, we introduce a diagnostic benchmark suite, named CLUTRR, to clarify some key issues related to the robustness and systematicity of NLU systems. Motivated by classic work on inductive logic programming, C… ▽ More

    Submitted 3 September, 2019; v1 submitted 16 August, 2019; originally announced August 2019.

    Comments: Accepted at EMNLP 2019, 9 page content + Appendix

  20. arXiv:1906.10667  [pdf, other

    cs.LG cs.AI stat.ML

    Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives

    Authors: Anirudh Goyal, Shagun Sodhani, Jonathan Binas, Xue Bin Peng, Sergey Levine, Yoshua Bengio

    Abstract: Reinforcement learning agents that operate in diverse and complex environments can benefit from the structured decomposition of their behavior. Often, this is addressed in the context of hierarchical reinforcement learning, where the aim is to decompose a policy into lower-level primitives or options, and a higher-level meta-policy that triggers the appropriate behaviors for a given situation. How… ▽ More

    Submitted 25 June, 2019; originally announced June 2019.

    Comments: Preprint, Under Review

  21. arXiv:1906.04355  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Powerful Policies by Using Consistent Dynamics Model

    Authors: Shagun Sodhani, Anirudh Goyal, Tristan Deleu, Yoshua Bengio, Sergey Levine, Jian Tang

    Abstract: Model-based Reinforcement Learning approaches have the promise of being sample efficient. Much of the progress in learning dynamics models in RL has been made by learning models via supervised learning. But traditional model-based approaches lead to `compounding errors' when the model is unrolled step by step. Essentially, the state transitions that the learner predicts (by unrolling the model for… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: Accpted at RLDM 2019

  22. arXiv:1811.10732  [pdf, ps, other

    cs.AI cs.LG

    Environments for Lifelong Reinforcement Learning

    Authors: Khimya Khetarpal, Shagun Sodhani, Sarath Chandar, Doina Precup

    Abstract: To achieve general artificial intelligence, reinforcement learning (RL) agents should learn not only to optimize returns for one specific task but also to constantly build more complex skills and scaffold their knowledge about the world, without forgetting what has already been learned. In this paper, we discuss the desired characteristics of environments that can support the training and evaluati… ▽ More

    Submitted 6 December, 2018; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: Accepted at 2nd Continual Learning Workshop, Neural Information Processing Systems (NeurIPS) 2018

  23. arXiv:1811.07017  [pdf, other

    cs.LG cs.AI stat.ML

    Towards Training Recurrent Neural Networks for Lifelong Learning

    Authors: Shagun Sodhani, Sarath Chandar, Yoshua Bengio

    Abstract: Catastrophic forgetting and capacity saturation are the central challenges of any parametric lifelong learning system. In this work, we study these challenges in the context of sequential supervised learning with an emphasis on recurrent neural networks. To evaluate the models in the lifelong learning setting, we propose a curriculum-based, simple, and intuitive benchmark where the models are trai… ▽ More

    Submitted 9 September, 2019; v1 submitted 16 November, 2018; originally announced November 2018.

    Comments: Accepted at Neural Computation

  24. arXiv:1811.02959  [pdf, other

    cs.CL cs.AI

    Compositional Language Understanding with Text-based Relational Reasoning

    Authors: Koustuv Sinha, Shagun Sodhani, William L. Hamilton, Joelle Pineau

    Abstract: Neural networks for natural language reasoning have largely focused on extractive, fact-based question-answering (QA) and common-sense inference. However, it is also crucial to understand the extent to which neural networks can perform relational reasoning and combinatorial generalization from natural language---abilities that are often obscured by annotation artifacts and the dominance of languag… ▽ More

    Submitted 7 November, 2018; v1 submitted 7 November, 2018; originally announced November 2018.

    Comments: 4 pages of main content, to be presented at Relational Representation Learning Workshop, NIPS 2018, Montreal

  25. arXiv:1810.09007  [pdf, other

    cs.DB cs.DC

    Spatial Co-location Pattern Mining - A new perspective using Graph Database

    Authors: Sanket Vaibhav Mehta, Shagun Sodhani, Dhaval Patel

    Abstract: Spatial co-location pattern mining refers to the task of discovering the group of objects or events that co-occur at many places. Extracting these patterns from spatial data is very difficult due to the complexity of spatial data types, spatial relationships, and spatial auto-correlation. These patterns have applications in domains including public safety, geo-marketing, crime prediction and ecolo… ▽ More

    Submitted 21 October, 2018; originally announced October 2018.

  26. arXiv:1805.11016  [pdf, other

    cs.LG stat.ML

    Memory Augmented Self-Play

    Authors: Shagun Sodhani, Vardaan Pahuja

    Abstract: Self-play is an unsupervised training procedure which enables the reinforcement learning agents to explore the environment without requiring any external rewards. We augment the self-play setting by providing an external memory where the agent can store experience from the previous tasks. This enables the agent to come up with more diverse self-play tasks resulting in faster exploration of the env… ▽ More

    Submitted 31 May, 2018; v1 submitted 28 May, 2018; originally announced May 2018.

  27. arXiv:1805.08174  [pdf, other

    cs.CV cs.CL

    Reproducibility Report for "Learning To Count Objects In Natural Images For Visual Question Answering"

    Authors: Shagun Sodhani, Vardaan Pahuja

    Abstract: This is the reproducibility report for the paper "Learning To Count Objects In Natural Images For Visual QuestionAnswering"

    Submitted 21 May, 2018; originally announced May 2018.

    Comments: Submitted to Reproducibility in ML Workshop, ICML'18

  28. arXiv:1709.08203  [pdf, ps, other

    cs.CV

    Survey of Recent Advances in Visual Question Answering

    Authors: Supriya Pandhre, Shagun Sodhani

    Abstract: Visual Question Answering (VQA) presents a unique challenge as it requires the ability to understand and encode the multi-modal inputs - in terms of image processing and natural language processing. The algorithm further needs to learn how to perform reasoning over this multi-modal representation so it can answer the questions correctly. This paper presents a survey of different approaches propose… ▽ More

    Submitted 24 September, 2017; originally announced September 2017.

    Comments: 7 pages, 2 tables

  29. arXiv:1709.05638  [pdf, other

    cs.AI

    Improving Search through A3C Reinforcement Learning based Conversational Agent

    Authors: Milan Aggarwal, Aarushi Arora, Shagun Sodhani, Balaji Krishnamurthy

    Abstract: We develop a reinforcement learning based search assistant which can assist users through a set of actions and sequence of interactions to enable them realize their intent. Our approach caters to subjective search where the user is seeking digital assets such as images which is fundamentally different from the tasks which have objective and limited search modalities. Labeled conversational data is… ▽ More

    Submitted 19 August, 2018; v1 submitted 17 September, 2017; originally announced September 2017.

    Comments: 17 pages, 7 figures

  30. arXiv:1512.04092  [pdf

    cs.CL cs.LG

    Stack Exchange Tagger

    Authors: Sanket Mehta, Shagun Sodhani

    Abstract: The goal of our project is to develop an accurate tagger for questions posted on Stack Exchange. Our problem is an instance of the more general problem of developing accurate classifiers for large scale text datasets. We are tackling the multilabel classification problem where each item (in this case, question) can belong to multiple classes (in this case, tags). We are predicting the tags (or key… ▽ More

    Submitted 13 December, 2015; originally announced December 2015.