Showing 1–10 of 10 results for author: Malviya, P

  1. arXiv:2405.15895

    cs.LG

    Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective

    Authors: Pranshu Malviya, Jerry Huang, Quentin Fournier, Sarath Chandar

    Abstract: The optimal model for a given task is often challenging to determine, requiring training multiple models from scratch, which becomes prohibitive as dataset and model sizes grow. A more efficient alternative is to reuse smaller pre-trained models by expanding them; however, this is not widely adopted, as how this impacts training dynamics remains poorly understood. While prior works have introduced s…

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2307.16704

    cs.LG cs.AI

    Lookbehind-SAM: k steps back, 1 step forward

    Authors: Gonçalo Mordido, Pranshu Malviya, Aristide Baratin, Sarath Chandar

    Abstract: Sharpness-aware minimization (SAM) methods have gained increasing popularity by formulating the problem of minimizing both loss value and loss sharpness as a minimax objective. In this work, we increase the efficiency of the maximization and minimization parts of SAM's objective to achieve a better loss-sharpness trade-off. By taking inspiration from the Lookahead optimizer, which uses multiple de…

    Submitted 16 May, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: ICML 2024

  3. arXiv:2307.09638

    cs.LG cs.AI

    Promoting Exploration in Memory-Augmented Adam using Critical Momenta

    Authors: Pranshu Malviya, Gonçalo Mordido, Aristide Baratin, Reza Babanezhad Harikandeh, Jerry Huang, Simon Lacoste-Julien, Razvan Pascanu, Sarath Chandar

    Abstract: Adaptive gradient-based optimizers, notably Adam, have left their mark in training large-scale deep learning models, offering fast convergence and robustness to hyperparameter settings. However, they often struggle with generalization, attributed to their tendency to converge to sharp minima in the loss landscape. To address this, we propose a new memory-augmented version of Adam that encourages e…

    Submitted 17 June, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Published in Transactions on Machine Learning Research

  4. arXiv:2209.01275

    cs.LG cs.AI

    Feature diversity in self-supervised learning

    Authors: Pranshu Malviya, Arjun Vaithilingam Sudhakar

    Abstract: Many studies on scaling laws consider basic factors such as model size, model shape, dataset size, and compute power. These factors are easily tunable and represent the fundamental elements of any machine learning setup. But researchers have also employed more complex factors to estimate the test error and generalization performance with high predictability. These factors are generally specific to…

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: Accepted at 1st Conference on Lifelong Learning Agents, 2022 - Workshop Track

  5. arXiv:2207.04354

    cs.LG cs.AI

    An Introduction to Lifelong Supervised Learning

    Authors: Shagun Sodhani, Mojtaba Faramarzi, Sanket Vaibhav Mehta, Pranshu Malviya, Mohamed Abdelsalam, Janarthanan Janarthanan, Sarath Chandar

    Abstract: This primer is an attempt to provide a detailed summary of the different facets of lifelong learning. We start with Chapter 2 which provides a high-level overview of lifelong learning systems. In this chapter, we discuss prominent scenarios in lifelong learning (Section 2.4), provide a high-level organization of different lifelong learning approaches (Section 2.5), enumerate the des…

    Submitted 12 July, 2022; v1 submitted 9 July, 2022; originally announced July 2022.

    Comments: Lifelong Learning Primer

  6. arXiv:2111.14348

    cs.LG cs.CY

    A Causal Approach for Unfair Edge Prioritization and Discrimination Removal

    Authors: Pavan Ravishankar, Pranshu Malviya, Balaraman Ravindran

    Abstract: In budget-constrained settings aimed at mitigating unfairness, like law enforcement, it is essential to prioritize the sources of unfairness before taking measures to mitigate them in the real world. Unlike previous works, which only serve as a caution against possible discrimination and de-bias data after data generation, this work provides a toolkit to mitigate unfairness during data generation,…

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: ACML 2021

  7. arXiv:2105.05155

    cs.LG

    TAG: Task-based Accumulated Gradients for Lifelong learning

    Authors: Pranshu Malviya, Balaraman Ravindran, Sarath Chandar

    Abstract: When an agent encounters a continual stream of new tasks in the lifelong learning setting, it leverages the knowledge it gained from the earlier tasks to help learn the new tasks better. In such a scenario, identifying an efficient knowledge representation becomes a challenging problem. Most research works propose to either store a subset of examples from the past tasks in a replay buffer, dedicat…

    Submitted 29 August, 2022; v1 submitted 11 May, 2021; originally announced May 2021.

    Comments: Published at 1st Conference on Lifelong Learning Agents, 2022

  8. arXiv:2105.04120

    cs.AI

    Fast constraint satisfaction problem and learning-based algorithm for solving Minesweeper

    Authors: Yash Pratyush Sinha, Pranshu Malviya, Rupaj Kumar Nayak

    Abstract: Minesweeper is a popular spatial-based decision-making game that works with incomplete information. As an exemplary NP-complete problem, it is a major area of research employing various artificial intelligence paradigms. The present work models this game as Constraint Satisfaction Problem (CSP) and Markov Decision Process (MDP). We propose a new method named as dependents from the independent set…

    Submitted 10 May, 2021; originally announced May 2021.

  9. arXiv:2007.05516

    cs.AI cs.CY cs.LG

    A Causal Linear Model to Quantify Edge Flow and Edge Unfairness for UnfairEdge Prioritization and Discrimination Removal

    Authors: Pavan Ravishankar, Pranshu Malviya, Balaraman Ravindran

    Abstract: Law enforcement must prioritize sources of unfairness before mitigating their underlying unfairness, considering that they have limited resources. Unlike previous works that only make cautionary claims of discrimination and de-bias data after its generation, this paper attempts to prioritize unfair sources before mitigating their unfairness in the real world. We assume that a causal Bayesian net…

    Submitted 11 March, 2021; v1 submitted 10 July, 2020; originally announced July 2020.

    Comments: Accepted in the Workshop on Law and Machine Learning, ICML 2020; First two authors contributed equally

  10. arXiv:1811.06437

    cs.LG cs.CY stat.ML

    Contextual Care Protocol using Neural Networks and Decision Trees

    Authors: Yash Pratyush Sinha, Pranshu Malviya, Minerva Panda, Syed Mohd Ali

    Abstract: A contextual care protocol is used by a medical practitioner for patient healthcare, given the context or situation that the specified patient is in. This paper proposes a method to build an automated self-adapting protocol which can help make relevant, early decisions for effective healthcare delivery. The hybrid model leverages neural networks and decision trees. The neural network estimates the…

    Submitted 15 November, 2018; originally announced November 2018.

    Journal ref: 2018 Second International Conference on Advances in Electronics, Computers and Communications (ICAECC)