Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Katdare, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.01605  [pdf, other

    cs.LG cs.AI stat.ML

    Towards Provable Log Density Policy Gradient

    Authors: Pulkit Katdare, Anant Joshi, Katherine Driggs-Campbell

    Abstract: Policy gradient methods are a vital ingredient behind the success of modern reinforcement learning. Modern policy gradient methods, although successful, introduce a residual error in gradient estimation. In this work, we argue that this residual term is significant and correcting for it could potentially improve sample-complexity of reinforcement learning methods. To that end, we propose log densi… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  2. arXiv:2309.01807  [pdf, other

    cs.LG cs.AI cs.RO

    Marginalized Importance Sampling for Off-Environment Policy Evaluation

    Authors: Pulkit Katdare, Nan Jiang, Katherine Driggs-Campbell

    Abstract: Reinforcement Learning (RL) methods are typically sample-inefficient, making it challenging to train and deploy RL-policies in real world robots. Even a robust policy trained in simulation requires a real-world deployment to assess their performance. This paper proposes a new approach to evaluate the real-world performance of agent policies prior to deploying them in the real world. Our approach i… ▽ More

    Submitted 4 October, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

  3. arXiv:2305.09900  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Efficient Equivariant Transfer Learning from Pretrained Models

    Authors: Sourya Basu, Pulkit Katdare, Prasanna Sattigeri, Vijil Chenthamarakshan, Katherine Driggs-Campbell, Payel Das, Lav R. Varshney

    Abstract: Efficient transfer learning algorithms are key to the success of foundation models on diverse downstream tasks even with limited data. Recent works of Basu et al. (2023) and Kaba et al. (2022) propose group averaging (equitune) and optimization-based methods, respectively, over features from group-transformed inputs to obtain equivariant outputs from non-equivariant neural networks. While Kaba et… ▽ More

    Submitted 10 October, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Journal ref: NeurIPS 2023

  4. arXiv:2112.11532  [pdf, ps, other

    cs.RO cs.LG

    Off Environment Evaluation Using Convex Risk Minimization

    Authors: Pulkit Katdare, Shuijing Liu, Katherine Driggs-Campbell

    Abstract: Applying reinforcement learning (RL) methods on robots typically involves training a policy in simulation and deploying it on a robot in the real world. Because of the model mismatch between the real world and the simulator, RL agents deployed in this manner tend to perform suboptimally. To tackle this problem, researchers have developed robust policy learning algorithms that rely on synthetic noi… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: 7 pages, 3 figures (with sub-figures)