Zum Hauptinhalt springen

Showing 1–10 of 10 results for author: Lekkala, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.12339  [pdf, other

    cs.LG cs.RO

    Value Explicit Pretraining for Learning Transferable Representations

    Authors: Kiran Lekkala, Henghui Bao, Sumedh Sontakke, Laurent Itti

    Abstract: We propose Value Explicit Pretraining (VEP), a method that learns generalizable representations for transfer reinforcement learning. VEP enables learning of new tasks that share similar objectives as previously learned tasks, by learning an encoder for objective-conditioned representations, irrespective of appearance changes and environment dynamics. To pre-train the encoder from a sequence of obs… ▽ More

    Submitted 7 March, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted at CoRL 2023 Workshop on PRL, Under Review at ICML 2024

  2. arXiv:2311.13648  [pdf, other

    cs.LG

    Evaluating Pretrained models for Deployable Lifelong Learning

    Authors: Kiran Lekkala, Eshan Bhargava, Yunhao Ge, Laurent Itti

    Abstract: We create a novel benchmark for evaluating a Deployable Lifelong Learning system for Visual Reinforcement Learning (RL) that is pretrained on a curated dataset, and propose a novel Scalable Lifelong Learning system capable of retaining knowledge from the previously learnt RL tasks. Our benchmark measures the efficacy of a deployable Lifelong Learning system that is evaluated on scalability, perfor… ▽ More

    Submitted 17 December, 2023; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: In submission to CoLLA 2024. Also published in the Proceedings of WACV 2024 Workshop on Pretraining

  3. arXiv:2310.18847  [pdf, other

    cs.RO cs.LG

    Bird's Eye View Based Pretrained World model for Visual Navigation

    Authors: Kiran Lekkala, Chen Liu, Laurent Itti

    Abstract: Sim2Real transfer has gained popularity because it helps transfer from inexpensive simulators to real world. This paper presents a novel system that fuses components in a traditional World Model into a robust system, trained entirely within a simulator, that Zero-Shot transfers to the real world. To facilitate transfer, we use an intermediary representation that is based on \textit{Bird's Eye View… ▽ More

    Submitted 22 March, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: Under Review at the IROS 2024; Accepted at NeurIPS 2023, Robot Learning Workshop

  4. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  5. arXiv:2305.15591  [pdf, other

    cs.LG

    Lightweight Learner for Shared Knowledge Lifelong Learning

    Authors: Yunhao Ge, Yuecheng Li, Di Wu, Ao Xu, Adam M. Jones, Amanda Sofie Rios, Iordanis Fostiropoulos, Shixian Wen, Po-Hsuan Huang, Zachary William Murdock, Gozde Sahin, Shuo Ni, Kiran Lekkala, Sumedh Anand Sontakke, Laurent Itti

    Abstract: In Lifelong Learning (LL), agents continually learn as they encounter new conditions and tasks. Most current LL is limited to a single agent that learns tasks sequentially. Dedicated LL machinery is then deployed to mitigate the forgetting of old tasks as new tasks are learned. This is inherently slow. We propose a new Shared Knowledge Lifelong Learning (SKILL) challenge, which deploys a decentral… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Transactions on Machine Learning Research (TMLR) paper

  6. arXiv:2212.00089  [pdf, other

    cs.AR cs.ET

    Ferroelectric FET based Context-Switching FPGA Enabling Dynamic Reconfiguration for Adaptive Deep Learning Machines

    Authors: Yixin Xu, Zijian Zhao, Yi Xiao, Tongguang Yu, Halid Mulaosmanovic, Dominik Kleimaier, Stefan Duenkel, Sven Beyer, Xiao Gong, Rajiv Joshi, X. Sharon Hu, Shixian Wen, Amanda Sofie Rios, Kiran Lekkala, Laurent Itti, Eric Homan, Sumitha George, Vijaykrishnan Narayanan, Kai Ni

    Abstract: Field Programmable Gate Array (FPGA) is widely used in acceleration of deep learning applications because of its reconfigurability, flexibility, and fast time-to-market. However, conventional FPGA suffers from the tradeoff between chip area and reconfiguration latency, making efficient FPGA accelerations that require switching between multiple configurations still elusive. In this paper, we perfor… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

    Comments: 54 pages, 15 figures

  7. arXiv:2201.08098  [pdf, other

    cs.CV

    What can we learn from misclassified ImageNet images?

    Authors: Shixian Wen, Amanda Sofie Rios, Kiran Lekkala, Laurent Itti

    Abstract: Understanding the patterns of misclassified ImageNet images is particularly important, as it could guide us to design deep neural networks (DNN) that generalize better. However, the richness of ImageNet imposes difficulties for researchers to visually find any useful patterns of misclassification. Here, to help find these patterns, we propose "Superclassing ImageNet dataset". It is a subset of Ima… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

  8. arXiv:2105.14639  [pdf, other

    cs.RO cs.LG cs.NE

    Shaped Policy Search for Evolutionary Strategies using Waypoints

    Authors: Kiran Lekkala, Laurent Itti

    Abstract: In this paper, we try to improve exploration in Blackbox methods, particularly Evolution strategies (ES), when applied to Reinforcement Learning (RL) problems where intermediate waypoints/subgoals are available. Since Evolutionary strategies are highly parallelizable, instead of extracting just a scalar cumulative reward, we use the state-action pairs from the trajectories obtained during rollouts… ▽ More

    Submitted 3 July, 2023; v1 submitted 30 May, 2021; originally announced May 2021.

    Comments: Presented at the International Conference on Robotics and Automation (ICRA) 2021

  9. arXiv:2006.07438  [pdf, other

    cs.LG stat.ML

    Attentive Feature Reuse for Multi Task Meta learning

    Authors: Kiran Lekkala, Laurent Itti

    Abstract: We develop new algorithms for simultaneous learning of multiple tasks (e.g., image classification, depth estimation), and for adapting to unseen task/domain distributions within those high-level tasks (e.g., different environments). First, we learn common representations underlying all tasks. We then propose an attention mechanism to dynamically specialize the network, at runtime, for each task. O… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

  10. arXiv:1911.10322  [pdf, other

    cs.LG cs.AI stat.ML

    Meta Adaptation using Importance Weighted Demonstrations

    Authors: Kiran Lekkala, Sami Abu-El-Haija, Laurent Itti

    Abstract: Imitation learning has gained immense popularity because of its high sample-efficiency. However, in real-world scenarios, where the trajectory distribution of most of the tasks dynamically shifts, model fitting on continuously aggregated data alone would be futile. In some cases, the distribution shifts, so much, that it is difficult for an agent to infer the new task. We propose a novel algorithm… ▽ More

    Submitted 3 July, 2023; v1 submitted 23 November, 2019; originally announced November 2019.