Zum Hauptinhalt springen

Showing 1–9 of 9 results for author: Ganguly, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16386  [pdf, other

    cs.LG cs.AI

    Variational Offline Multi-agent Skill Discovery

    Authors: Jiayu Chen, Bhargav Ganguly, Tian Lan, Vaneet Aggarwal

    Abstract: Skills are effective temporal abstractions established for sequential decision making tasks, which enable efficient hierarchical learning for long-horizon tasks and facilitate multi-task learning through their transferability. Despite extensive research, research gaps remain in multi-agent scenarios, particularly for automatically extracting subgroup coordination patterns in a multi-agent task. In… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  2. arXiv:2402.13777  [pdf, other

    cs.LG cs.AI

    Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions

    Authors: Jiayu Chen, Bhargav Ganguly, Yang Xu, Yongsheng Mei, Tian Lan, Vaneet Aggarwal

    Abstract: Deep generative models (DGMs) have demonstrated great success across various domains, particularly in generating texts, images, and videos using models trained from offline data. Similarly, data-driven decision-making and robotic control also necessitate learning a generator function from the offline data to serve as the strategy or policy. In this case, applying deep generative models in offline… ▽ More

    Submitted 25 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: We restructured the paper and added more discussion

  3. arXiv:2310.11684  [pdf, other

    cs.LG cs.AI quant-ph

    Quantum Speedups in Regret Analysis of Infinite Horizon Average-Reward Markov Decision Processes

    Authors: Bhargav Ganguly, Yang Xu, Vaneet Aggarwal

    Abstract: This paper investigates the potential of quantum acceleration in addressing infinite horizon Markov Decision Processes (MDPs) to enhance average reward outcomes. We introduce an innovative quantum framework for the agent's engagement with an unknown MDP, extending the conventional interaction paradigm. Our approach involves the design of an optimism-driven tabular Reinforcement Learning algorithm… ▽ More

    Submitted 28 April, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

  4. arXiv:2302.08617  [pdf, other

    cs.LG cs.AI eess.SY quant-ph stat.ML

    Quantum Computing Provides Exponential Regret Improvement in Episodic Reinforcement Learning

    Authors: Bhargav Ganguly, Yulian Wu, Di Wang, Vaneet Aggarwal

    Abstract: In this paper, we investigate the problem of \textit{episodic reinforcement learning} with quantum oracles for state evolution. To this end, we propose an \textit{Upper Confidence Bound} (UCB) based quantum algorithmic framework to facilitate learning of a finite-horizon MDP. Our quantum algorithm achieves an exponential improvement in regret as compared to the classical counterparts, achieving a… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

  5. arXiv:2211.12578  [pdf, other

    cs.LG cs.AI eess.SY stat.ML

    Online Federated Learning via Non-Stationary Detection and Adaptation amidst Concept Drift

    Authors: Bhargav Ganguly, Vaneet Aggarwal

    Abstract: Federated Learning (FL) is an emerging domain in the broader context of artificial intelligence research. Methodologies pertaining to FL assume distributed model training, consisting of a collection of clients and a server, with the main goal of achieving optimal global model with restrictions on data sharing due to privacy concerns. It is worth highlighting that the diverse existing literature in… ▽ More

    Submitted 6 May, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

  6. Multi-Edge Server-Assisted Dynamic Federated Learning with an Optimized Floating Aggregation Point

    Authors: Bhargav Ganguly, Seyyedali Hosseinalipour, Kwang Taik Kim, Christopher G. Brinton, Vaneet Aggarwal, David J. Love, Mung Chiang

    Abstract: We propose cooperative edge-assisted dynamic federated learning (CE-FL). CE-FL introduces a distributed machine learning (ML) architecture, where data collection is carried out at the end devices, while the model training is conducted cooperatively at the end devices and the edge servers, enabled via data offloading from the end devices to the edge servers through base stations. CE-FL also introdu… ▽ More

    Submitted 22 October, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Journal ref: Published in IEEE/ACM Transactions on Networking, 2023

  7. arXiv:2102.10740  [pdf, other

    cs.LG cs.AI cs.MA

    Communication Efficient Parallel Reinforcement Learning

    Authors: Mridul Agarwal, Bhargav Ganguly, Vaneet Aggarwal

    Abstract: We consider the problem where $M$ agents interact with $M$ identical and independent environments with $S$ states and $A$ actions using reinforcement learning for $T$ rounds. The agents share their data with a central server to minimize their regret. We aim to find an algorithm that allows the agents to minimize the regret with infrequent communication rounds. We provide \NAM\ which runs at each a… ▽ More

    Submitted 21 February, 2021; originally announced February 2021.

  8. arXiv:1812.11316  [pdf

    cs.RO

    Microcontroller Based Robotic Arm Development for Library Management System

    Authors: Bodhisatwa Barma, Samrat Ghosh, Abhrodip Chaudhury, Biswarup Ganguly

    Abstract: With the advancement of robotics, automation in various industries and processes has become widespread. This project aims to introduce library automation system, which addresses the fulfillment of the objectives of automatic retrieval of queued books, arrangement of returned books on the racks as well as automated updating of the library database. The proposed system is based on the Arduino microc… ▽ More

    Submitted 29 December, 2018; originally announced December 2018.

    Comments: Accepted in 2nd International Conference on Computational Advancement in Communication circuit and System (ICCACCS-2018). Best Paper award in "Poster Presentation Category"

  9. arXiv:1202.0862  [pdf, other

    math.CO cs.AI

    e-Valuate: A Two-player Game on Arithmetic Expressions -- An Update

    Authors: Sarang Aravamuthan, Biswajit Ganguly

    Abstract: e-Valuate is a game on arithmetic expressions. The players have contrasting roles of maximizing and minimizing the given expression. The maximizer proposes values and the minimizer substitutes them for variables of his choice. When the expression is fully instantiated, its value is compared with a certain minimax value that would result if the players played to their optimal strategies. The winner… ▽ More

    Submitted 12 September, 2014; v1 submitted 3 February, 2012; originally announced February 2012.

    Comments: 18 pages, 3 figures

    MSC Class: 91A05; 91A43 (Primary); 91A46 (Secondary)