Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Kinose, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.03610  [pdf, other

    cs.LG cs.AI cs.CL

    RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents

    Authors: Tomoyuki Kagaya, Thong Jing Yuan, Yuxuan Lou, Jayashree Karlekar, Sugiri Pranata, Akira Kinose, Koki Oguri, Felix Wick, Yang You

    Abstract: Owing to recent advancements, Large Language Models (LLMs) can now be deployed as agents for increasingly complex decision-making applications in areas including robotics, gaming, and API integration. However, reflecting past experiences in current decision-making processes, an innate human behavior, continues to pose significant challenges. Addressing this, we propose Retrieval-Augmented Planning… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  2. arXiv:2203.11024  [pdf, other

    cs.AI cs.RO eess.SY

    Multi-View Dreaming: Multi-View World Model with Contrastive Learning

    Authors: Akira Kinose, Masashi Okada, Ryo Okumura, Tadahiro Taniguchi

    Abstract: In this paper, we propose Multi-View Dreaming, a novel reinforcement learning agent for integrated recognition and control from multi-view observations by extending Dreaming. Most current reinforcement learning method assumes a single-view observation space, and this imposes limitations on the observed data, such as lack of spatial information and occlusions. This makes obtaining ideal observation… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: 7 pages, 8 figures

  3. Integration of Imitation Learning using GAIL and Reinforcement Learning using Task-achievement Rewards via Probabilistic Graphical Model

    Authors: Akira Kinose, Tadahiro Taniguchi

    Abstract: Integration of reinforcement learning and imitation learning is an important problem that has been studied for a long time in the field of intelligent robotics. Reinforcement learning optimizes policies to maximize the cumulative reward, whereas imitation learning attempts to extract general knowledge about the trajectories demonstrated by experts, i.e., demonstrators. Because each of them has the… ▽ More

    Submitted 16 October, 2019; v1 submitted 3 July, 2019; originally announced July 2019.

    Comments: Submitted to Advanced Robotics

    Journal ref: Advanced Robotics, 2020, 34:16, 1055-1067