Zum Hauptinhalt springen

Showing 1–7 of 7 results for author: Yamagata, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.18539  [pdf, other

    cs.LG eess.SY

    Safe and Robust Reinforcement Learning: Principles and Practice

    Authors: Taku Yamagata, Raul Santos-Rodriguez

    Abstract: Reinforcement Learning (RL) has shown remarkable success in solving relatively complex tasks, yet the deployment of RL systems in real-world scenarios poses significant challenges related to safety and robustness. This paper aims to identify and further understand those challenges thorough the exploration of the main dimensions of the safe and robust RL landscape, encompassing algorithmic, ethical… ▽ More

    Submitted 30 March, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

  2. arXiv:2402.06646  [pdf

    physics.ao-ph cs.LG physics.geo-ph

    Diffusion Model-based Probabilistic Downscaling for 180-year East Asian Climate Reconstruction

    Authors: Fenghua Ling, Zeyu Lu, Jing-Jia Luo, Lei Bai, Swadhin K. Behera, Dachao Jin, Baoxiang Pan, Huidong Jiang, Toshio Yamagata

    Abstract: As our planet is entering into the "global boiling" era, understanding regional climate change becomes imperative. Effective downscaling methods that provide localized insights are crucial for this target. Traditional approaches, including computationally-demanding regional dynamical models or statistical downscaling frameworks, are often susceptible to the influence of downscaling uncertainty. He… ▽ More

    Submitted 5 April, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  3. arXiv:2302.02706  [pdf, other

    cs.LG cs.HC

    When the Ground Truth is not True: Modelling Human Biases in Temporal Annotations

    Authors: Taku Yamagata, Emma L. Tonkin, Benjamin Arana Sanchez, Ian Craddock, Miquel Perello Nieto, Raul Santos-Rodriguez, Weisong Yang, Peter Flach

    Abstract: In supervised learning, low quality annotations lead to poorly performing classification and detection models, while also rendering evaluation unreliable. This is particularly apparent on temporal data, where annotation quality is affected by multiple factors. For example, in the post-hoc self-reporting of daily activities, cognitive biases are one of the most common ingredients. In particular, re… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

  4. arXiv:2209.03993  [pdf, other

    cs.LG

    Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL

    Authors: Taku Yamagata, Ahmed Khalil, Raul Santos-Rodriguez

    Abstract: Recent works have shown that tackling offline reinforcement learning (RL) with a conditional policy produces promising results. The Decision Transformer (DT) combines the conditional policy approach and a transformer architecture, showing competitive performance against several benchmarks. However, DT lacks stitching ability -- one of the critical abilities for offline RL to learn the optimal poli… ▽ More

    Submitted 25 May, 2023; v1 submitted 8 September, 2022; originally announced September 2022.

  5. arXiv:2111.08596  [pdf, other

    cs.LG

    Reinforcement Learning with Feedback from Multiple Humans with Diverse Skills

    Authors: Taku Yamagata, Ryan McConville, Raul Santos-Rodriguez

    Abstract: A promising approach to improve the robustness and exploration in Reinforcement Learning is collecting human feedback and that way incorporating prior knowledge of the target environment. It is, however, often too expensive to obtain enough feedback of good quality. To mitigate the issue, we aim to rely on a group of multiple experts (and non-experts) with different skill levels to generate enough… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: Accepted NeurIPS 2021 Workshop on Safe and Robust Control of Uncertain Systems. arXiv admin note: text overlap with arXiv:1908.06134

  6. arXiv:2010.06266  [pdf, other

    cs.LG

    Model-Based Reinforcement Learning for Type 1Diabetes Blood Glucose Control

    Authors: Taku Yamagata, Aisling O'Kane, Amid Ayobi, Dmitri Katz, Katarzyna Stawarz, Paul Marshall, Peter Flach, Raúl Santos-Rodríguez

    Abstract: In this paper we investigate the use of model-based reinforcement learning to assist people with Type 1 Diabetes with insulin dose decisions. The proposed architecture consists of multiple Echo State Networks to predict blood glucose levels combined with Model Predictive Controller for planning. Echo State Network is a version of recurrent neural networks which allows us to learn long term depende… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: Presented at ECAI 2020 SP4HC Workshop

  7. arXiv:1908.06134  [pdf, other

    cs.LG stat.ML

    Online Feature Selection for Activity Recognition using Reinforcement Learning with Multiple Feedback

    Authors: Taku Yamagata, Raúl Santos-Rodríguez, Ryan McConville, Atis Elsts

    Abstract: Recent advances in both machine learning and Internet-of-Things have attracted attention to automatic Activity Recognition, where users wear a device with sensors and their outputs are mapped to a predefined set of activities. However, few studies have considered the balance between wearable power consumption and activity recognition accuracy. This is particularly important when part of the comput… ▽ More

    Submitted 16 August, 2019; originally announced August 2019.