Zum Hauptinhalt springen

Showing 1–16 of 16 results for author: Turchetta, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03932  [pdf, other

    cs.LG

    Breeding Programs Optimization with Reinforcement Learning

    Authors: Omar G. Younis, Luca Corinzia, Ioannis N. Athanasiadis, Andreas Krause, Joachim M. Buhmann, Matteo Turchetta

    Abstract: Crop breeding is crucial in improving agricultural productivity while potentially decreasing land usage, greenhouse gas emissions, and water consumption. However, breeding programs are challenging due to long turnover times, high-dimensional decision spaces, long-term objectives, and the need to adapt to rapid climate change. This paper introduces the use of Reinforcement Learning (RL) to optimize… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning

  2. arXiv:2402.06562  [pdf, other

    eess.SY cs.LG cs.RO math.OC

    Safe Guaranteed Exploration for Non-linear Systems

    Authors: Manish Prajapat, Johannes Köhler, Matteo Turchetta, Andreas Krause, Melanie N. Zeilinger

    Abstract: Safely exploring environments with a-priori unknown constraints is a fundamental challenge that restricts the autonomy of robots. While safety is paramount, guarantees on sufficient exploration are also crucial for ensuring autonomous task completion. To address these challenges, we propose a novel safe guaranteed exploration framework using optimal control, which achieves first-of-its-kind result… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  3. arXiv:2210.06380  [pdf, other

    cs.LG cs.AI cs.MA cs.RO math.OC

    Near-Optimal Multi-Agent Learning for Safe Coverage Control

    Authors: Manish Prajapat, Matteo Turchetta, Melanie N. Zeilinger, Andreas Krause

    Abstract: In multi-agent coverage control problems, agents navigate their environment to reach locations that maximize the coverage of some density. In practice, the density is rarely known $\textit{a priori}$, further complicating the original NP-hard problem. Moreover, in many applications, agents cannot visit arbitrary locations due to $\textit{a priori}$ unknown safety constraints. In this paper, we aim… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted at NeurIPS 2022

  4. GoSafeOpt: Scalable Safe Exploration for Global Optimization of Dynamical Systems

    Authors: Bhavya Sukhija, Matteo Turchetta, David Lindner, Andreas Krause, Sebastian Trimpe, Dominik Baumann

    Abstract: Learning optimal control policies directly on physical systems is challenging since even a single failure can lead to costly hardware damage. Most existing model-free learning methods that guarantee safety, i.e., no failures, during exploration are limited to local optima. A notable exception is the GoSafe algorithm, which, unfortunately, cannot handle high-dimensional systems and hence cannot be… ▽ More

    Submitted 12 June, 2023; v1 submitted 24 January, 2022; originally announced January 2022.

    Journal ref: Artificial Intelligence, Volume 320, Year 2023

  5. arXiv:2105.13281  [pdf, other

    cs.RO cs.LG eess.SY

    GoSafe: Globally Optimal Safe Robot Learning

    Authors: Dominik Baumann, Alonso Marco, Matteo Turchetta, Sebastian Trimpe

    Abstract: When learning policies for robotic systems from data, safety is a major concern, as violation of safety constraints may cause hardware damage. SafeOpt is an efficient Bayesian optimization (BO) algorithm that can learn policies while guaranteeing safety with high probability. However, its search space is limited to an initially given safe region. We extend this method by exploring outside the init… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

  6. arXiv:2102.12466  [pdf, other

    cs.LG

    Information Directed Reward Learning for Reinforcement Learning

    Authors: David Lindner, Matteo Turchetta, Sebastian Tschiatschek, Kamil Ciosek, Andreas Krause

    Abstract: For many reinforcement learning (RL) applications, specifying a reward is difficult. This paper considers an RL setting where the agent obtains information about the reward only by querying an expert that can, for example, evaluate individual states or provide binary preferences over trajectories. From such expensive feedback, we aim to learn a model of the reward that allows standard RL algorithm… ▽ More

    Submitted 31 January, 2022; v1 submitted 24 February, 2021; originally announced February 2021.

    Comments: Presented at Conference on Neural Information Processing Systems (NeurIPS), 2021

  7. arXiv:2101.07825  [pdf, other

    eess.SY cs.AI cs.LG cs.RO

    Safe and Efficient Model-free Adaptive Control via Bayesian Optimization

    Authors: Christopher König, Matteo Turchetta, John Lygeros, Alisa Rupenyan, Andreas Krause

    Abstract: Adaptive control approaches yield high-performance controllers when a precise system model or suitable parametrizations of the controller are available. Existing data-driven approaches for adaptive control mostly augment standard model-based methods with additional information about uncertainties in the dynamics or about disturbances. In this work, we propose a purely data-driven, model-free appro… ▽ More

    Submitted 2 March, 2021; v1 submitted 19 January, 2021; originally announced January 2021.

  8. arXiv:2006.12136  [pdf, other

    cs.LG cs.AI cs.RO

    Safe Reinforcement Learning via Curriculum Induction

    Authors: Matteo Turchetta, Andrey Kolobov, Shital Shah, Andreas Krause, Alekh Agarwal

    Abstract: In safety-critical applications, autonomous agents may need to learn in an environment where mistakes can be very costly. In such settings, the agent needs to behave safely not only after but also while learning. To achieve this, existing safe reinforcement learning methods make an agent rely on priors that let it avoid dangerous situations during exploration with high probability, but both the pr… ▽ More

    Submitted 21 January, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

  9. arXiv:1910.13726  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Safe Exploration for Interactive Machine Learning

    Authors: Matteo Turchetta, Felix Berkenkamp, Andreas Krause

    Abstract: In Interactive Machine Learning (IML), we iteratively make decisions and obtain noisy observations of an unknown function. While IML methods, e.g., Bayesian optimization and active learning, have been successful in applications, on real-world systems they must provably avoid unsafe decisions. To this end, safe IML algorithms must carefully learn about a priori unknown constraints without making un… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: Accepted at NeurIPS 2019

  10. arXiv:1910.13399  [pdf, other

    cs.RO cs.AI cs.LG

    Robust Model-free Reinforcement Learning with Multi-objective Bayesian Optimization

    Authors: Matteo Turchetta, Andreas Krause, Sebastian Trimpe

    Abstract: In reinforcement learning (RL), an autonomous agent learns to perform complex tasks by maximizing an exogenous reward signal while interacting with its environment. In real-world applications, test conditions may differ substantially from the training scenario and, therefore, focusing on pure reward maximization during training may lead to poor results at test time. In these cases, it is important… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: Submitted to IEEE Conference on Robotics and Automation 2020 (ICRA)

  11. Mixed-Variable Bayesian Optimization

    Authors: Erik Daxberger, Anastasia Makarova, Matteo Turchetta, Andreas Krause

    Abstract: The optimization of expensive to evaluate, black-box, mixed-variable functions, i.e. functions that have continuous and discrete inputs, is a difficult and yet pervasive problem in science and engineering. In Bayesian optimization (BO), special cases of this problem that consider fully continuous or fully discrete domains have been widely studied. However, few methods exist for mixed-variable doma… ▽ More

    Submitted 4 August, 2020; v1 submitted 2 July, 2019; originally announced July 2019.

    Comments: IJCAI 2020 camera-ready; 17 pages, extended version with supplementary material

    Journal ref: Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI-20), 2020, pages 2633-2639

  12. arXiv:1906.12189  [pdf, other

    eess.SY cs.AI cs.LG

    Learning-based Model Predictive Control for Safe Exploration and Reinforcement Learning

    Authors: Torsten Koller, Felix Berkenkamp, Matteo Turchetta, Joschka Boedecker, Andreas Krause

    Abstract: Reinforcement learning has been successfully used to solve difficult tasks in complex unknown environments. However, these methods typically do not provide any safety guarantees during the learning process. This is particularly problematic, since reinforcement learning agent actively explore their environment. This prevents their use in safety-critical, real-world applications. In this paper, we p… ▽ More

    Submitted 27 June, 2019; originally announced June 2019.

    Comments: 14 pages, 7 figures. arXiv admin note: substantial text overlap with arXiv:1803.08287

  13. arXiv:1805.07095  [pdf, other

    cs.RO

    Reinforced Imitation: Sample Efficient Deep Reinforcement Learning for Map-less Navigation by Leveraging Prior Demonstrations

    Authors: Mark Pfeiffer, Samarth Shukla, Matteo Turchetta, Cesar Cadena, Andreas Krause, Roland Siegwart, Juan Nieto

    Abstract: This work presents a case study of a learning-based approach for target driven map-less navigation. The underlying navigation model is an end-to-end neural network which is trained using a combination of expert demonstrations, imitation learning (IL) and reinforcement learning (RL). While RL and IL suffer from a large sample complexity and the distribution mismatch problem, respectively, we show t… ▽ More

    Submitted 31 August, 2018; v1 submitted 18 May, 2018; originally announced May 2018.

    Comments: 8 pages, submitted for publication in the IEEE Robotics and Automation Letters

  14. arXiv:1803.08287  [pdf, other

    eess.SY cs.AI cs.LG cs.RO

    Learning-based Model Predictive Control for Safe Exploration

    Authors: Torsten Koller, Felix Berkenkamp, Matteo Turchetta, Andreas Krause

    Abstract: Learning-based methods have been successful in solving complex control tasks without significant prior knowledge about the system. However, these methods typically do not provide any safety guarantees, which prevents their use in safety-critical, real-world applications. In this paper, we present a learning-based model predictive control scheme that can provide provable high-probability safety gua… ▽ More

    Submitted 7 November, 2018; v1 submitted 22 March, 2018; originally announced March 2018.

    Comments: Proc. of the Conference on Decision and Control, 2018

  15. arXiv:1705.08551  [pdf, other

    stat.ML cs.AI cs.LG eess.SY

    Safe Model-based Reinforcement Learning with Stability Guarantees

    Authors: Felix Berkenkamp, Matteo Turchetta, Angela P. Schoellig, Andreas Krause

    Abstract: Reinforcement learning is a powerful paradigm for learning optimal policies from experimental data. However, to find optimal policies, most reinforcement learning algorithms explore all possible actions, which may be harmful for real-world systems. As a consequence, learning algorithms are rarely applied on safety-critical systems in the real world. In this paper, we present a learning algorithm t… ▽ More

    Submitted 13 November, 2017; v1 submitted 23 May, 2017; originally announced May 2017.

    Comments: Proc. of Neural Information Processing Systems (NIPS), 2017

  16. arXiv:1606.04753  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Safe Exploration in Finite Markov Decision Processes with Gaussian Processes

    Authors: Matteo Turchetta, Felix Berkenkamp, Andreas Krause

    Abstract: In classical reinforcement learning, when exploring an environment, agents accept arbitrary short term loss for long term gain. This is infeasible for safety critical applications, such as robotics, where even a single unsafe action may cause system failure. In this paper, we address the problem of safely exploring finite Markov decision processes (MDP). We define safety in terms of an, a priori u… ▽ More

    Submitted 15 November, 2016; v1 submitted 15 June, 2016; originally announced June 2016.

    Comments: 15 pages, extended version with proofs

    Journal ref: Proc. of Advances in Neural Information Processing Systems (NIPS), 2016, pp. 4305-4313