Showing 1–50 of 69 results for author: Hirche, S

  1. arXiv:2408.01258  [pdf, other]

    cs.RO

    Jacta: A Versatile Planner for Learning Dexterous and Whole-body Manipulation

    Authors: Jan Brüdigam, Ali-Adeeb Abbas, Maks Sorokin, Kuan Fang, Brandon Hung, Maya Guru, Stefan Sosnowski, Jiuguang Wang, Sandra Hirche, Simon Le Cleac'h

    Abstract: Robotic manipulation is challenging due to discontinuous dynamics, as well as high-dimensional state and action spaces. Data-driven approaches that succeed in manipulation tasks require large amounts of data and expert demonstrations, typically from humans. Existing manipulation planners are restricted to specific systems and often depend on specialized algorithms for using demonstration. Therefor…

    Submitted 2 August, 2024; originally announced August 2024.

  2. arXiv:2407.20156  [pdf, other]

    cs.RO

    Autonomous and Teleoperation Control of a Drawing Robot Avatar

    Authors: Lingyun Chen, Abdeldjallil Naceri, Abdalla Swikir, Sandra Hirche, Sami Haddadin

    Abstract: A drawing robot avatar is a robotic system that allows for telepresence-based drawing, enabling users to remotely control a robotic arm and create drawings in real-time from a remote location. The proposed control framework aims to improve bimanual robot telepresence quality by reducing the user workload and required prior knowledge through the automation of secondary or auxiliary tasks. The intro…

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: Accepted to ICRA 2024

  3. arXiv:2407.16407  [pdf, other]

    math.OC cs.LG eess.SY stat.ML

    Data-Driven Optimal Feedback Laws via Kernel Mean Embeddings

    Authors: Petar Bevanda, Nicolas Hoischen, Stefan Sosnowski, Sandra Hirche, Boris Houska

    Abstract: This paper proposes a fully data-driven approach for optimal control of nonlinear control-affine systems represented by a stochastic diffusion. The focus is on the scenario where both the nonlinear dynamics and stage cost functions are unknown, while only a control penalty function and constraints are provided. Leveraging the theory of reproducing kernel Hilbert spaces, we introduce novel kernel mea…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: author-submitted electronic preprint version: 16 pages, 3 figures, 4 tables

  4. arXiv:2405.08756  [pdf, other]

    eess.SY cs.LG

    Stable Inverse Reinforcement Learning: Policies from Control Lyapunov Landscapes

    Authors: Samuel Tesfazgi, Leonhard Sprandl, Armin Lederer, Sandra Hirche

    Abstract: Learning from expert demonstrations to flexibly program an autonomous system with complex behaviors or to predict an agent's behavior is a powerful tool, especially in collaborative control settings. A common method to solve this problem is inverse reinforcement learning (IRL), where the observed agent, e.g., a human demonstrator, is assumed to behave according to the optimization of an intrinsic…

    Submitted 14 May, 2024; originally announced May 2024.

  5. arXiv:2405.08711  [pdf, other]

    cs.RO cs.LG eess.SY

    Data-driven Force Observer for Human-Robot Interaction with Series Elastic Actuators using Gaussian Processes

    Authors: Samuel Tesfazgi, Markus Keßler, Emilio Trigili, Armin Lederer, Sandra Hirche

    Abstract: Ensuring safety and adapting to the user's behavior are of paramount importance in physical human-robot interaction. Thus, incorporating elastic actuators in the robot's mechanical design has become popular, since it offers intrinsic compliance and additionally provides a coarse estimate for the interaction force by measuring the deformation of the elastic components. While observer-based methods h…

    Submitted 14 May, 2024; originally announced May 2024.

  6. arXiv:2405.07312  [pdf, other]

    eess.SY cs.LG

    Nonparametric Control-Koopman Operator Learning: Flexible and Scalable Models for Prediction and Control

    Authors: Petar Bevanda, Bas Driessen, Lucian Cristian Iacob, Roland Toth, Stefan Sosnowski, Sandra Hirche

    Abstract: Linearity of Koopman operators and simplicity of their estimators coupled with model-reduction capabilities have led to their great popularity in applications for learning dynamical systems. While nonparametric Koopman operator learning in infinite-dimensional reproducing kernel Hilbert spaces is well understood for autonomous systems, its control system analogues are largely unexplored. Addressin…

    Submitted 12 May, 2024; originally announced May 2024.

  7. arXiv:2404.11760  [pdf, other]

    cs.LG

    Predictive Model Development to Identify Failed Healing in Patients after Non-Union Fracture Surgery

    Authors: Cedric Donié, Marie K. Reumann, Tony Hartung, Benedikt J. Braun, Tina Histing, Satoshi Endo, Sandra Hirche

    Abstract: Bone non-union is among the most severe complications associated with trauma surgery, occurring in 10-30% of cases after long bone fractures. Treating non-unions requires a high level of surgical expertise and often involves multiple revision surgeries, sometimes even leading to amputation. Thus, more accurate prognosis is crucial for patient well-being. Recent advances in machine learning (ML) ho…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: To be presented at the 46th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC 2024)

    ACM Class: J.3; I.5.4

  8. arXiv:2404.02988  [pdf, other]

    eess.SY cs.LG

    Risk-averse Learning with Non-Stationary Distributions

    Authors: Siyi Wang, Zifan Wang, Xinlei Yi, Michael M. Zavlanos, Karl H. Johansson, Sandra Hirche

    Abstract: Considering non-stationary environments in online optimization enables a decision-maker to adapt effectively to changes and improve its performance over time. In such cases, it is favorable to adopt a strategy that minimizes the negative impact of change to avoid potentially risky situations. In this paper, we investigate risk-averse online optimization where the distribution of the random cost chan…

    Submitted 3 April, 2024; originally announced April 2024.

  9. arXiv:2403.11932  [pdf, ps, other]

    cs.IT math.OC

    Consistency of Value of Information: Effects of Packet Loss and Time Delay in Networked Control Systems Tasks

    Authors: Touraj Soleymani, John S. Baras, Siyi Wang, Sandra Hirche, Karl H. Johansson

    Abstract: In this chapter, we study the consistency of the value of information, a semantic metric that claims to determine the right piece of information in networked control systems tasks, in a lossy and delayed communication regime. Our analysis begins with a focus on state estimation, and subsequently extends to feedback control. To that end, we make a causal tradeoff betwe…

    Submitted 18 March, 2024; originally announced March 2024.

  10. arXiv:2403.11927  [pdf, ps, other]

    cs.IT math.OC

    Foundations of Value of Information: A Semantic Metric for Networked Control Systems Tasks

    Authors: Touraj Soleymani, John S. Baras, Sandra Hirche, Karl H. Johansson

    Abstract: In this chapter, we present our recent invention, i.e., the notion of the value of information, a semantic metric that is fundamental for networked control systems tasks. We begin our analysis by formulating a causal tradeoff between the packet rate and the regulation cost, with an encoder and a decoder as two distributed decision makers, and show that the valuation of information i…

    Submitted 18 March, 2024; originally announced March 2024.

  11. arXiv:2402.03174  [pdf, ps, other]

    eess.SY cs.LG

    Decentralized Event-Triggered Online Learning for Safe Consensus of Multi-Agent Systems with Gaussian Process Regression

    Authors: Xiaobing Dai, Zewen Yang, Mengtian Xu, Fangzhou Liu, Georges Hattab, Sandra Hirche

    Abstract: Consensus control in multi-agent systems has received significant attention and practical implementation across various domains. However, managing consensus control under unknown dynamics remains a significant challenge for control design due to system uncertainties and environmental disturbances. This paper presents a novel learning-based distributed control law, augmented by an auxiliary dynamic…

    Submitted 5 February, 2024; originally announced February 2024.

  12. arXiv:2402.03048  [pdf, other]

    cs.MA cs.LG eess.SY

    Cooperative Learning with Gaussian Processes for Euler-Lagrange Systems Tracking Control under Switching Topologies

    Authors: Zewen Yang, Songbo Dong, Armin Lederer, Xiaobing Dai, Siyu Chen, Stefan Sosnowski, Georges Hattab, Sandra Hirche

    Abstract: This work presents an innovative learning-based approach to tackle the tracking control problem of Euler-Lagrange multi-agent systems with partially unknown dynamics operating under switching communication topologies. The approach leverages a correlation-aware cooperative algorithm framework built upon Gaussian process regression, which adeptly captures inter-agent correlations for uncertainty pre…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 8 pages

  13. arXiv:2402.03014  [pdf, other]

    cs.LG cs.AI

    Whom to Trust? Elective Learning for Distributed Gaussian Process Regression

    Authors: Zewen Yang, Xiaobing Dai, Akshat Dubey, Sandra Hirche, Georges Hattab

    Abstract: This paper introduces an innovative approach to enhance distributed cooperative learning using Gaussian process (GP) regression in multi-agent systems (MASs). The key contribution of this work is the development of an elective learning algorithm, namely prior-aware elective distributed GP (Pri-GP), which empowers agents with the capability to selectively request predictions from neighboring agents…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 9 pages, conference preprint

  14. arXiv:2311.02133  [pdf, other]

    eess.SY cs.AI cs.RO

    Safe Online Dynamics Learning with Initially Unknown Models and Infeasible Safety Certificates

    Authors: Alexandre Capone, Ryan Cosner, Aaron Ames, Sandra Hirche

    Abstract: Safety-critical control tasks with high levels of uncertainty are becoming increasingly common. Typically, techniques that guarantee safety during learning and control utilize constraint-based safety certificates, which can be leveraged to compute safe control inputs. However, excessive model uncertainty can render robust safety certification methods infeasible, meaning no control input satisfi…

    Submitted 3 November, 2023; originally announced November 2023.

  15. arXiv:2310.02942  [pdf, other]

    eess.SY cs.LG stat.ML

    Online Constraint Tightening in Stochastic Model Predictive Control: A Regression Approach

    Authors: Alexandre Capone, Tim Brüdigam, Sandra Hirche

    Abstract: Solving chance-constrained stochastic optimal control problems is a significant challenge in control. This is because no analytical solutions exist except in a handful of special cases. A common and computationally efficient approach for tackling chance-constrained stochastic optimal control problems consists of reformulating the chance constraints as hard constraints with a constraint-tightening…

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: Submitted to Transactions on Automatic Control

  16. arXiv:2307.04415  [pdf, other]

    eess.SY cs.LG stat.ML

    Episodic Gaussian Process-Based Learning Control with Vanishing Tracking Errors

    Authors: Armin Lederer, Jonas Umlauft, Sandra Hirche

    Abstract: Due to the increasing complexity of technical systems, accurate first principle models can often not be obtained. Supervised machine learning can mitigate this issue by inferring models from measurement data. Gaussian process regression is particularly well suited for this purpose due to its high data-efficiency and its explicit uncertainty representation, which allows the derivation of prediction…

    Submitted 10 July, 2023; originally announced July 2023.

  17. arXiv:2305.16215  [pdf, other]

    cs.LG eess.SY math.DS stat.ML

    Koopman Kernel Regression

    Authors: Petar Bevanda, Max Beier, Armin Lederer, Stefan Sosnowski, Eyke Hüllermeier, Sandra Hirche

    Abstract: Many machine learning approaches for decision making, such as reinforcement learning, rely on simulators or predictive models to forecast the time-evolution of quantities of interest, e.g., the state of an agent or the reward of a policy. Forecasts of such complex phenomena are commonly described by highly nonlinear dynamical systems, making their use in optimization-based decision-making challeng…

    Submitted 16 January, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted to the thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

  18. arXiv:2305.08169  [pdf, ps, other]

    eess.SY cs.LG

    Can Learning Deteriorate Control? Analyzing Computational Delays in Gaussian Process-Based Event-Triggered Online Learning

    Authors: Xiaobing Dai, Armin Lederer, Zewen Yang, Sandra Hirche

    Abstract: When the dynamics of systems are unknown, supervised machine learning techniques are commonly employed to infer models from data. Gaussian process (GP) regression is a particularly popular learning method for this purpose due to the existence of prediction error bounds. Moreover, GP models can be efficiently updated online, such that event-triggered online learning strategies can be pursued to ens…

    Submitted 14 May, 2023; originally announced May 2023.

  19. arXiv:2304.11265  [pdf, other]

    cs.LG

    Time Series Classification for Detecting Parkinson's Disease from Wrist Motions

    Authors: Cedric Donié, Neha Das, Satoshi Endo, Sandra Hirche

    Abstract: Parkinson's disease (PD) is a neurodegenerative condition characterized by frequently changing motor symptoms, necessitating continuous symptom monitoring for more targeted treatment. Classical time series classification and deep learning techniques have demonstrated limited efficacy in monitoring PD symptoms using wearable accelerometer data due to complex PD movement patterns and the small size…

    Submitted 20 May, 2024; v1 submitted 21 April, 2023; originally announced April 2023.

    Comments: The source code is available under https://github.com/cedricdonie/tsc-for-wrist-motion-pd-detection

    ACM Class: I.5; J.2; J.3

  20. arXiv:2304.05723  [pdf, ps, other]

    eess.SY cs.RO

    Distributed Coverage Control of Constrained Constant-Speed Unicycle Multi-Agent Systems

    Authors: Qingchen Liu, Zengjie Zhang, Nhan Khanh Le, Jiahu Qin, Fangzhou Liu, Sandra Hirche

    Abstract: This paper proposes a novel distributed coverage controller for a multi-agent system with constant-speed unicycle robots (CSUR). The work is motivated by the limitation of the conventional method that does not ensure the satisfaction of hard state- and input-dependent constraints and leads to feasibility issues for multi-CSUR systems. In this paper, we solve these problems by designing a novel cov…

    Submitted 14 March, 2024; v1 submitted 12 April, 2023; originally announced April 2023.

  21. arXiv:2303.17963  [pdf, other]

    eess.SY cs.LG math.OC stat.ML

    Learning-Based Optimal Control with Performance Guarantees for Unknown Systems with Latent States

    Authors: Robert Lefringhausen, Supitsana Srithasan, Armin Lederer, Sandra Hirche

    Abstract: As control engineering methods are applied to increasingly complex systems, data-driven approaches for system identification appear as a promising alternative to physics-based modeling. While the Bayesian approaches prevalent for safety-critical applications usually rely on the availability of state measurements, the states of a complex system are often not directly measurable. It may then be nece…

    Submitted 6 August, 2024; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: Accepted version submitted to the 2024 European Control Conference (ECC)

    Journal ref: 2024 European Control Conference (ECC), pp. 90-97

  22. arXiv:2302.11961  [pdf, other]

    cs.LG stat.ML

    Sharp Calibrated Gaussian Processes

    Authors: Alexandre Capone, Geoff Pleiss, Sandra Hirche

    Abstract: While Gaussian processes are a mainstay for various engineering and scientific applications, the uncertainty estimates don't satisfy frequentist guarantees and can be miscalibrated in practice. State-of-the-art approaches for designing calibrated models rely on inflating the Gaussian process posterior variance, which yields confidence intervals that are potentially too coarse. To remedy this, we p…

    Submitted 16 November, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

  23. Variational Integrators and Graph-Based Solvers for Multibody Dynamics in Maximal Coordinates

    Authors: Jan Brüdigam, Stefan Sosnowski, Zachary Manchester, Sandra Hirche

    Abstract: Multibody dynamics simulators are an important tool in many fields, including learning and control for robotics. However, many existing dynamics simulators suffer from inaccuracies when dealing with constrained mechanical systems due to unsuitable integrators with bad energy behavior and problematic constraint violations, for example for contact interactions. Variational integrators are numerical…

    Submitted 5 November, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

  24. arXiv:2212.00478  [pdf, ps, other]

    eess.SY cs.RO

    Safe Learning-Based Control of Elastic Joint Robots via Control Barrier Functions

    Authors: Armin Lederer, Azra Begzadić, Neha Das, Sandra Hirche

    Abstract: Ensuring safety is of paramount importance in physical human-robot interaction applications. This requires both adherence to safety constraints defined on the system state, as well as guaranteeing compliant behavior of the robot. If the underlying dynamical system is known exactly, the former can be addressed with the help of control barrier functions. The incorporation of elastic actuators in the…

    Submitted 14 April, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

  25. Vision-Based Uncertainty-Aware Motion Planning based on Probabilistic Semantic Segmentation

    Authors: Ralf Römer, Armin Lederer, Samuel Tesfazgi, Sandra Hirche

    Abstract: For safe operation, a robot must be able to avoid collisions in uncertain environments. Existing approaches for motion planning under uncertainties often assume parametric obstacle representations and Gaussian uncertainty, which can be inaccurate. While visual perception can deliver a more accurate representation of the environment, its use for safe motion planning is limited by the inherent misca…

    Submitted 1 December, 2023; v1 submitted 14 September, 2022; originally announced September 2022.

    Journal ref: IEEE Robotics and Automation Letters, vol. 8, no. 11, pp. 7825-7832, 2023

  26. arXiv:2207.01337  [pdf, other]

    cs.LG cs.AI eess.SY

    Safe Reinforcement Learning via Confidence-Based Filters

    Authors: Sebastian Curi, Armin Lederer, Sandra Hirche, Andreas Krause

    Abstract: Ensuring safety is a crucial challenge when deploying reinforcement learning (RL) to real-world systems. We develop confidence-based safety filters, a control-theoretic approach for certifying state safety constraints for nominal policies learned via standard RL techniques, based on probabilistic dynamics models. Our approach is based on a reformulation of state constraints in terms of cost functi…

    Submitted 4 July, 2022; originally announced July 2022.

  27. arXiv:2206.13966  [pdf, other]

    cs.RO

    Dext-Gen: Dexterous Grasping in Sparse Reward Environments with Full Orientation Control

    Authors: Martin Schuck, Jan Brüdigam, Alexandre Capone, Stefan Sosnowski, Sandra Hirche

    Abstract: Reinforcement learning is a promising method for robotic grasping as it can learn effective reaching and grasping policies in difficult scenarios. However, achieving human-like manipulation capabilities with sophisticated robotic hands is challenging because of the problem's high dimensionality. Although remedies such as reward shaping or expert demonstrations can be employed to overcome this issu…

    Submitted 28 June, 2022; originally announced June 2022.

  28. Physically Consistent Learning of Conservative Lagrangian Systems with Gaussian Processes

    Authors: Giulio Evangelisti, Sandra Hirche

    Abstract: This paper proposes a physically consistent Gaussian Process (GP) enabling the identification of uncertain Lagrangian systems. The function space is tailored according to the energy components of the Lagrangian and the differential equation structure, analytically guaranteeing physical and mathematical properties such as energy conservation and quadratic form. The novel formulation of Cholesky dec…

    Submitted 3 February, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

    Comments: Accepted version of paper published by IEEE in 2022 IEEE 61st Conference on Decision and Control (CDC). Final published paper can be found at https://doi.org/10.1109/CDC51059.2022.9993123

  29. arXiv:2202.11491  [pdf, other]

    eess.SY cs.LG

    Networked Online Learning for Control of Safety-Critical Resource-Constrained Systems based on Gaussian Processes

    Authors: Armin Lederer, Mingmin Zhang, Samuel Tesfazgi, Sandra Hirche

    Abstract: Safety-critical technical systems operating in unknown environments require the ability to quickly adapt their behavior, which can be achieved in control by inferring a model online from the data stream generated during operation. Gaussian process-based learning is particularly well suited for safety-critical applications as it ensures bounded prediction errors. While there exist computationally e…

    Submitted 23 February, 2022; originally announced February 2022.

  30. arXiv:2201.11640  [pdf, ps, other]

    eess.SY cs.LG math.DS math.OC

    Towards Data-driven LQR with Koopmanizing Flows

    Authors: Petar Bevanda, Max Beier, Shahab Heshmati-Alamdari, Stefan Sosnowski, Sandra Hirche

    Abstract: We propose a novel framework for learning linear time-invariant (LTI) models for a class of continuous-time non-autonomous nonlinear dynamics based on a representation of Koopman operators. In general, the operator is infinite-dimensional but, crucially, linear. To utilize it for efficient LTI control design, we learn a finite representation of the Koopman operator that is linear in controls while…

    Submitted 23 May, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: Final version, accepted for presentation at the 6th IFAC Conference on Intelligent Control and Automation Sciences (ICONS), 2022. arXiv admin note: text overlap with arXiv:2112.04085

  31. arXiv:2112.05451  [pdf, other]

    cs.LG

    Structure-Preserving Learning Using Gaussian Processes and Variational Integrators

    Authors: Jan Brüdigam, Martin Schuck, Alexandre Capone, Stefan Sosnowski, Sandra Hirche

    Abstract: Gaussian process regression is increasingly applied for learning unknown dynamical systems. In particular, the implicit quantification of the uncertainty of the learned model makes it a promising approach for safety-critical applications. When using Gaussian process regression to learn unknown systems, a commonly considered approach consists of learning the residual dynamics after applying some ge…

    Submitted 17 April, 2022; v1 submitted 10 December, 2021; originally announced December 2021.

    Journal ref: Learning for Dynamics and Control Conference 2022

  32. arXiv:2112.04085  [pdf, other]

    cs.LG eess.SY

    Diffeomorphically Learning Stable Koopman Operators

    Authors: Petar Bevanda, Max Beier, Sebastian Kerz, Armin Lederer, Stefan Sosnowski, Sandra Hirche

    Abstract: System representations inspired by the infinite-dimensional Koopman operator (generator) are increasingly considered for predictive modeling. Due to the operator's linearity, a range of nonlinear systems admit linear predictor representations - allowing for simplified prediction, analysis and control. However, finding meaningful finite-dimensional representations for prediction is difficult as it…

    Submitted 30 May, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: Revised version submitted to IEEE Control Systems Letters (L-CSS) with substantially revised exposition, evaluation and proof of Lemma 2 (previously Lemma 8)

  33. arXiv:2111.03617  [pdf, ps, other]

    eess.SP cs.LG eess.SY

    Adaptive Low-Pass Filtering using Sliding Window Gaussian Processes

    Authors: Alejandro J. Ordóñez-Conejo, Armin Lederer, Sandra Hirche

    Abstract: When signals are measured through physical sensors, they are perturbed by noise. To reduce noise, low-pass filters are commonly employed in order to attenuate high frequency components in the incoming signal, regardless of whether they come from noise or the actual signal. Therefore, low-pass filters must be carefully tuned in order to avoid significant deterioration of the signal. This tuning requires pr…

    Submitted 5 November, 2021; originally announced November 2021.

  34. arXiv:2110.07786  [pdf, other]

    cs.LG eess.SY math.DS

    Learning the Koopman Eigendecomposition: A Diffeomorphic Approach

    Authors: Petar Bevanda, Johannes Kirmayr, Stefan Sosnowski, Sandra Hirche

    Abstract: We present a novel data-driven approach for learning linear representations of a class of stable nonlinear systems using Koopman eigenfunctions. By learning the conjugacy map between a nonlinear system and its Jacobian linearization through a Normalizing Flow one can guarantee the learned function is a diffeomorphism. Using this diffeomorphism, we construct eigenfunctions of the nonlinear system v…

    Submitted 30 May, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted for presentation at the 2022 American Control Conference (ACC)

  35. arXiv:2110.00481  [pdf, other]

    cs.LG cs.RO

    Personalized Rehabilitation Robotics based on Online Learning Control

    Authors: Samuel Tesfazgi, Armin Lederer, Johannes F. Kunz, Alejandro J. Ordóñez-Conejo, Sandra Hirche

    Abstract: The use of rehabilitation robotics in clinical applications is gaining importance due to therapeutic benefits and the ability to alleviate labor-intensive work. However, their practical utility is dependent on the deployment of appropriate control algorithms, which adapt the level of task-assistance according to each individual patient's need. Generally, the required personalization is ach…

    Submitted 15 September, 2022; v1 submitted 1 October, 2021; originally announced October 2021.

  36. arXiv:2109.07262  [pdf, other]

    cs.RO

    Linear-Time Contact and Friction Dynamics in Maximal Coordinates using Variational Integrators

    Authors: Jan Brüdigam, Jana Janeva, Stefan Sosnowski, Sandra Hirche

    Abstract: Simulation of contact and friction dynamics is an important basis for control- and learning-based algorithms. However, the numerical difficulties of contact interactions pose a challenge for robust and efficient simulators. A maximal-coordinate representation of the dynamics enables efficient solving algorithms, but current methods in maximal coordinates require constraint stabilization schemes. T…

    Submitted 15 September, 2021; originally announced September 2021.

  37. arXiv:2109.02606  [pdf, other]

    cs.LG cs.RO eess.SY

    Gaussian Process Uniform Error Bounds with Unknown Hyperparameters for Safety-Critical Applications

    Authors: Alexandre Capone, Armin Lederer, Sandra Hirche

    Abstract: Gaussian processes have become a promising tool for various safety-critical settings, since the posterior variance can be used to directly estimate the model error and quantify risk. However, state-of-the-art techniques for safety-critical settings hinge on the assumption that the kernel hyperparameters are known, which does not apply in general. To mitigate this, we introduce robust Gaussian proc…

    Submitted 20 July, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

  38. arXiv:2107.14580  [pdf, other]

    cs.RO cs.MA

    Distributed Event- and Self-Triggered Coverage Control with Speed Constrained Unicycle Robots

    Authors: Yuni Zhou, Lingxuan Kong, Stefan Sosnowski, Qingchen Liu, Sandra Hirche

    Abstract: Voronoi coverage control is a particular problem of importance in the area of multi-robot systems, which considers a network of multiple autonomous robots, tasked with optimally covering a large area. This is a common task for fleets of fixed-wing Unmanned Aerial Vehicles (UAVs), which are described in this work by a unicycle model with constant forward-speed constraints. We develop event-based co…

    Submitted 30 July, 2021; originally announced July 2021.

  39. arXiv:2106.10662  [pdf, other]

    cs.LG cs.CR

    FedXGBoost: Privacy-Preserving XGBoost for Federated Learning

    Authors: Nhan Khanh Le, Yang Liu, Quang Minh Nguyen, Qingchen Liu, Fangzhou Liu, Quanwei Cai, Sandra Hirche

    Abstract: Federated learning is a distributed machine learning framework that enables collaborative training across multiple parties while ensuring data privacy. Practical adaptation of XGBoost, the state-of-the-art tree boosting framework, to federated learning remains limited due to the high cost incurred by conventional privacy-preserving methods. To address the problem, we propose two variants of federate…

    Submitted 12 August, 2021; v1 submitted 20 June, 2021; originally announced June 2021.

  40. arXiv:2105.12236  [pdf, other]

    cs.RO

    Gaussian Process-based Stochastic Model Predictive Control for Overtaking in Autonomous Racing

    Authors: Tim Brüdigam, Alexandre Capone, Sandra Hirche, Dirk Wollherr, Marion Leibold

    Abstract: A fundamental aspect of racing is overtaking other race cars. Whereas previous research on autonomous racing has mainly focused on lap-time optimization, here, we propose a method to plan overtaking maneuvers in autonomous racing. A Gaussian process is used to learn the behavior of the leading vehicle. Based on the outputs of the Gaussian process, a stochastic Model Predictive Control algorithm p…

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: This work has been accepted to the ICRA 2021 workshop 'Opportunities and Challenges with Autonomous Racing'

  41. arXiv:2104.04483  [pdf, other]

    cs.LG eess.SY

    Inverse Reinforcement Learning: A Control Lyapunov Approach

    Authors: Samuel Tesfazgi, Armin Lederer, Sandra Hirche

    Abstract: Inferring the intent of an intelligent agent from demonstrations and subsequently predicting its behavior is a critical task in many collaborative settings. A common approach to solve this problem is the framework of inverse reinforcement learning (IRL), where the observed agent, e.g., a human demonstrator, is assumed to behave according to an intrinsic cost function that reflects its intent and…

    Submitted 4 October, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: This work has been accepted for presentation at, and publication in the proceedings of, the 2021 IEEE Conference on Decision and Control (CDC)

  42. Distributed Bayesian Online Learning for Cooperative Manipulation

    Authors: Pablo Budde gen. Dohmann, Armin Lederer, Marcel Dißemond, Sandra Hirche

    Abstract: For tasks where the dynamics of multiple agents are physically coupled, e.g., in cooperative manipulation, the coordination between the individual agents becomes crucial, which requires exact knowledge of the interaction dynamics. This problem is typically addressed using centralized estimators, which can negatively impact the flexibility and robustness of the overall system. To overcome this shor…

    Submitted 28 June, 2022; v1 submitted 9 April, 2021; originally announced April 2021.

  43. arXiv:2101.05328  [pdf, other]

    cs.LG eess.SY stat.ML

    Uniform Error and Posterior Variance Bounds for Gaussian Process Regression with Application to Safe Control

    Authors: Armin Lederer, Jonas Umlauft, Sandra Hirche

    Abstract: In application areas where data generation is expensive, Gaussian processes are a preferred supervised learning model due to their high data-efficiency. Particularly in model-based control, Gaussian processes allow the derivation of performance guarantees using probabilistic model error bounds. To make these approaches applicable in practice, two open challenges must be solved i) Existing error bo…

    Submitted 13 January, 2021; originally announced January 2021.

  44. arXiv:2011.10596  [pdf, ps, other]

    eess.SY cs.LG

    The Impact of Data on the Stability of Learning-Based Control- Extended Version

    Authors: Armin Lederer, Alexandre Capone, Thomas Beckers, Jonas Umlauft, Sandra Hirche

    Abstract: Despite the existence of formal guarantees for learning-based control approaches, the relationship between data and control performance is still poorly understood. In this paper, we propose a Lyapunov-based measure for quantifying the impact of data on the certifiable control performance. By modeling unknown system dynamics through Gaussian processes, we can determine the interrelation between mod…

    Submitted 30 July, 2021; v1 submitted 20 November, 2020; originally announced November 2020.

  45. arXiv:2011.08683  [pdf, ps, other]

    cs.IT

    Fisher Information of a Family of Generalized Normal Distributions

    Authors: Precious Ugo Abara, Sandra Hirche

    Abstract: In this brief note we compute the Fisher information of a family of generalized normal distributions. Fisher information is usually defined for regular distributions, i.e. continuously differentiable (log) density functions whose support does not depend on the family parameter $θ$. Although the uniform distribution in $[-θ, + θ]$ does not satisfy the regularity requirements, as a special case of o…

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: 3 pages, 1 figure

  46. arXiv:2010.02613  [pdf, other]

    cs.LG cs.AI cs.RO eess.SY

    Deep Learning based Uncertainty Decomposition for Real-time Control

    Authors: Neha Das, Jonas Umlauft, Armin Lederer, Thomas Beckers, Sandra Hirche

    Abstract: Data-driven control in unknown environments requires a clear understanding of the involved uncertainties for ensuring safety and efficient exploration. While aleatoric uncertainty that arises from measurement noise can often be explicitly modeled given a parametric description, it can be harder to model epistemic uncertainty, which describes the presence or absence of training data. The latter can…

    Submitted 12 July, 2023; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: Accepted at IFAC World Congress 2023

  47. arXiv:2009.06689  [pdf, other]

    eess.SY cs.LG

    Online learning-based trajectory tracking for underactuated vehicles with uncertain dynamics

    Authors: Thomas Beckers, Leonardo Colombo, Sandra Hirche, George J. Pappas

    Abstract: Underactuated vehicles have gained much attention in the recent years due to the increasing amount of aerial and underwater vehicles as well as nanosatellites. Trajectory tracking control of these vehicles is a substantial aspect for an increasing range of application domains. However, external disturbances and parts of the internal dynamics are often unknown or very time-consuming to model. To ov…

    Submitted 14 September, 2021; v1 submitted 14 September, 2020; originally announced September 2020.

  48. arXiv:2007.12377  [pdf, ps, other]

    cs.LG eess.SY stat.ML

    Anticipating the Long-Term Effect of Online Learning in Control

    Authors: Alexandre Capone, Sandra Hirche

    Abstract: Control schemes that learn using measurement data collected online are increasingly promising for the control of complex and uncertain systems. However, in most approaches of this kind, learning is viewed as a side effect that passively improves control performance, e.g., by updating a model of the system dynamics. Determining how improvements in control performance due to learning can be actively…

    Submitted 24 July, 2020; originally announced July 2020.

  49. arXiv:2006.14551  [pdf, other]

    eess.SY cs.LG

    Prediction with Approximated Gaussian Process Dynamical Models

    Authors: Thomas Beckers, Sandra Hirche

    Abstract: The modeling and simulation of dynamical systems is a necessary step for many control approaches. Using classical, parameter-based techniques for modeling of modern systems, e.g., soft robotics or human-robot interaction, is often challenging or even infeasible due to the complexity of the system dynamics. In contrast, data-driven approaches need only a minimum of prior knowledge and scale with th…

    Submitted 30 November, 2021; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: This article has been accepted for publication by IEEE

  50. arXiv:2006.09446  [pdf, ps, other]

    cs.LG cs.RO stat.ML

    Real-Time Regression with Dividing Local Gaussian Processes

    Authors: Armin Lederer, Alejandro Jose Ordonez Conejo, Korbinian Maier, Wenxin Xiao, Jonas Umlauft, Sandra Hirche

    Abstract: The increased demand for online prediction and the growing availability of large data sets drives the need for computationally efficient models. While exact Gaussian process regression shows various favorable theoretical properties (uncertainty estimate, unlimited expressive power), the poor scaling with respect to the training set size prohibits its application in big data regimes in real-time. T…

    Submitted 30 July, 2021; v1 submitted 16 June, 2020; originally announced June 2020.