Zum Hauptinhalt springen

Showing 1–50 of 75 results for author: Theodorou, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.16254  [pdf, other

    cs.RO eess.SY

    Negotiating Control: Neurosymbolic Variable Autonomy

    Authors: Georgios Bakirtzis, Manolis Chiou, Andreas Theodorou

    Abstract: Variable autonomy equips a system, such as a robot, with mixed initiatives such that it can adjust its independence level based on the task's complexity and the surrounding environment. Variable autonomy solves two main problems in robotic planning: the first is the problem of humans being unable to keep focus in monitoring and intervening during robotic tasks without appropriate human factor indi… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  2. arXiv:2405.16381  [pdf, other

    cs.LG cs.AI stat.ML

    Trivialized Momentum Facilitates Diffusion Generative Modeling on Lie Groups

    Authors: Yuchen Zhu, Tianrong Chen, Lingkai Kong, Evangelos A. Theodorou, Molei Tao

    Abstract: The generative modeling of data on manifold is an important task, for which diffusion models in flat spaces typically need nontrivial adaptations. This article demonstrates how a technique called `trivialization' can transfer the effectiveness of diffusion models in Euclidean spaces to Lie groups. In particular, an auxiliary momentum variable was algorithmically introduced to help transport the po… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  3. arXiv:2404.13430  [pdf, other

    physics.chem-ph cs.LG

    React-OT: Optimal Transport for Generating Transition State in Chemical Reactions

    Authors: Chenru Duan, Guan-Horng Liu, Yuanqi Du, Tianrong Chen, Qiyuan Zhao, Haojun Jia, Carla P. Gomes, Evangelos A. Theodorou, Heather J. Kulik

    Abstract: Transition states (TSs) are transient structures that are key in understanding reaction mechanisms and designing catalysts but challenging to be captured in experiments. Alternatively, many optimization algorithms have been developed to search for TSs computationally. Yet the cost of these algorithms driven by quantum chemistry methods (usually density functional theory) is still high, posing chal… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 5 figures, 1 table

  4. arXiv:2404.06336  [pdf, other

    quant-ph cs.LG stat.ML

    Quantum State Generation with Structure-Preserving Diffusion Model

    Authors: Yuchen Zhu, Tianrong Chen, Evangelos A. Theodorou, Xie Chen, Molei Tao

    Abstract: This article considers the generative modeling of the (mixed) states of quantum systems, and an approach based on denoising diffusion model is proposed. The key contribution is an algorithmic innovation that respects the physical nature of quantum states. More precisely, the commonly used density matrix representation of mixed-state has to be complex-valued Hermitian, positive semi-definite, and t… ▽ More

    Submitted 25 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  5. Low Frequency Sampling in Model Predictive Path Integral Control

    Authors: Bogdan Vlahov, Jason Gibson, David D. Fan, Patrick Spieler, Ali-akbar Agha-mohammadi, Evangelos A. Theodorou

    Abstract: Sampling-based model-predictive controllers have become a powerful optimization tool for planning and control problems in various challenging environments. In this paper, we show how the default choice of uncorrelated Gaussian distributions can be improved upon with the use of a colored noise distribution. Our choice of distribution allows for the emphasis on low frequency control signals, which c… ▽ More

    Submitted 18 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: Published to RA-L

    Journal ref: IEEE Robotics and Automation Letters, vol. 9, no. 5, pp.4543-4550, 2024

  6. arXiv:2403.18130  [pdf, other

    math.OC cs.IT

    Generalized Maximum Entropy Differential Dynamic Programming

    Authors: Yuichiro Aoyama, Evangelos A. Theodorou

    Abstract: We present a sampling-based trajectory optimization method derived from the maximum entropy formulation of Differential Dynamic Programming with Tsallis entropy. This method can be seen as a generalization of the legacy work with Shannon entropy, which leads to a Gaussian optimal control policy for exploration during optimization. With the Tsallis entropy, the optimal control policy takes the form… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures, This paper is for CDC 2024

    MSC Class: 34H05

  7. arXiv:2402.16227  [pdf, other

    cs.RO eess.SY math.OC

    Scaling Robust Optimization for Multi-Agent Robotic Systems: A Distributed Perspective

    Authors: Arshiya Taj Abdul, Augustinos D. Saravanos, Evangelos A. Theodorou

    Abstract: This paper presents a novel distributed robust optimization scheme for steering distributions of multi-agent systems under stochastic and deterministic uncertainty. Robust optimization is a subfield of optimization which aims in discovering an optimal solution that remains robustly feasible for all possible realizations of the problem parameters within a given uncertainty set. Such approaches woul… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  8. arXiv:2312.07635  [pdf, other

    cs.AI

    Clash of the Explainers: Argumentation for Context-Appropriate Explanations

    Authors: Leila Methnani, Virginia Dignum, Andreas Theodorou

    Abstract: Understanding when and why to apply any given eXplainable Artificial Intelligence (XAI) technique is not a straightforward task. There is no single approach that is best suited for a given context. This paper aims to address the challenge of selecting the most appropriate explainer given the context in which an explanation is required. For AI explainability to be effective, explanations and how th… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 17 pages, 3 figures, Accepted at XAI^3 Workshop at ECAI 2023

  9. arXiv:2311.06978  [pdf, other

    cs.LG cs.CV stat.ML

    Augmented Bridge Matching

    Authors: Valentin De Bortoli, Guan-Horng Liu, Tianrong Chen, Evangelos A. Theodorou, Weilie Nie

    Abstract: Flow and bridge matching are a novel class of processes which encompass diffusion models. One of the main aspect of their increased flexibility is that these models can interpolate between arbitrary data distributions i.e. they generalize beyond generative modeling and can be applied to learning stochastic (and deterministic) processes of arbitrary transfer tasks between two given distributions. I… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  10. arXiv:2310.07805  [pdf, other

    cs.LG cs.AI

    Generative Modeling with Phase Stochastic Bridges

    Authors: Tianrong Chen, Jiatao Gu, Laurent Dinh, Evangelos A. Theodorou, Joshua Susskind, Shuangfei Zhai

    Abstract: Diffusion models (DMs) represent state-of-the-art generative models for continuous inputs. DMs work by constructing a Stochastic Differential Equation (SDE) in the input space (ie, position space), and using a neural network to reverse it. In this work, we introduce a novel generative modeling framework grounded in \textbf{phase space dynamics}, where a phase space is defined as {an augmented spac… ▽ More

    Submitted 12 May, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  11. arXiv:2310.02233  [pdf, other

    stat.ML cs.LG math.OC

    Generalized Schrödinger Bridge Matching

    Authors: Guan-Horng Liu, Yaron Lipman, Maximilian Nickel, Brian Karrer, Evangelos A. Theodorou, Ricky T. Q. Chen

    Abstract: Modern distribution matching algorithms for training diffusion or flow models directly prescribe the time evolution of the marginal distributions between two boundary distributions. In this work, we consider a generalized distribution matching setup, where these marginals are only implicitly described as a solution to some task-specific objective function. The problem setup, known as the Generaliz… ▽ More

    Submitted 18 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 Camera Ready

  12. arXiv:2310.01236  [pdf, other

    stat.ML cs.CV cs.LG

    Mirror Diffusion Models for Constrained and Watermarked Generation

    Authors: Guan-Horng Liu, Tianrong Chen, Evangelos A. Theodorou, Molei Tao

    Abstract: Modern successes of diffusion models in learning complex, high-dimensional data distributions are attributed, in part, to their capability to construct diffusion processes with analytic transition kernels and score functions. The tractability results in a simulation-free framework with stable regression losses, from which reversed, generative processes can be learned at scale. However, when data i… ▽ More

    Submitted 29 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: submitted to NeurIPS on 5/18 but did not arxiv per NeurIPS policy, accepted on 9/22

  13. arXiv:2309.12756  [pdf, other

    cs.SE cs.AI

    Towards an MLOps Architecture for XAI in Industrial Applications

    Authors: Leonhard Faubel, Thomas Woudsma, Leila Methnani, Amir Ghorbani Ghezeljhemeidan, Fabian Buelow, Klaus Schmid, Willem D. van Driel, Benjamin Kloepper, Andreas Theodorou, Mohsen Nosratinia, Magnus Bång

    Abstract: Machine learning (ML) has become a popular tool in the industrial sector as it helps to improve operations, increase efficiency, and reduce costs. However, deploying and managing ML models in production environments can be complex. This is where Machine Learning Operations (MLOps) comes in. MLOps aims to streamline this deployment and management process. One of the remaining MLOps challenges is th… ▽ More

    Submitted 20 October, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

  14. arXiv:2308.08426  [pdf, other

    math.OC cs.RO

    Differentiable Robust Model Predictive Control

    Authors: Alex Oshin, Hassan Almubarak, Evangelos A. Theodorou

    Abstract: Deterministic model predictive control (MPC), while powerful, is often insufficient for effectively controlling autonomous systems in the real-world. Factors such as environmental noise and model error can cause deviations from the expected nominal performance. Robust MPC algorithms aim to bridge this gap between deterministic and uncertain control. However, these methods are often excessively dif… ▽ More

    Submitted 26 July, 2024; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: Accepted to Robotics: Science and Systems 2024

  15. arXiv:2305.18718  [pdf, other

    cs.RO cs.MA eess.SY

    Distributed Hierarchical Distribution Control for Very-Large-Scale Clustered Multi-Agent Systems

    Authors: Augustinos D. Saravanos, Yihui Li, Evangelos A. Theodorou

    Abstract: As the scale and complexity of multi-agent robotic systems are subject to a continuous increase, this paper considers a class of systems labeled as Very-Large-Scale Multi-Agent Systems (VLMAS) with dimensionality that can scale up to the order of millions of agents. In particular, we consider the problem of steering the state distributions of all agents of a VLMAS to prescribed target distribution… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted at Robotics: Science and Systems 2023

  16. arXiv:2305.02241  [pdf, other

    cs.RO eess.SY

    A Multi-step Dynamics Modeling Framework For Autonomous Driving In Multiple Environments

    Authors: Jason Gibson, Bogdan Vlahov, David Fan, Patrick Spieler, Daniel Pastor, Ali-akbar Agha-mohammadi, Evangelos A. Theodorou

    Abstract: Modeling dynamics is often the first step to making a vehicle autonomous. While on-road autonomous vehicles have been extensively studied, off-road vehicles pose many challenging modeling problems. An off-road vehicle encounters highly complex and difficult-to-model terrain/vehicle interactions, as well as having complex vehicle dynamics of its own. These complexities can create challenges for eff… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

  17. arXiv:2303.06776  [pdf, other

    cs.RO cs.HC

    Robot Health Indicator: A Visual Cue to Improve Level of Autonomy Switching Systems

    Authors: Aniketh Ramesh, Madeleine Englund, Andreas Theodorou, Rustam Stolkin, Manolis Chiou

    Abstract: Using different Levels of Autonomy (LoA), a human operator can vary the extent of control they have over a robot's actions. LoAs enable operators to mitigate a robot's performance degradation or limitations in the its autonomous capabilities. However, LoA regulation and other tasks may often overload an operator's cognitive abilities. Inspired by video game user interfaces, we study if adding a 'R… ▽ More

    Submitted 12 March, 2023; originally announced March 2023.

    Comments: Accepted for Variable Autonomy for human-robot Teaming (VAT) workshop at ACM/IEEE HRI 2023

    ACM Class: I.2.9

  18. arXiv:2303.03360  [pdf, other

    cs.RO eess.SY

    Improved Exploration for Safety-Embedded Differential Dynamic Programming Using Tolerant Barrier States

    Authors: Joshua E. Kuperman, Hassan Almubarak, Augustinos D. Saravanos, Evangelos A. Theodorou

    Abstract: In this paper, we introduce Tolerant Discrete Barrier States (T-DBaS), a novel safety-embedding technique for trajectory optimization with enhanced exploratory capabilities. The proposed approach generalizes the standard discrete barrier state (DBaS) method by accommodating temporary constraint violation during the optimization process while still approximating its safety guarantees. Consequently,… ▽ More

    Submitted 11 March, 2024; v1 submitted 6 March, 2023; originally announced March 2023.

  19. arXiv:2303.01751  [pdf, other

    stat.ML cs.LG

    Deep Momentum Multi-Marginal Schrödinger Bridge

    Authors: Tianrong Chen, Guan-Horng Liu, Molei Tao, Evangelos A. Theodorou

    Abstract: It is a crucial challenge to reconstruct population dynamics using unlabeled samples from distributions at coarse time intervals. Recent approaches such as flow-based models or Schrödinger Bridge (SB) models have demonstrated appealing performance, yet the inferred sample trajectories either fail to account for the underlying stochasticity or are $\underline{D}$eep $\underline{M}$omentum Multi-Mar… ▽ More

    Submitted 5 October, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

  20. arXiv:2302.05872  [pdf, other

    cs.CV cs.LG stat.ML

    I$^2$SB: Image-to-Image Schrödinger Bridge

    Authors: Guan-Horng Liu, Arash Vahdat, De-An Huang, Evangelos A. Theodorou, Weili Nie, Anima Anandkumar

    Abstract: We propose Image-to-Image Schrödinger Bridge (I$^2$SB), a new class of conditional diffusion models that directly learn the nonlinear diffusion processes between two given distributions. These diffusion bridges are particularly useful for image restoration, as the degraded images are structurally informative priors for reconstructing the clean images. I$^2$SB belongs to a tractable class of Schröd… ▽ More

    Submitted 25 May, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: ICML camera ready (high-resolution figures)

  21. arXiv:2212.00398  [pdf, other

    cs.RO eess.SY

    Distributed Model Predictive Covariance Steering

    Authors: Augustinos D. Saravanos, Isin M. Balci, Efstathios Bakolas, Evangelos A. Theodorou

    Abstract: This paper proposes Distributed Model Predictive Covariance Steering (DMPCS), a novel method for safe multi-robot control under uncertainty. The scope of our approach is to blend covariance steering theory, distributed optimization and model predictive control (MPC) into a single methodology that is safe, scalable and decentralized. Initially, we pose a problem formulation that uses the Wasserstei… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

  22. arXiv:2212.00268  [pdf, other

    eess.SY cs.RO

    Gaussian Process Barrier States for Safe Trajectory Optimization and Control

    Authors: Hassan Almubarak, Manan Gandhi, Yuichiro Aoyama, Nader Sadegh, Evangelos A. Theodorou

    Abstract: This paper proposes embedded Gaussian Process Barrier States (GP-BaS), a methodology to safely control unmodeled dynamics of nonlinear system using Bayesian learning. Gaussian Processes (GPs) are used to model the dynamics of the safety-critical system, which is subsequently used in the GP-BaS model. We derive the barrier state dynamics utilizing the GP posterior, which is used to construct a safe… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

  23. arXiv:2210.10814  [pdf, other

    cs.GT cs.RO math.OC

    MPOGames: Efficient Multimodal Partially Observable Dynamic Games

    Authors: Oswin So, Paul Drews, Thomas Balch, Velin Dimitrov, Guy Rosman, Evangelos A. Theodorou

    Abstract: Game theoretic methods have become popular for planning and prediction in situations involving rich multi-agent interactions. However, these methods often assume the existence of a single local Nash equilibria and are hence unable to handle uncertainty in the intentions of different agents. While maximum entropy (MaxEnt) dynamic games try to address this issue, practical approaches solve for MaxEn… ▽ More

    Submitted 23 May, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted to ICRA 2023

  24. arXiv:2210.09010  [pdf, other

    cs.CY cs.AI

    Good AI for Good: How AI Strategies of the Nordic Countries Address the Sustainable Development Goals

    Authors: Andreas Theodorou, Juan Carlos Nieves, Virginia Dignum

    Abstract: Developed and used responsibly Artificial Intelligence (AI) is a force for global sustainable development. Given this opportunity, we expect that the many of the existing guidelines and recommendations for trustworthy or responsible AI will provide explicit guidance on how AI can contribute to the achievement of United Nations' Sustainable Development Goals (SDGs). This would in particular be the… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: IJCAI-AIofAI 2022 : 2nd Workshop on Adverse Impacts and Collateral Effects of AI Technologies

  25. arXiv:2210.00090  [pdf, other

    cs.LG

    Data-driven discovery of non-Newtonian astronomy via learning non-Euclidean Hamiltonian

    Authors: Oswin So, Gongjie Li, Evangelos A. Theodorou, Molei Tao

    Abstract: Incorporating the Hamiltonian structure of physical dynamics into deep learning models provides a powerful way to improve the interpretability and prediction accuracy. While previous works are mostly limited to the Euclidean spaces, their extension to the Lie group manifold is needed when rotations form a key component of the dynamics, such as the higher-order physics beyond simple point-mass dyna… ▽ More

    Submitted 30 September, 2022; originally announced October 2022.

  26. arXiv:2209.09893  [pdf, other

    stat.ML cs.GT cs.LG math.OC

    Deep Generalized Schrödinger Bridge

    Authors: Guan-Horng Liu, Tianrong Chen, Oswin So, Evangelos A. Theodorou

    Abstract: Mean-Field Game (MFG) serves as a crucial mathematical framework in modeling the collective behavior of individual agents interacting stochastically with a large population. In this work, we aim at solving a challenging class of MFGs in which the differentiability of these interacting preferences may not be available to the solver, and the population is urged to converge exactly to some desired di… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: NeurIPS 2022

  27. arXiv:2208.04697  [pdf, other

    cs.CY cs.AI

    Let it RAIN for Social Good

    Authors: Mattias Brännström, Andreas Theodorou, Virginia Dignum

    Abstract: Artificial Intelligence (AI) as a highly transformative technology take on a special role as both an enabler and a threat to UN Sustainable Development Goals (SDGs). AI Ethics and emerging high-level policy efforts stand at the pivot point between these outcomes but is barred from effect due the abstraction gap between high-level values and responsible action. In this paper the Responsible Norms (… ▽ More

    Submitted 26 July, 2022; originally announced August 2022.

  28. arXiv:2204.10740  [pdf, other

    cs.MA cs.AI

    Embracing AWKWARD! Real-time Adjustment of Reactive Plans Using Social Norms

    Authors: Leila Methnani, Andreas Antoniades, Andreas Theodorou

    Abstract: This paper presents the AWKWARD architecture for the development of hybrid agents in Multi-Agent Systems. AWKWARD agents can have their plans re-configured in real time to align with social role requirements under changing environmental and social circumstances. The proposed hybrid architecture makes use of Behaviour Oriented Design (BOD) to develop agents with reactive planning and of the well-es… ▽ More

    Submitted 21 July, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: 18 pages, 2 figures, 3 Tables, 4 Formalisms, Accepted at COINE 2022 Workshop

  29. arXiv:2204.03727  [pdf, other

    math.OC cs.RO

    Parameterized Differential Dynamic Programming

    Authors: Alex Oshin, Matthew D. Houghton, Michael J. Acheson, Irene M. Gregory, Evangelos A. Theodorou

    Abstract: Differential Dynamic Programming (DDP) is an efficient trajectory optimization algorithm relying on second-order approximations of a system's dynamics and cost function, and has recently been applied to optimize systems with time-invariant parameters. Prior works include system parameter estimation and identifying the optimal switching time between modes of hybrid dynamical systems. This paper gen… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: Submitted to RSS 2022

  30. arXiv:2204.02506  [pdf, other

    cs.MA cs.LG

    Deep Graphic FBSDEs for Opinion Dynamics Stochastic Control

    Authors: Tianrong Chen, Ziyi Wang, Evangelos A. Theodorou

    Abstract: In this paper, we present a scalable deep learning approach to solve opinion dynamics stochastic optimal control problems with mean field term coupling in the dynamics and cost function. Our approach relies on the probabilistic representation of the solution of the Hamilton-Jacobi-Bellman partial differential equation. Grounded on the nonlinear version of the Feynman-Kac lemma, the solutions of th… ▽ More

    Submitted 17 April, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

  31. arXiv:2202.10658  [pdf, other

    cs.MA cs.LG cs.RO eess.SY

    Decentralized Safe Multi-agent Stochastic Optimal Control using Deep FBSDEs and ADMM

    Authors: Marcus A. Pereira, Augustinos D. Saravanos, Oswin So, Evangelos A. Theodorou

    Abstract: In this work, we propose a novel safe and scalable decentralized solution for multi-agent control in the presence of stochastic disturbances. Safety is mathematically encoded using stochastic control barrier functions and safe controls are computed by solving quadratic programs. Decentralization is achieved by augmenting to each agent's optimization variables, copy variables, for its neighbors. Th… ▽ More

    Submitted 7 June, 2022; v1 submitted 21 February, 2022; originally announced February 2022.

    Journal ref: Robotics: Science and Systems (RSS), 2022

  32. arXiv:2201.12925  [pdf, other

    math.OC cs.RO

    Multimodal Maximum Entropy Dynamic Games

    Authors: Oswin So, Kyle Stachowicz, Evangelos A. Theodorou

    Abstract: Environments with multi-agent interactions often result a rich set of modalities of behavior between agents due to the inherent suboptimality of decision making processes when agents settle for satisfactory decisions. However, existing algorithms for solving these dynamic games are strictly unimodal and fail to capture the intricate multimodal behaviors of the agents. In this paper, we propose MME… ▽ More

    Submitted 2 February, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

    Comments: Under review for RSS 2022. Supplementary Video: https://youtu.be/7molN_Q38dk

  33. arXiv:2201.06539  [pdf, other

    cs.RO cs.AI

    Spatiotemporal Costmap Inference for MPC via Deep Inverse Reinforcement Learning

    Authors: Keuntaek Lee, David Isele, Evangelos A. Theodorou, Sangjae Bae

    Abstract: It can be difficult to autonomously produce driver behavior so that it appears natural to other traffic participants. Through Inverse Reinforcement Learning (IRL), we can automate this process by learning the underlying reward function from human demonstrations. We propose a new IRL algorithm that learns a goal-conditioned spatiotemporal reward function. The resulting costmap is used by Model Pred… ▽ More

    Submitted 17 January, 2022; originally announced January 2022.

    Comments: IEEE Robotics and Automation Letters (RA-L)

  34. arXiv:2111.09207  [pdf, other

    cs.RO eess.SY

    Optimal-Horizon Model-Predictive Control with Differential Dynamic Programming

    Authors: Kyle Stachowicz, Evangelos A. Theodorou

    Abstract: We present an algorithm, based on the Differential Dynamic Programming framework, to handle trajectory optimization problems in which the horizon is determined online rather than fixed a priori. This algorithm exhibits exact one-step convergence for linear, quadratic, time-invariant problems and is fast enough for real-time nonlinear model-predictive control. We show derivations for the nonlinear… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: Submitted to ICRA 2022

  35. arXiv:2110.11291  [pdf, other

    stat.ML cs.LG math.AP math.OC

    Likelihood Training of Schrödinger Bridge using Forward-Backward SDEs Theory

    Authors: Tianrong Chen, Guan-Horng Liu, Evangelos A. Theodorou

    Abstract: Schrödinger Bridge (SB) is an entropy-regularized optimal transport problem that has received increasing attention in deep generative modeling for its mathematical flexibility compared to the Scored-based Generative Model (SGM). However, it remains unclear whether the optimization principle of SB relates to the modern training of deep generative models, which often rely on constructing log-likelih… ▽ More

    Submitted 3 April, 2023; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: fix appendix net arh error

  36. arXiv:2110.06451  [pdf, other

    math.OC cs.RO

    Maximum Entropy Differential Dynamic Programming

    Authors: Oswin So, Ziyi Wang, Evangelos A. Theodorou

    Abstract: In this paper, we present a novel maximum entropy formulation of the Differential Dynamic Programming algorithm and derive two variants using unimodal and multimodal value functions parameterizations. By combining the maximum entropy Bellman equations with a particular approximation of the cost function, we are able to obtain a new formulation of Differential Dynamic Programming which is able to e… ▽ More

    Submitted 28 February, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: Accepted to ICRA 2022. Supplementary video available at https://youtu.be/NHr9Kj_jnAI

  37. arXiv:2109.14158  [pdf, other

    cs.LG eess.SY math.OC

    Second-Order Neural ODE Optimizer

    Authors: Guan-Horng Liu, Tianrong Chen, Evangelos A. Theodorou

    Abstract: We propose a novel second-order optimization framework for training the emerging deep continuous-time models, specifically the Neural Ordinary Differential Equations (Neural ODEs). Since their training already involves expensive gradient computation by solving a backward ODE, deriving efficient second-order methods becomes highly nontrivial. Nevertheless, inspired by the recent Optimal Control (OC… ▽ More

    Submitted 5 November, 2021; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: Accepted to Advances in Neural Information Processing Systems (NeurIPS) 2021 as Spotlight

  38. arXiv:2109.00183  [pdf, other

    eess.SY cs.AI cs.LG

    Deep $\mathcal{L}^1$ Stochastic Optimal Control Policies for Planetary Soft-landing

    Authors: Marcus A. Pereira, Camilo A. Duarte, Ioannis Exarchos, Evangelos A. Theodorou

    Abstract: In this paper, we introduce a novel deep learning based solution to the Powered-Descent Guidance (PDG) problem, grounded in principles of nonlinear Stochastic Optimal Control (SOC) and Feynman-Kac theory. Our algorithm solves the PDG problem by framing it as an $\mathcal{L}^1$ SOC problem for minimum fuel consumption. Additionally, it can handle practically useful control constraints, nonlinear dy… ▽ More

    Submitted 1 September, 2021; originally announced September 2021.

  39. arXiv:2107.11722  [pdf, other

    cs.RO cs.AI cs.LG

    Learning Risk-aware Costmaps for Traversability in Challenging Environments

    Authors: David D. Fan, Sharmita Dey, Ali-akbar Agha-mohammadi, Evangelos A. Theodorou

    Abstract: One of the main challenges in autonomous robotic exploration and navigation in unknown and unstructured environments is determining where the robot can or cannot safely move. A significant source of difficulty in this determination arises from stochasticity and uncertainty, coming from localization error, sensor sparsity and noise, difficult-to-model robot-ground interactions, and disturbances to… ▽ More

    Submitted 4 September, 2022; v1 submitted 25 July, 2021; originally announced July 2021.

    Comments: Published in RA-L with ICRA presentation option (IEEE International Conference on Robotics and Automation, 2022)

    Journal ref: IEEE Robotics and Automation Letters ( Volume: 7, Issue: 1, January 2022)

  40. Safety Embedded Differential Dynamic Programming Using Discrete Barrier States

    Authors: Hassan Almubarak, Kyle Stachowicz, Nader Sadegh, Evangelos A. Theodorou

    Abstract: Certified safe control is a growing challenge in robotics, especially when performance and safety objectives must be concurrently achieved. In this work, we extend the barrier state (BaS) concept, recently proposed for safe stabilization of continuous time systems, to safety embedded trajectory optimization for discrete time systems using discrete barrier states (DBaS). The constructed DBaS is emb… ▽ More

    Submitted 2 February, 2022; v1 submitted 30 May, 2021; originally announced May 2021.

    Comments: Added extensive quantitative comparisons and analysis in the implementation examples, and revised discussions and illustrations

    Journal ref: IEEE ROBOTICS AND AUTOMATION LETTERS, VOL. 7, NO. 2, APRIL 2022

  41. arXiv:2105.03788  [pdf, other

    cs.LG cs.GT math.OC

    Dynamic Game Theoretic Neural Optimizer

    Authors: Guan-Horng Liu, Tianrong Chen, Evangelos A. Theodorou

    Abstract: The connection between training deep neural networks (DNNs) and optimal control theory (OCT) has attracted considerable attention as a principled tool of algorithmic design. Despite few attempts being made, they have been limited to architectures where the layer propagation resembles a Markovian dynamical system. This casts doubts on their flexibility to modern networks that heavily rely on non-Ma… ▽ More

    Submitted 11 June, 2021; v1 submitted 8 May, 2021; originally announced May 2021.

    Comments: Accepted in International Conference on Machine Learning (ICML) 2021 as Oral

  42. arXiv:2104.00241  [pdf, other

    cs.LG

    Variational Inference MPC using Tsallis Divergence

    Authors: Ziyi Wang, Oswin So, Jason Gibson, Bogdan Vlahov, Manan S. Gandhi, Guan-Horng Liu, Evangelos A. Theodorou

    Abstract: In this paper, we provide a generalized framework for Variational Inference-Stochastic Optimal Control by using thenon-extensive Tsallis divergence. By incorporating the deformed exponential function into the optimality likelihood function, a novel Tsallis Variational Inference-Model Predictive Control algorithm is derived, which includes prior works such as Variational Inference-Model Predictive… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

  43. arXiv:2102.09144  [pdf, other

    cs.RO math.OC physics.app-ph

    Stochastic Spatio-Temporal Optimization for Control and Co-Design of Systems in Robotics and Applied Physics

    Authors: Ethan N. Evans, Andrew P. Kendall, Evangelos A. Theodorou

    Abstract: Correlated with the trend of increasing degrees of freedom in robotic systems is a similar trend of rising interest in Spatio-Temporal systems described by Partial Differential Equations (PDEs) among the robotics and control communities. These systems often exhibit dramatic under-actuation, high dimensionality, bifurcations, and multimodal instabilities. Their control represents many of the curren… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: 34 pages, 10 figures. Submitted to Autonomous Robots special issue of RSS 2020. arXiv admin note: text overlap with arXiv:2002.01397

  44. arXiv:2102.09104  [pdf, other

    cs.LG cs.MA cs.RO eess.SY math.OC

    Distributed Algorithms for Linearly-Solvable Optimal Control in Networked Multi-Agent Systems

    Authors: Neng Wan, Aditya Gahlawat, Naira Hovakimyan, Evangelos A. Theodorou, Petros G. Voulgaris

    Abstract: Distributed algorithms for both discrete-time and continuous-time linearly solvable optimal control (LSOC) problems of networked multi-agent systems (MASs) are investigated in this paper. A distributed framework is proposed to partition the optimal control problem of a networked MAS into several local optimal control problems in factorial subsystems, such that each (central) agent behaves optimall… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

  45. arXiv:2102.04714  [pdf, other

    cs.AI cs.CY

    Interrogating the Black Box: Transparency through Information-Seeking Dialogues

    Authors: Andrea Aler Tubella, Andreas Theodorou, Juan Carlos Nieves

    Abstract: This paper is preoccupied with the following question: given a (possibly opaque) learning system, how can we understand whether its behaviour adheres to governance constraints? The answer can be quite simple: we just need to "ask" the system about it. We propose to construct an investigator agent to query a learning agent -- the suspect agent -- to investigate its adherence to a given ethical poli… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: Accepted at AAMAS 2021

  46. arXiv:2011.10890  [pdf, other

    cs.AI

    Large-Scale Multi-Agent Deep FBSDEs

    Authors: Tianrong Chen, Ziyi Wang, Ioannis Exarchos, Evangelos A. Theodorou

    Abstract: In this paper we present a scalable deep learning framework for finding Markovian Nash Equilibria in multi-agent stochastic games using fictitious play. The motivation is inspired by theoretical analysis of Forward Backward Stochastic Differential Equations (FBSDE) and their implementation in a deep learning setting, which is the source of our algorithm's sample efficiency improvement. By taking a… ▽ More

    Submitted 21 May, 2021; v1 submitted 21 November, 2020; originally announced November 2020.

  47. arXiv:2009.14775  [pdf, other

    eess.SY cs.LG cs.MA cs.RO math.OC

    Cooperative Path Integral Control for Stochastic Multi-Agent Systems

    Authors: Neng Wan, Aditya Gahlawat, Naira Hovakimyan, Evangelos A. Theodorou, Petros G. Voulgaris

    Abstract: A distributed stochastic optimal control solution is presented for cooperative multi-agent systems. The network of agents is partitioned into multiple factorial subsystems, each of which consists of a central agent and neighboring agents. Local control actions that rely only on agents' local observations are designed to optimize the joint cost functions of subsystems. When solving for the local co… ▽ More

    Submitted 20 March, 2021; v1 submitted 30 September, 2020; originally announced September 2020.

    Comments: To appear in American Control Conference 2021, New Orleans, LA, USA

  48. arXiv:2009.13609  [pdf, other

    eess.SY cs.LG cs.MA math.OC

    Compositionality of Linearly Solvable Optimal Control in Networked Multi-Agent Systems

    Authors: Lin Song, Neng Wan, Aditya Gahlawat, Naira Hovakimyan, Evangelos A. Theodorou

    Abstract: In this paper, we discuss the methodology of generalizing the optimal control law from learned component tasks to unlearned composite tasks on Multi-Agent Systems (MASs), by using the linearity composition principle of linearly solvable optimal control (LSOC) problems. The proposed approach achieves both the compositionality and optimality of control actions simultaneously within the cooperative M… ▽ More

    Submitted 22 March, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: Accepted to the 2021 American Control Conference (ACC)

  49. arXiv:2009.01196  [pdf, other

    eess.SY cs.AI cs.RO

    Safe Optimal Control Using Stochastic Barrier Functions and Deep Forward-Backward SDEs

    Authors: Marcus Aloysius Pereira, Ziyi Wang, Ioannis Exarchos, Evangelos A. Theodorou

    Abstract: This paper introduces a new formulation for stochastic optimal control and stochastic dynamic optimization that ensures safety with respect to state and control constraints. The proposed methodology brings together concepts such as Forward-Backward Stochastic Differential Equations, Stochastic Barrier Functions, Differentiable Convex Optimization and Deep Learning. Using the aforementioned concept… ▽ More

    Submitted 2 September, 2020; originally announced September 2020.

    Journal ref: Conference on Robot Learning 2020

  50. arXiv:2009.01090  [pdf, other

    math.OC cs.RO

    Adaptive Risk Sensitive Model Predictive Control with Stochastic Search

    Authors: Ziyi Wang, Oswin So, Keuntaek Lee, Camilo A. Duarte, Evangelos A. Theodorou

    Abstract: We present a general framework for optimizing the Conditional Value-at-Risk for dynamical systems using stochastic search. The framework is capable of handling the uncertainty from the initial condition, stochastic dynamics, and uncertain parameters in the model. The algorithm is compared against a risk-sensitive distributional reinforcement learning framework and demonstrates outperformance on a… ▽ More

    Submitted 12 February, 2021; v1 submitted 2 September, 2020; originally announced September 2020.