Showing 1–15 of 15 results for author: Seiler, P

Searching in archive cs.
  1. arXiv:2407.06888  [pdf, other]

    cs.LG eess.SY math.OC

    A Complete Set of Quadratic Constraints for Repeated ReLU and Generalizations

    Authors: Sahel Vahedi Noori, Bin Hu, Geir Dullerud, Peter Seiler

    Abstract: This paper derives a complete set of quadratic constraints (QCs) for the repeated ReLU. The complete set of QCs is described by a collection of matrix copositivity conditions. We also show that only two functions satisfy all QCs in our complete set: the repeated ReLU and flipped ReLU. Thus our complete set of QCs bounds the repeated ReLU as tight as possible up to the sign invariance inherent in q…

    Submitted 22 August, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2405.05236  [pdf, ps, other]

    eess.SY cs.LG math.OC

    Stability and Performance Analysis of Discrete-Time ReLU Recurrent Neural Networks

    Authors: Sahel Vahedi Noori, Bin Hu, Geir Dullerud, Peter Seiler

    Abstract: This paper presents sufficient conditions for the stability and $\ell_2$-gain performance of recurrent neural networks (RNNs) with ReLU activation functions. These conditions are derived by combining Lyapunov/dissipativity theory with Quadratic Constraints (QCs) satisfied by repeated ReLUs. We write a general class of QCs for repeated ReLUs using known properties for the scalar ReLU. Our stability…

    Submitted 14 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  3. arXiv:2404.07373  [pdf, other]

    eess.SY cs.LG

    Synthesizing Neural Network Controllers with Closed-Loop Dissipativity Guarantees

    Authors: Neelay Junnarkar, Murat Arcak, Peter Seiler

    Abstract: In this paper, a method is presented to synthesize neural network controllers such that the feedback system of plant and controller is dissipative, certifying performance requirements such as L2 gain bounds. The class of plants considered is that of linear time-invariant (LTI) systems interconnected with an uncertainty, including nonlinearities treated as an uncertainty for convenience of analysis…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Submitted to the journal Automatica, 14 pages, 7 figures

  4. arXiv:2404.03647  [pdf, other]

    math.OC cs.AI cs.LG

    Capabilities of Large Language Models in Control Engineering: A Benchmark Study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra

    Authors: Darioush Kevian, Usman Syed, Xingang Guo, Aaron Havens, Geir Dullerud, Peter Seiler, Lianhui Qin, Bin Hu

    Abstract: In this paper, we explore the capabilities of state-of-the-art large language models (LLMs) such as GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra in solving undergraduate-level control problems. Controls provides an interesting case study for LLM reasoning due to its combination of mathematical theory and engineering design. We introduce ControlBench, a benchmark dataset tailored to reflect the bread…

    Submitted 4 April, 2024; originally announced April 2024.

  5. arXiv:2402.11654  [pdf, other]

    math.OC cs.LG

    Model-Free $μ$-Synthesis: A Nonsmooth Optimization Perspective

    Authors: Darioush Keivan, Xingang Guo, Peter Seiler, Geir Dullerud, Bin Hu

    Abstract: In this paper, we revisit model-free policy search on an important robust control benchmark, namely $μ$-synthesis. In the general output-feedback setting, there do not exist convex formulations for this problem, and hence global optimality guarantees are not expected. Apkarian (2011) presented a nonconvex nonsmooth policy optimization approach for this problem, and achieved state-of-the-art design…

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Submitted to L4DC 2024

  6. Synthesis of Stabilizing Recurrent Equilibrium Network Controllers

    Authors: Neelay Junnarkar, He Yin, Fangda Gu, Murat Arcak, Peter Seiler

    Abstract: We propose a parameterization of a nonlinear dynamic controller based on the recurrent equilibrium network, a generalization of the recurrent neural network. We derive constraints on the parameterization under which the controller guarantees exponential stability of a partially observed dynamical system with sector bounded nonlinearities. Finally, we present a method to synthesize this controller…

    Submitted 12 September, 2022; v1 submitted 31 March, 2022; originally announced April 2022.

    Comments: Submitted to IEEE CDC 2022. arXiv admin note: text overlap with arXiv:2109.03861

  7. Efficient Data Structures for Exploiting Sparsity and Structure in Representation of Polynomial Optimization Problems: Implementation in SOSTOOLS

    Authors: Declan Jagt, Sachin Shivakumar, Peter Seiler, Matthew Peet

    Abstract: We present a new data structure for representation of polynomial variables in the parsing of sum-of-squares (SOS) programs. In SOS programs, the variables $s(x;Q)$ are polynomial in the independent variables $x$, but linear in the decision variables $Q$. Current SOS parsers, however, fail to exploit the semi-linear structure of the polynomial variables, treating the decision variables as independe…

    Submitted 2 September, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Journal ref: IEEE Control Systems Letters, vol. 6, pp. 3493-3498, 2022

  8. arXiv:2201.00801  [pdf, other]

    math.OC cs.LG

    Revisiting PGD Attacks for Stability Analysis of Large-Scale Nonlinear Systems and Perception-Based Control

    Authors: Aaron Havens, Darioush Keivan, Peter Seiler, Geir Dullerud, Bin Hu

    Abstract: Many existing region-of-attraction (ROA) analysis tools find difficulty in addressing feedback systems with large-scale neural network (NN) policies and/or high-dimensional sensing modalities such as cameras. In this paper, we tailor the projected gradient descent (PGD) attack method developed in the adversarial learning community as a general-purpose ROA analysis tool for large-scale nonlinear sy…

    Submitted 3 January, 2022; originally announced January 2022.

    Comments: Submitted to L4DC 2022

  9. arXiv:2111.15537  [pdf, other]

    cs.LG math.OC

    Model-Free $μ$ Synthesis via Adversarial Reinforcement Learning

    Authors: Darioush Keivan, Aaron Havens, Peter Seiler, Geir Dullerud, Bin Hu

    Abstract: Motivated by the recent empirical success of policy-based reinforcement learning (RL), there has been a research trend studying the performance of policy-based RL methods on standard control benchmark problems. In this paper, we examine the effectiveness of policy-based RL methods on an important robust control problem, namely $μ$ synthesis. We build a connection between robust adversarial RL and…

    Submitted 8 June, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

    Comments: Accepted to ACC 2022

  10. arXiv:2111.12906  [pdf, ps, other]

    cs.LG cs.AI eess.SY

    Robustness against Adversarial Attacks in Neural Networks using Incremental Dissipativity

    Authors: Bernardo Aquino, Arash Rahnama, Peter Seiler, Lizhen Lin, Vijay Gupta

    Abstract: Adversarial examples can easily degrade the classification performance in neural networks. Empirical methods for promoting robustness to such examples have been proposed, but often lack both analytical insights and formal guarantees. Recently, some robustness certificates have appeared in the literature based on system theoretic notions. This work proposes an incremental dissipativity-based robust…

    Submitted 13 February, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

  11. arXiv:2109.03861  [pdf, other]

    eess.SY cs.AI cs.RO

    Recurrent Neural Network Controllers Synthesis with Stability Guarantees for Partially Observed Systems

    Authors: Fangda Gu, He Yin, Laurent El Ghaoui, Murat Arcak, Peter Seiler, Ming Jin

    Abstract: Neural network controllers have become popular in control tasks thanks to their flexibility and expressivity. Stability is a crucial property for safety-critical dynamical systems, while stabilization of partially observed systems, in many cases, requires controllers to retain and process long-term memories of the past. We consider the important class of recurrent neural networks (RNN) as dynamic…

    Submitted 7 December, 2021; v1 submitted 8 September, 2021; originally announced September 2021.

  12. arXiv:2011.01893  [pdf, other]

    eess.SY cs.MA

    Iterative Best Response for Multi-Body Asset-Guarding Games

    Authors: Emmanuel Sin, Murat Arcak, Douglas Philbrick, Peter Seiler

    Abstract: We present a numerical approach to finding optimal trajectories for players in a multi-body, asset-guarding game with nonlinear dynamics and non-convex constraints. Using the Iterative Best Response (IBR) scheme, we solve for each player's optimal strategy assuming the other players' trajectories are known and fixed. Leveraging recent advances in Sequential Convex Programming (SCP), we use SCP as…

    Submitted 3 November, 2020; originally announced November 2020.

  13. arXiv:2005.12226  [pdf, other]

    eess.SY cs.MA

    Optimal assignment of collaborating agents in multi-body asset-guarding games

    Authors: Emmanuel Sin, Murat Arcak, Andrew Packard, Douglas Philbrick, Peter Seiler

    Abstract: We study a multi-body asset-guarding game in missile defense where teams of interceptor missiles collaborate to defend a non-maneuvering asset against a group of threat missiles. We approach the problem in two steps. We first formulate an assignment problem where we optimally assign subsets of collaborating interceptors to each threat so that all threats are intercepted as far away from the asset…

    Submitted 25 May, 2020; originally announced May 2020.

  14. arXiv:2001.09467  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Tractable Reinforcement Learning of Signal Temporal Logic Objectives

    Authors: Harish Venkataraman, Derya Aksaray, Peter Seiler

    Abstract: Signal temporal logic (STL) is an expressive language to specify time-bound real-world robotic tasks and safety specifications. Recently, there has been an interest in learning optimal policies to satisfy STL specifications via reinforcement learning (RL). Learning to satisfy STL specifications often needs a sufficient length of state history to compute reward and the next action. The need for his…

    Submitted 17 February, 2020; v1 submitted 26 January, 2020; originally announced January 2020.

    Comments: Github code repository: https://github.com/kumaa001/Tractable_RL_for_STL_Objectives. arXiv admin note: text overlap with arXiv:1609.07409

  15. arXiv:1310.4716  [pdf, ps, other]

    math.OC cs.MS eess.SY

    SOSTOOLS Version 4.00 Sum of Squares Optimization Toolbox for MATLAB

    Authors: Antonis Papachristodoulou, James Anderson, Giorgio Valmorbida, Stephen Prajna, Pete Seiler, Pablo Parrilo, Matthew M. Peet, Declan Jagt

    Abstract: The release of SOSTOOLS v4.00 comes as we approach the 20th anniversary of the original release of SOSTOOLS v1.00 back in April, 2002. SOSTOOLS was originally envisioned as a flexible tool for parsing and solving polynomial optimization problems, using the SOS tightening of polynomial positivity constraints, and capable of adapting to the ever-evolving fauna of applications of SOS. There are now a…

    Submitted 27 December, 2021; v1 submitted 17 October, 2013; originally announced October 2013.

    Comments: 64 pages, 3 figures, software available from http://sysos.eng.ox.ac.uk/sostools/