Showing 1–19 of 19 results for author: Dullerud, G

  1. arXiv:2407.06888  [pdf, other]

    cs.LG eess.SY math.OC

    A Complete Set of Quadratic Constraints for Repeated ReLU and Generalizations

    Authors: Sahel Vahedi Noori, Bin Hu, Geir Dullerud, Peter Seiler

    Abstract: This paper derives a complete set of quadratic constraints (QCs) for the repeated ReLU. The complete set of QCs is described by a collection of matrix copositivity conditions. We also show that only two functions satisfy all QCs in our complete set: the repeated ReLU and flipped ReLU. Thus our complete set of QCs bounds the repeated ReLU as tight as possible up to the sign invariance inherent in q…

    Submitted 22 August, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2405.05236  [pdf, ps, other]

    eess.SY cs.LG math.OC

    Stability and Performance Analysis of Discrete-Time ReLU Recurrent Neural Networks

    Authors: Sahel Vahedi Noori, Bin Hu, Geir Dullerud, Peter Seiler

    Abstract: This paper presents sufficient conditions for the stability and $\ell_2$-gain performance of recurrent neural networks (RNNs) with ReLU activation functions. These conditions are derived by combining Lyapunov/dissipativity theory with Quadratic Constraints (QCs) satisfied by repeated ReLUs. We write a general class of QCs for repeated ReLUs using known properties for the scalar ReLU. Our stability…

    Submitted 14 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  3. arXiv:2404.03647  [pdf, other]

    math.OC cs.AI cs.LG

    Capabilities of Large Language Models in Control Engineering: A Benchmark Study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra

    Authors: Darioush Kevian, Usman Syed, Xingang Guo, Aaron Havens, Geir Dullerud, Peter Seiler, Lianhui Qin, Bin Hu

    Abstract: In this paper, we explore the capabilities of state-of-the-art large language models (LLMs) such as GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra in solving undergraduate-level control problems. Controls provides an interesting case study for LLM reasoning due to its combination of mathematical theory and engineering design. We introduce ControlBench, a benchmark dataset tailored to reflect the bread…

    Submitted 4 April, 2024; originally announced April 2024.

  4. arXiv:2402.11654  [pdf, other]

    math.OC cs.LG

    Model-Free $μ$-Synthesis: A Nonsmooth Optimization Perspective

    Authors: Darioush Keivan, Xingang Guo, Peter Seiler, Geir Dullerud, Bin Hu

    Abstract: In this paper, we revisit model-free policy search on an important robust control benchmark, namely $μ$-synthesis. In the general output-feedback setting, there do not exist convex formulations for this problem, and hence global optimality guarantees are not expected. Apkarian (2011) presented a nonconvex nonsmooth policy optimization approach for this problem, and achieved state-of-the-art design…

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Submitted to L4DC 2024

  5. arXiv:2309.06588  [pdf, other]

    eess.SY cs.LG

    Convergence of Gradient-based MAML in LQR

    Authors: Negin Musavi, Geir E. Dullerud

    Abstract: The main objective of this research paper is to investigate the local convergence characteristics of Model-agnostic Meta-learning (MAML) when applied to linear system quadratic optimal control (LQR). MAML and its variations have become popular techniques for quickly adapting to new tasks by leveraging previous learning knowledge in areas like regression, classification, and reinforcement learning.…

    Submitted 15 September, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

  6. arXiv:2302.01388  [pdf, other]

    cs.CR eess.SY stat.CO

    Statistical Verification of Traffic Systems with Expected Differential Privacy

    Authors: Mark Yen, Geir E. Dullerud, Yu Wang

    Abstract: Traffic systems are multi-agent cyber-physical systems whose performance is closely related to human welfare. They work in open environments and are subject to uncertainties from various sources, making their performance hard to verify by traditional model-based approaches. Alternatively, statistical model checking (SMC) can verify their performance by sequentially drawing sample data until the co…

    Submitted 28 February, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: American Control Conference 2023 (ACC23)

  7. arXiv:2211.16751  [pdf, other]

    cs.NI

    DiProber: Using Dual Probing to Estimate Tor Relay Capacities in Underloaded Networks

    Authors: Hussein Darir, Nikita Borisov, Geir Dullerud

    Abstract: Tor is the most popular anonymous communication network. It has millions of daily users seeking privacy while browsing the internet. It has thousands of relays to route and anonymize the source and destinations of the users' packets. To create a path, Tor authorities generate a probability distribution over relays based on the estimates of the capacities of the relays. An incoming user will then sa…

    Submitted 30 November, 2022; originally announced November 2022.

  8. arXiv:2209.11328  [pdf, other]

    cs.RO eess.SY

    Learning Certifiably Robust Controllers Using Fragile Perception

    Authors: Dawei Sun, Negin Musavi, Geir Dullerud, Sanjay Shakkottai, Sayan Mitra

    Abstract: Advances in computer vision and machine learning enable robots to perceive their surroundings in powerful new ways, but these perception modules have well-known fragilities. We consider the problem of synthesizing a safe controller that is robust despite perception errors. The proposed method constructs a state estimator based on Gaussian processes with input-dependent noises. This estimator compu…

    Submitted 22 September, 2022; originally announced September 2022.

  9. arXiv:2201.00801  [pdf, other]

    math.OC cs.LG

    Revisiting PGD Attacks for Stability Analysis of Large-Scale Nonlinear Systems and Perception-Based Control

    Authors: Aaron Havens, Darioush Keivan, Peter Seiler, Geir Dullerud, Bin Hu

    Abstract: Many existing region-of-attraction (ROA) analysis tools find difficulty in addressing feedback systems with large-scale neural network (NN) policies and/or high-dimensional sensing modalities such as cameras. In this paper, we tailor the projected gradient descent (PGD) attack method developed in the adversarial learning community as a general-purpose ROA analysis tool for large-scale nonlinear sy…

    Submitted 3 January, 2022; originally announced January 2022.

    Comments: Submitted to L4DC 2022

  10. arXiv:2111.15537  [pdf, other]

    cs.LG math.OC

    Model-Free $μ$ Synthesis via Adversarial Reinforcement Learning

    Authors: Darioush Keivan, Aaron Havens, Peter Seiler, Geir Dullerud, Bin Hu

    Abstract: Motivated by the recent empirical success of policy-based reinforcement learning (RL), there has been a research trend studying the performance of policy-based RL methods on standard control benchmark problems. In this paper, we examine the effectiveness of policy-based RL methods on an important robust control problem, namely $μ$ synthesis. We build a connection between robust adversarial RL and…

    Submitted 8 June, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

    Comments: Accepted to ACC 2022

  11. arXiv:2011.11852  [pdf, ps, other]

    math.OC cs.LG eess.SY

    Policy Optimization for Markovian Jump Linear Quadratic Control: Gradient-Based Methods and Global Convergence

    Authors: Joao Paulo Jansch-Porto, Bin Hu, Geir Dullerud

    Abstract: Recently, policy optimization for control purposes has received renewed attention due to the increasing interest in reinforcement learning. In this paper, we investigate the global convergence of gradient-based policy optimization methods for quadratic optimal control of discrete-time Markovian jump linear systems (MJLS). First, we study the optimization landscape of direct policy optimization for…

    Submitted 23 November, 2020; originally announced November 2020.

  12. arXiv:2006.03116  [pdf, other]

    math.OC cs.LG eess.SY

    Policy Learning of MDPs with Mixed Continuous/Discrete Variables: A Case Study on Model-Free Control of Markovian Jump Systems

    Authors: Joao Paulo Jansch-Porto, Bin Hu, Geir Dullerud

    Abstract: Markovian jump linear systems (MJLS) are an important class of dynamical systems that arise in many control applications. In this paper, we introduce the problem of controlling unknown (discrete-time) MJLS as a new benchmark for policy-based reinforcement learning of Markov decision processes (MDPs) with mixed continuous/discrete state variables. Compared with the traditional linear quadratic regu…

    Submitted 14 July, 2020; v1 submitted 4 June, 2020; originally announced June 2020.

    Comments: Accepted to L4DC 2020

  13. arXiv:2004.00275  [pdf, other]

    cs.LG cs.CR stat.ML

    Differentially Private Algorithms for Statistical Verification of Cyber-Physical Systems

    Authors: Yu Wang, Hussein Sibai, Mark Yen, Sayan Mitra, Geir E. Dullerud

    Abstract: Statistical model checking is a class of sequential algorithms that can verify specifications of interest on an ensemble of cyber-physical systems (e.g., whether 99% of cars from a batch meet a requirement on their energy efficiency). These algorithms infer the probability that given specifications are satisfied by the systems with provable statistical guarantees by drawing sufficient numbers of i…

    Submitted 27 June, 2022; v1 submitted 1 April, 2020; originally announced April 2020.

    Comments: Under review for IEEE Open Journal of Control Systems

  14. arXiv:2004.00273  [pdf, ps, other]

    cs.LG stat.ML

    Statistically Model Checking PCTL Specifications on Markov Decision Processes via Reinforcement Learning

    Authors: Yu Wang, Nima Roohi, Matthew West, Mahesh Viswanathan, Geir E. Dullerud

    Abstract: Probabilistic Computation Tree Logic (PCTL) is frequently used to formally specify control objectives such as probabilistic reachability and safety. In this work, we focus on model checking PCTL specifications statistically on Markov Decision Processes (MDPs) by sampling, e.g., checking whether there exists a feasible policy such that the probability of reaching certain goal states is greater than…

    Submitted 21 April, 2020; v1 submitted 1 April, 2020; originally announced April 2020.

  15. arXiv:2002.04090  [pdf, ps, other]

    math.OC cs.LG eess.SY

    Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems

    Authors: Joao Paulo Jansch-Porto, Bin Hu, Geir Dullerud

    Abstract: Recently, policy optimization for control purposes has received renewed attention due to the increasing interest in reinforcement learning. In this paper, we investigate the convergence of policy optimization for quadratic control of Markovian jump linear systems (MJLS). First, we study the optimization landscape of direct policy optimization for MJLS, and, in particular, show that despite the non…

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: Accepted to ACC 2020

  16. arXiv:1911.01537  [pdf, other]

    cs.LG cs.FL

    Verification and Parameter Synthesis for Stochastic Systems using Optimistic Optimization

    Authors: Negin Musavi, Dawei Sun, Sayan Mitra, Geir Dullerud, Sanjay Shakkottai

    Abstract: We present an algorithm for formal verification and parameter synthesis of continuous state-space Markov chains. This class of problems captures the design and analysis of a wide variety of autonomous and cyber-physical systems defined by nonlinear and black-box modules. In order to solve these problems, one has to maximize certain probabilistic objective functions over all choices of initial state…

    Submitted 3 December, 2020; v1 submitted 4 November, 2019; originally announced November 2019.

    Comments: 24 pages, 7 figures

  17. arXiv:1910.01557  [pdf, other]

    cs.RO

    CyPhyHouse: A Programming, Simulation, and Deployment Toolchain for Heterogeneous Distributed Coordination

    Authors: Ritwika Ghosh, Joao P. Jansch-Porto, Chiao Hsieh, Amelia Gosse, Minghao Jiang, Hebron Taylor, Peter Du, Sayan Mitra, Geir Dullerud

    Abstract: Programming languages, libraries, and development tools have transformed the application development processes for mobile computing and machine learning. This paper introduces CyPhyHouse, a toolchain that aims to provide similar programming, debugging, and deployment benefits for distributed mobile robotic applications. Users can develop hardware-agnostic, distributed applications using the h…

    Submitted 10 October, 2019; v1 submitted 3 October, 2019; originally announced October 2019.

  18. arXiv:1501.04925  [pdf, other]

    eess.SY cs.CR

    Controller Synthesis for Linear Time-varying Systems with Adversaries

    Authors: Zhenqi Huang, Yu Wang, Sayan Mitra, Geir Dullerud

    Abstract: We present a controller synthesis algorithm for a discrete time reach-avoid problem in the presence of adversaries. Our model of the adversary captures typical malicious attacks envisioned on cyber-physical systems such as sensor spoofing, controller corruption, and actuator intrusion. After formulating the problem in a general setting, we present a sound and complete algorithm for the case with l…

    Submitted 18 January, 2015; originally announced January 2015.

    Comments: 10 pages, 4 figures; under submission for review

  19. arXiv:1207.4262  [pdf, other]

    cs.CR cs.DC eess.SY

    Differentially Private Iterative Synchronous Consensus

    Authors: Zhenqi Huang, Sayan Mitra, Geir Dullerud

    Abstract: The iterative consensus problem requires a set of processes or agents with different initial values, to interact and update their states to eventually converge to a common value. Protocols solving iterative consensus serve as building blocks in a variety of systems where distributed coordination is required for load balancing, data aggregation, sensor fusion, filtering, clock synchronization and p…

    Submitted 8 August, 2012; v1 submitted 18 July, 2012; originally announced July 2012.

    Comments: The original manuscript from 18th July was updated with new proofs for Lemmas 3, 6, and 8