Showing 1–14 of 14 results for author: Benosman, M

Searching in archive cs.
  1. arXiv:2403.01005  [pdf, other]

    eess.SY cs.AI math.OC

    Policy Optimization for PDE Control with a Warm Start

    Authors: Xiangyuan Zhang, Saviz Mowlavi, Mouhacine Benosman, Tamer Başar

    Abstract: Dimensionality reduction is crucial for controlling nonlinear partial differential equations (PDE) through a "reduce-then-design" strategy, which identifies a reduced-order model and then implements model-based control solutions. However, inaccuracies in the reduced-order modeling can substantially degrade controller performance, especially in PDEs with chaotic behavior. To address this issue, we…

    Submitted 1 March, 2024; originally announced March 2024.

  2. arXiv:2402.15636  [pdf, other]

    cs.LG cs.CE math-ph math.NA

    Smooth and Sparse Latent Dynamics in Operator Learning with Jerk Regularization

    Authors: Xiaoyu Xie, Saviz Mowlavi, Mouhacine Benosman

    Abstract: Spatiotemporal modeling is critical for understanding complex systems across various scientific and engineering disciplines, but governing equations are often not fully known or computationally intractable due to inherent system complexity. Data-driven reduced-order models (ROMs) offer a promising approach for fast and accurate spatiotemporal forecasting by computing solutions in a compressed late…

    Submitted 23 February, 2024; originally announced February 2024.

  3. arXiv:2311.18736  [pdf, other]

    eess.SY cs.AI cs.CE cs.LG math.OC

    Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms

    Authors: Xiangyuan Zhang, Weichao Mao, Saviz Mowlavi, Mouhacine Benosman, Tamer Başar

    Abstract: We introduce controlgym, a library of thirty-six industrial control settings, and ten infinite-dimensional partial differential equation (PDE)-based control problems. Integrated within the OpenAI Gym/Gymnasium (Gym) framework, controlgym allows direct applications of standard reinforcement learning (RL) algorithms like stable-baselines3. Our control environments complement those in Gym with contin…

    Submitted 23 April, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: 25 pages, 16 figures

  4. arXiv:2309.04831  [pdf, other]

    math.OC cs.AI cs.LG eess.SY math.DS

    Global Convergence of Receding-Horizon Policy Search in Learning Estimator Designs

    Authors: Xiangyuan Zhang, Saviz Mowlavi, Mouhacine Benosman, Tamer Başar

    Abstract: We introduce the receding-horizon policy gradient (RHPG) algorithm, the first PG algorithm with provable global convergence in learning the optimal linear estimator designs, i.e., the Kalman filter (KF). Notably, the RHPG algorithm does not require any prior knowledge of the system for initialization and does not require the target system to be open-loop stable. The key of RHPG is that we integrat…

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:2301.12624

  5. arXiv:2302.01189  [pdf, other]

    cs.LG eess.SY

    Reinforcement learning-based estimation for partial differential equations

    Authors: Saviz Mowlavi, Mouhacine Benosman

    Abstract: In systems governed by nonlinear partial differential equations such as fluid flows, the design of state estimators such as Kalman filters relies on a reduced-order model (ROM) that projects the original high-dimensional dynamics onto a computationally tractable low-dimensional space. However, ROMs are prone to large errors, which negatively affects the performance of the estimator. Here, we intro…

    Submitted 4 April, 2024; v1 submitted 20 January, 2023; originally announced February 2023.

    Comments: 24 pages, 15 figures

  6. arXiv:2301.12593  [pdf, other]

    cs.LG cs.AI stat.ML

    Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning

    Authors: James Queeney, Mouhacine Benosman

    Abstract: Many real-world domains require safe decision making in uncertain environments. In this work, we introduce a deep reinforcement learning framework for approaching this important problem. We consider a distribution over transition models, and apply a risk-averse perspective towards model uncertainty through the use of coherent distortion risk measures. We provide robustness guarantees for this fram…

    Submitted 26 October, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  7. Domain Knowledge-Infused Deep Learning for Automated Analog/Radio-Frequency Circuit Parameter Optimization

    Authors: Weidong Cao, Mouhacine Benosman, Xuan Zhang, Rui Ma

    Abstract: The design automation of analog circuits is a longstanding challenge. This paper presents a reinforcement learning method enhanced by graph learning to automate the analog circuit parameter optimization at the pre-layout stage, i.e., finding device parameters to fulfill desired circuit specifications. Unlike all prior methods, our approach is inspired by human experts who rely on domain knowledge…

    Submitted 17 May, 2022; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: 7 pages, 7 figures. arXiv admin note: substantial text overlap with arXiv:2202.13185

  8. arXiv:2202.13185  [pdf, other]

    cs.LG cs.AI cs.AR

    Domain Knowledge-Based Automated Analog Circuit Design with Deep Reinforcement Learning

    Authors: Weidong Cao, Mouhacine Benosman, Xuan Zhang, Rui Ma

    Abstract: The design automation of analog circuits is a longstanding challenge in the integrated circuit field. This paper presents a deep reinforcement learning method to expedite the design of analog circuits at the pre-layout stage, where the goal is to find device parameters to fulfill desired circuit specifications. Our approach is inspired by experienced human designers who rely on domain knowledge of…

    Submitted 26 February, 2022; originally announced February 2022.

    Comments: 8 pages, 5 figures, 2 tables, Thirty-Sixth AAAI Conference on Artificial Intelligence, The 1st Annual AAAI Workshop on AI to Accelerate Science and Engineering (AI2ASE)

  9. arXiv:2108.02701  [pdf, ps, other]

    cs.LG eess.SY

    Lyapunov Robust Constrained-MDPs: Soft-Constrained Robustly Stable Policy Optimization under Model Uncertainty

    Authors: Reazul Hasan Russel, Mouhacine Benosman, Jeroen Van Baar, Radu Corcodel

    Abstract: Safety and robustness are two desired properties for any reinforcement learning algorithm. CMDPs can handle additional safety constraints and RMDPs can perform well under model uncertainties. In this paper, we propose to unite these two frameworks resulting in robust constrained MDPs (RCMDPs). The motivation is to develop a framework that can satisfy safety constraints while also simultaneously of…

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: arXiv admin note: text overlap with arXiv:2010.04870

  10. arXiv:2010.04870  [pdf, other]

    cs.LG

    Robust Constrained-MDPs: Soft-Constrained Robust Policy Optimization under Model Uncertainty

    Authors: Reazul Hasan Russel, Mouhacine Benosman, Jeroen Van Baar

    Abstract: In this paper, we focus on the problem of robustifying reinforcement learning (RL) algorithms with respect to model uncertainties. Indeed, in the framework of model-based RL, we propose to merge the theory of constrained Markov decision process (CMDP), with the theory of robust Markov decision process (RMDP), leading to a formulation of robust constrained-MDPs (RCMDP). This formulation, simple in…

    Submitted 9 October, 2020; originally announced October 2020.

  11. arXiv:2010.02990  [pdf, other]

    cs.LG eess.SY

    First-Order Optimization Inspired from Finite-Time Convergent Flows

    Authors: Siqi Zhang, Mouhacine Benosman, Orlando Romero, Anoop Cherian

    Abstract: In this paper, we investigate the performance of two first-order optimization algorithms, obtained from forward Euler discretization of finite-time optimization flows. These flows are the rescaled-gradient flow (RGF) and the signed-gradient flow (SGF), and consist of non-Lipschitz or discontinuous dynamical systems that converge locally in finite time to the minima of gradient-dominated functions.…

    Submitted 17 October, 2022; v1 submitted 6 October, 2020; originally announced October 2020.

    MSC Class: 68T07

  12. arXiv:2005.05888  [pdf, other]

    eess.SY cs.LG math.OC

    Safe Learning-based Observers for Unknown Nonlinear Systems using Bayesian Optimization

    Authors: Ankush Chakrabarty, Mouhacine Benosman

    Abstract: Data generated from dynamical systems with unknown dynamics enable the learning of state observers that are: robust to modeling error, computationally tractable to design, and capable of operating with guaranteed performance. In this paper, a modular design methodology is formulated, that consists of three design phases: (i) an initial robust observer design that enables one to learn the dynamics…

    Submitted 25 June, 2021; v1 submitted 12 May, 2020; originally announced May 2020.

    Comments: 23 pages, post-review draft

  13. arXiv:2001.08092  [pdf, other]

    cs.LG cs.RO eess.SY stat.ML

    Local Policy Optimization for Trajectory-Centric Reinforcement Learning

    Authors: Patrik Kolaric, Devesh K. Jha, Arvind U. Raghunathan, Frank L. Lewis, Mouhacine Benosman, Diego Romeres, Daniel Nikovski

    Abstract: The goal of this paper is to present a method for simultaneous trajectory and local stabilizing policy optimization to generate local policies for trajectory-centric model-based reinforcement learning (MBRL). This is motivated by the fact that global policy optimization for non-linear systems could be a very challenging problem both algorithmically and numerically. However, a lot of robotic manipu…

    Submitted 22 January, 2020; originally announced January 2020.

    Journal ref: ICRA 2020

  14. arXiv:1912.08342  [pdf, other]

    math.OC cs.LG stat.ML

    Finite-Time Convergence of Continuous-Time Optimization Algorithms via Differential Inclusions

    Authors: Orlando Romero, Mouhacine Benosman

    Abstract: In this paper, we propose two discontinuous dynamical systems in continuous time with guaranteed prescribed finite-time local convergence to strict local minima of a given cost function. Our approach consists of exploiting a Lyapunov-based differential inequality for differential inclusions, which leads to finite-time stability and thus finite-time convergence with a provable bound on the settling…

    Submitted 17 December, 2019; originally announced December 2019.

    Comments: Presented at the workshop "Beyond First Order Methods in Machine Learning" at NeurIPS 2019