Zum Hauptinhalt springen

Showing 1–17 of 17 results for author: Lale, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.15466  [pdf

    cs.LG

    EKGNet: A 10.96μW Fully Analog Neural Network for Intra-Patient Arrhythmia Classification

    Authors: Benyamin Haghi, Lin Ma, Sahin Lale, Anima Anandkumar, Azita Emami

    Abstract: We present an integrated approach by combining analog computing and deep learning for electrocardiogram (ECG) arrhythmia classification. We propose EKGNet, a hardware-efficient and fully analog arrhythmia classification architecture that archives high accuracy with low power consumption. The proposed architecture leverages the energy efficiency of transistors operating in the subthreshold region,… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted on IEEE Biomedical Circuits and Systems (BioCAS) 2023

  2. arXiv:2301.08290  [pdf, ps, other

    physics.flu-dyn cs.LG

    Forecasting subcritical cylinder wakes with Fourier Neural Operators

    Authors: Peter I Renn, Cong Wang, Sahin Lale, Zongyi Li, Anima Anandkumar, Morteza Gharib

    Abstract: We apply Fourier neural operators (FNOs), a state-of-the-art operator learning technique, to forecast the temporal evolution of experimentally measured velocity fields. FNOs are a recently developed machine learning method capable of approximating solution operators to systems of partial differential equations through data alone. The learned FNO solution operator can be evaluated in milliseconds,… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: 12 pages, 6 figures

  3. arXiv:2206.08520  [pdf, ps, other

    cs.LG eess.SY math.OC stat.ML

    Thompson Sampling Achieves $\tilde O(\sqrt{T})$ Regret in Linear Quadratic Control

    Authors: Taylan Kargin, Sahin Lale, Kamyar Azizzadenesheli, Anima Anandkumar, Babak Hassibi

    Abstract: Thompson Sampling (TS) is an efficient method for decision-making under uncertainty, where an action is sampled from a carefully prescribed distribution which is updated based on the observed data. In this work, we study the problem of adaptive control of stabilizable linear-quadratic regulators (LQRs) using TS, where the system dynamics are unknown. Previous works have established that… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: Accepted for presentation at the Conference on Learning Theory (COLT) 2022

  4. arXiv:2206.01782  [pdf, other

    math.OC cs.LG eess.SY

    Optimal Competitive-Ratio Control

    Authors: Oron Sabag, Sahin Lale, Babak Hassibi

    Abstract: Inspired by competitive policy designs approaches in online learning, new control paradigms such as competitive-ratio and regret-optimal control have been recently proposed as alternatives to the classical $\mathcal{H}_2$ and $\mathcal{H}_\infty$ approaches. These competitive metrics compare the control cost of the designed controller against the cost of a clairvoyant controller, which has access… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  5. arXiv:2206.01704  [pdf, ps, other

    cs.LG eess.SY math.OC stat.ML

    KCRL: Krasovskii-Constrained Reinforcement Learning with Guaranteed Stability in Nonlinear Dynamical Systems

    Authors: Sahin Lale, Yuanyuan Shi, Guannan Qu, Kamyar Azizzadenesheli, Adam Wierman, Anima Anandkumar

    Abstract: Learning a dynamical system requires stabilizing the unknown dynamics to avoid state blow-ups. However, current reinforcement learning (RL) methods lack stabilization guarantees, which limits their applicability for the control of safety-critical systems. We propose a model-based RL framework with formal stability guarantees, Krasovskii Constrained RL (KCRL), that adopts Krasovskii's family of Lya… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  6. arXiv:2202.10788  [pdf, other

    cs.LG math.OC stat.ML

    Explicit Regularization via Regularizer Mirror Descent

    Authors: Navid Azizan, Sahin Lale, Babak Hassibi

    Abstract: Despite perfectly interpolating the training data, deep neural networks (DNNs) can often generalize fairly well, in part due to the "implicit regularization" induced by the learning algorithm. Nonetheless, various forms of regularization, such as "explicit regularization" (via weight decay), are often used to avoid overfitting, especially when the data is corrupted. There are several challenges wi… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

  7. arXiv:2112.07746  [pdf, other

    cs.LG eess.SY math.OC stat.ML

    CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning

    Authors: Kevin Huang, Sahin Lale, Ugo Rosolia, Yuanyuan Shi, Anima Anandkumar

    Abstract: Current state-of-the-art model-based reinforcement learning algorithms use trajectory sampling methods, such as the Cross-Entropy Method (CEM), for planning in continuous control settings. These zeroth-order optimizers require sampling a large number of trajectory rollouts to select an optimal action, which scales poorly for large prediction horizons or high dimensional action spaces. First-order… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

  8. arXiv:2108.11959  [pdf, ps, other

    cs.LG eess.SY math.OC

    Finite-time System Identification and Adaptive Control in Autoregressive Exogenous Systems

    Authors: Sahin Lale, Kamyar Azizzadenesheli, Babak Hassibi, Anima Anandkumar

    Abstract: Autoregressive exogenous (ARX) systems are the general class of input-output dynamical systems used for modeling stochastic linear dynamical systems (LDS) including partially observable LDS such as LQG systems. In this work, we study the problem of system identification and adaptive control of unknown ARX systems. We provide finite-time learning guarantees for the ARX systems under both open-loop… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: 3rd Annual Learning for Dynamics & Control Conference (L4DC)

  9. arXiv:2105.01244  [pdf, ps, other

    math.OC cs.LG eess.SY

    Regret-Optimal LQR Control

    Authors: Oron Sabag, Gautam Goel, Sahin Lale, Babak Hassibi

    Abstract: We consider the infinite-horizon LQR control problem. Motivated by competitive analysis in online learning, as a criterion for controller design we introduce the dynamic regret, defined as the difference between the LQR cost of a causal controller (that has only access to past disturbances) and the LQR cost of the \emph{unique} clairvoyant one (that has also access to future disturbances) that is… ▽ More

    Submitted 13 April, 2023; v1 submitted 3 May, 2021; originally announced May 2021.

  10. arXiv:2104.14134  [pdf, other

    math.OC cs.LG eess.SY

    Stable Online Control of Linear Time-Varying Systems

    Authors: Guannan Qu, Yuanyuan Shi, Sahin Lale, Anima Anandkumar, Adam Wierman

    Abstract: Linear time-varying (LTV) systems are widely used for modeling real-world dynamical systems due to their generality and simplicity. Providing stability guarantees for LTV systems is one of the central problems in control theory. However, existing approaches that guarantee stability typically lead to significantly sub-optimal cumulative control cost in online settings where only current or short-te… ▽ More

    Submitted 29 April, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: 3rd Annual Learning for Dynamics & Control Conference (L4DC)

  11. arXiv:2012.04160  [pdf, other

    cs.LG math.OC stat.ML

    Stability and Identification of Random Asynchronous Linear Time-Invariant Systems

    Authors: Sahin Lale, Oguzhan Teke, Babak Hassibi, Anima Anandkumar

    Abstract: In many computational tasks and dynamical systems, asynchrony and randomization are naturally present and have been considered as ways to increase the speed and reduce the cost of computation while compromising the accuracy and convergence rate. In this work, we show the additional benefits of randomization and asynchrony on the stability of linear dynamical systems. We introduce a natural model f… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

  12. arXiv:2007.12291  [pdf, other

    cs.LG math.OC stat.ML

    Reinforcement Learning with Fast Stabilization in Linear Dynamical Systems

    Authors: Sahin Lale, Kamyar Azizzadenesheli, Babak Hassibi, Anima Anandkumar

    Abstract: In this work, we study model-based reinforcement learning (RL) in unknown stabilizable linear dynamical systems. When learning a dynamical system, one needs to stabilize the unknown dynamics in order to avoid system blow-ups. We propose an algorithm that certifies fast stabilization of the underlying system by effectively exploring the environment with an improved exploration strategy. We show tha… ▽ More

    Submitted 3 June, 2022; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: 25th International Conference on Artificial Intelligence and Statistics (AISTATS) 2022

  13. arXiv:2003.11227  [pdf, other

    cs.LG math.OC stat.ML

    Logarithmic Regret Bound in Partially Observable Linear Dynamical Systems

    Authors: Sahin Lale, Kamyar Azizzadenesheli, Babak Hassibi, Anima Anandkumar

    Abstract: We study the problem of system identification and adaptive control in partially observable linear dynamical systems. Adaptive and closed-loop system identification is a challenging problem due to correlations introduced in data collection. In this paper, we present the first model estimation method with finite-time guarantees in both open and closed-loop system identification. Deploying this estim… ▽ More

    Submitted 23 June, 2020; v1 submitted 25 March, 2020; originally announced March 2020.

  14. arXiv:2003.05999  [pdf, ps, other

    cs.LG math.OC stat.ML

    Adaptive Control and Regret Minimization in Linear Quadratic Gaussian (LQG) Setting

    Authors: Sahin Lale, Kamyar Azizzadenesheli, Babak Hassibi, Anima Anandkumar

    Abstract: We study the problem of adaptive control in partially observable linear quadratic Gaussian control systems, where the model dynamics are unknown a priori. We propose LqgOpt, a novel reinforcement learning algorithm based on the principle of optimism in the face of uncertainty, to effectively minimize the overall control cost. We employ the predictor state evolution representation of the system dyn… ▽ More

    Submitted 23 June, 2020; v1 submitted 12 March, 2020; originally announced March 2020.

  15. arXiv:2002.00082  [pdf, ps, other

    cs.LG math.OC stat.ML

    Regret Minimization in Partially Observable Linear Quadratic Control

    Authors: Sahin Lale, Kamyar Azizzadenesheli, Babak Hassibi, Anima Anandkumar

    Abstract: We study the problem of regret minimization in partially observable linear quadratic control systems when the model dynamics are unknown a priori. We propose ExpCommit, an explore-then-commit algorithm that learns the model Markov parameters and then follows the principle of optimism in the face of uncertainty to design a controller. We propose a novel way to decompose the regret and provide an en… ▽ More

    Submitted 7 March, 2020; v1 submitted 31 January, 2020; originally announced February 2020.

  16. arXiv:1906.03830  [pdf, other

    cs.LG math.OC stat.ML

    Stochastic Mirror Descent on Overparameterized Nonlinear Models: Convergence, Implicit Regularization, and Generalization

    Authors: Navid Azizan, Sahin Lale, Babak Hassibi

    Abstract: Most modern learning problems are highly overparameterized, meaning that there are many more parameters than the number of training data points, and as a result, the training loss may have infinitely many global minima (parameter vectors that perfectly interpolate the training data). Therefore, it is important to understand which interpolating solutions we converge to, how they depend on the initi… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

  17. arXiv:1901.09490  [pdf, other

    cs.LG stat.ML

    Stochastic Linear Bandits with Hidden Low Rank Structure

    Authors: Sahin Lale, Kamyar Azizzadenesheli, Anima Anandkumar, Babak Hassibi

    Abstract: High-dimensional representations often have a lower dimensional underlying structure. This is particularly the case in many decision making settings. For example, when the representation of actions is generated from a deep neural network, it is reasonable to expect a low-rank structure whereas conventional structures like sparsity are not valid anymore. Subspace recovery methods, such as Principle… ▽ More

    Submitted 27 January, 2019; originally announced January 2019.