Showing 1–6 of 6 results for author: Lewis, F L

Searching in archive cs.
  1. arXiv:2301.01997  [pdf, ps, other]

    cs.LG eess.SY

    Data-Driven Inverse Reinforcement Learning for Expert-Learner Zero-Sum Games

    Authors: Wenqian Xue, Bosen Lian, Jialu Fan, Tianyou Chai, Frank L. Lewis

    Abstract: In this paper, we formulate inverse reinforcement learning (IRL) as an expert-learner interaction whereby the optimal performance intent of an expert or target agent is unknown to a learner agent. The learner observes the states and controls of the expert and hence seeks to reconstruct the expert's cost function intent and thus mimics the expert's optimal response. Next, we add non-cooperative dis…

    Submitted 5 January, 2023; originally announced January 2023.

    Comments: 9 pages, 3 figures

  2. arXiv:2112.14676  [pdf, other]

    eess.SY cs.AI math.OC nlin.AO

    Learning nonlinear dynamics in synchronization of knowledge-based leader-following networks

    Authors: Shimin Wang, Xiangyu Meng, Hongwei Zhang, Frank L. Lewis

    Abstract: Knowledge-based leader-following synchronization of heterogeneous nonlinear multi-agent systems is a challenging problem since the leader's dynamic information is unknown to any follower node. This paper proposes a learning-based fully distributed observer for a class of nonlinear leader systems, which can simultaneously learn the leader's dynamics and states. This class of leader dynamics is rath…

    Submitted 18 July, 2022; v1 submitted 29 December, 2021; originally announced December 2021.

  3. arXiv:2101.00202   

    math.OC cs.MA eess.SY

    Sequential Convex Programming for Collaboration of Connected and Automated Vehicles

    Authors: Xiaoxue Zhang, Jun Ma, Zilong Cheng, Frank L. Lewis, Tong Heng Lee

    Abstract: This paper investigates the collaboration of multiple connected and automated vehicles (CAVs) in different scenarios. In general, the collaboration of CAVs can be formulated as a nonlinear and nonconvex model predictive control (MPC) problem. Most of the existing approaches available for utilization to solve such an optimization problem suffer from the drawback of considerable computational burden…

    Submitted 24 July, 2022; v1 submitted 1 January, 2021; originally announced January 2021.

    Comments: With internal discussions and upon agreement from all co-authors, we would like to withdraw this preprint

  4. arXiv:2101.00201  [pdf, other]

    cs.MA eess.SY

    Semi-Definite Relaxation Based ADMM for Cooperative Planning and Control of Connected Autonomous Vehicles

    Authors: Xiaoxue Zhang, Zilong Cheng, Jun Ma, Sunan Huang, Frank L. Lewis, Tong Heng Lee

    Abstract: This paper investigates the cooperative planning and control problem for multiple connected autonomous vehicles (CAVs) in different scenarios. In the existing literature, most of the methods suffer from significant problems in computational efficiency. Besides, as the optimization problem is nonlinear and nonconvex, it typically poses great difficulty in determining the optimal solution. To addre…

    Submitted 1 January, 2021; originally announced January 2021.

    Comments: 11 pages, 8 figures

  5. arXiv:2001.08092  [pdf, other]

    cs.LG cs.RO eess.SY stat.ML

    Local Policy Optimization for Trajectory-Centric Reinforcement Learning

    Authors: Patrik Kolaric, Devesh K. Jha, Arvind U. Raghunathan, Frank L. Lewis, Mouhacine Benosman, Diego Romeres, Daniel Nikovski

    Abstract: The goal of this paper is to present a method for simultaneous trajectory and local stabilizing policy optimization to generate local policies for trajectory-centric model-based reinforcement learning (MBRL). This is motivated by the fact that global policy optimization for non-linear systems could be a very challenging problem both algorithmically and numerically. However, a lot of robotic manipu…

    Submitted 22 January, 2020; originally announced January 2020.

    Journal ref: ICRA 2020

  6. arXiv:1810.11548   

    eess.SY cs.MA

    On the Identifiability of the Influence Model for Stochastic Spatiotemporal Spread Processes

    Authors: Chenyuan He, Yan Wan, Frank L. Lewis

    Abstract: The influence model is a discrete-time stochastic model that succinctly captures the interactions of a network of Markov chains. The model produces a reduced-order representation of the stochastic network, and can be used to describe and tractably analyze probabilistic spatiotemporal spread dynamics, and hence has found broad usage in network applications such as social networks, traffic managemen…

    Submitted 6 November, 2018; v1 submitted 26 October, 2018; originally announced October 2018.

    Comments: This temporary draft version of this paper has caused conflict of interest and we request to withdraw this paper from arXiv