Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: Leahy, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.02951  [pdf, ps, other

    math.OC cs.LG math.PR

    A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces

    Authors: Bekzhan Kerimkulov, James-Michael Leahy, David Siska, Lukasz Szpruch, Yufei Zhang

    Abstract: We study the global convergence of a Fisher-Rao policy gradient flow for infinite-horizon entropy-regularised Markov decision processes with Polish state and action space. The flow is a continuous-time analogue of a policy mirror descent method. We establish the global well-posedness of the gradient flow and demonstrate its exponential convergence to the optimal policy. Moreover, we prove the flow… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    MSC Class: 90C40; 93E20; 90C26; 60B05; 90C53

  2. arXiv:2202.03188  [pdf

    cs.AI

    Knowledge-Integrated Informed AI for National Security

    Authors: Anu K. Myne, Kevin J. Leahy, Ryan J. Soklaski

    Abstract: The state of artificial intelligence technology has a rich history that dates back decades and includes two fall-outs before the explosive resurgence of today, which is credited largely to data-driven techniques. While AI technology has and continues to become increasingly mainstream with impact across domains and industries, it's not without several drawbacks, weaknesses, and potential to cause u… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

    Report number: Technical Report TR-1272

  3. arXiv:2201.07296  [pdf, ps, other

    math.OC cs.AI cs.LG math.PR stat.ML

    Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime

    Authors: Bekzhan Kerimkulov, James-Michael Leahy, David Šiška, Lukasz Szpruch

    Abstract: We study the global convergence of policy gradient for infinite-horizon, continuous state and action space, and entropy-regularized Markov decision processes (MDPs). We consider a softmax policy with (one-hidden layer) neural network approximation in a mean-field regime. Additional entropic regularization in the associated mean-field probability measure is added, and the corresponding gradient flo… ▽ More

    Submitted 16 June, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

  4. arXiv:1901.00984  [pdf, other

    quant-ph cs.DS

    Quantum Insertion-Deletion Channels

    Authors: Janet Leahy, Dave Touchette, Penghui Yao

    Abstract: We introduce a model of quantum insertion-deletion (insdel) channels. Insdel channels are meant to represent, for example, synchronization errors arising in data transmission. In the classical setting, they represent a strict generalization of the better-understood corruption error channels, and until recently, had mostly resisted effort toward a similar understanding as their corruption counterpa… ▽ More

    Submitted 4 January, 2019; originally announced January 2019.

  5. arXiv:1803.04813  [pdf

    cs.LG cs.CE cs.NE physics.data-an

    Artificial neural network based modelling approach for municipal solid waste gasification in a fluidized bed reactor

    Authors: Daya Shankar Pandey, Saptarshi Das, Indranil Pan, James J. Leahy, Witold Kwapinski

    Abstract: In this paper, multi-layer feed forward neural networks are used to predict the lower heating value of gas (LHV), lower heating value of gasification products including tars and entrained char (LHVp) and syngas yield during gasification of municipal solid waste (MSW) during gasification in a fluidized bed reactor. These artificial neural networks (ANNs) with different architectures are trained usi… ▽ More

    Submitted 5 February, 2018; originally announced March 2018.

    Comments: 34 pages, 11 figures

    Journal ref: Waste Management (Elsevier), Volume 58, December 2016, Pages 202-213