Showing 1–3 of 3 results for author: Zini, M S

  1. arXiv:2310.20007

    stat.ML cs.AI cs.LG

    Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning

    Authors: Ahmadreza Moradipari, Mohammad Pedramfar, Modjtaba Shokrian Zini, Vaneet Aggarwal

    Abstract: In this paper, we prove the first Bayesian regret bounds for Thompson Sampling in reinforcement learning in a multitude of settings. We simplify the learning problem using a discrete set of surrogate environments, and present a refined analysis of the information ratio using posterior consistency. This leads to an upper bound of order $\widetilde{O}(H\sqrt{d_{l_1}T})$ in the time inhomogeneous rei…

    Submitted 6 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  2. Symmetry Protected Quantum Computation

    Authors: Michael H. Freedman, Matthew B. Hastings, Modjtaba Shokrian Zini

    Abstract: We consider a model of quantum computation using qubits where it is possible to measure whether a given pair are in a singlet (total spin $0$) or triplet (total spin $1$) state. The physical motivation is that we can do these measurements in a way that is protected against revealing other information so long as all terms in the Hamiltonian are $SU(2)$-invariant. We conjecture that this model is eq…

    Submitted 26 September, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: To be published in Quantum Journal

    Journal ref: Quantum 5, 554 (2021)

  3. arXiv:2001.10474

    cs.LG cs.AI stat.ML

    Coagent Networks Revisited

    Authors: Modjtaba Shokrian Zini, Mohammad Pedramfar, Matthew Riemer, Ahmadreza Moradipari, Miao Liu

    Abstract: Coagent networks formalize the concept of arbitrary networks of stochastic agents that collaborate to take actions in a reinforcement learning environment. Prominent examples of coagent networks in action include approaches to hierarchical reinforcement learning (HRL), such as those using options, which attempt to address the exploration exploitation trade-off by introducing abstract actions at di…

    Submitted 29 August, 2023; v1 submitted 28 January, 2020; originally announced January 2020.

    Comments: Reformatted paper significantly and clarified results on the asynchronous case