Showing 1–1 of 1 results for author: Klima, R

  1. arXiv:1901.08021

    cs.LG cs.MA stat.ML

    Robust Temporal Difference Learning for Critical Domains

    Authors: Richard Klima, Daan Bloembergen, Michael Kaisers, Karl Tuyls

    Abstract: We present a new Q-function operator for temporal difference (TD) learning methods that explicitly encodes robustness against significant rare events (SRE) in critical domains. The operator, which we call the $κ$-operator, makes it possible to learn a robust policy in a model-based fashion without actually observing the SRE. We introduce single- and multi-agent robust TD methods using the operator $κ$. We pr…

    Submitted 13 March, 2019; v1 submitted 23 January, 2019; originally announced January 2019.

    Comments: AAMAS 2019