Non-ergodicity in reinforcement learning: robustness via ergodicity transformations

Baumann, Dominik; Noorani, Erfaun; Price, James; Peters, Ole; Connaughton, Colm; Schön, Thomas B.

arXiv:2310.11335 (cs)

[Submitted on 17 Oct 2023 (v1), last revised 10 Apr 2024 (this version, v2)]

Abstract: Envisioned application areas for reinforcement learning (RL) include autonomous driving, precision agriculture, and finance, which all require RL agents to make decisions in the real world. A significant challenge hindering the adoption of RL methods in these domains is the non-robustness of conventional algorithms. In this paper, we argue that a fundamental issue contributing to this lack of robustness lies in the focus on the expected value of the return as the sole "correct" optimization objective. The expected value is the average over the statistical ensemble of infinitely many trajectories. For non-ergodic returns, this average differs from the average over a single but infinitely long trajectory. Consequently, optimizing the expected value can lead to policies that yield exceptionally high returns with probability zero but almost surely result in catastrophic outcomes. This problem can be circumvented by transforming the time series of collected returns into one with ergodic increments. This transformation enables learning robust policies by optimizing the long-term return for individual agents rather than the average across infinitely many trajectories. We propose an algorithm for learning ergodicity transformations from data and demonstrate its effectiveness in an instructive, non-ergodic environment and on standard RL benchmarks.
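To make the gap between the ensemble average and the single-trajectory average concrete, here is a minimal sketch of a multiplicative coin-toss game. It is an illustration only, not the algorithm proposed in the paper: the factors 1.5 and 0.6, the fair coin, and all function names are assumptions chosen for this example. Under these assumptions the expected wealth grows by a factor of 1.05 per step, while the growth rate of the logarithm of wealth along one long trajectory is negative. The logarithm plays the role of a transformation with ergodic increments here because the dynamics are purely multiplicative.

```python
import math
import random

# Hypothetical multiplicative factors, each applied with probability 0.5.
UP, DOWN = 1.5, 0.6


def expected_growth_factor():
    """Per-step growth factor of the ensemble average (expected value)."""
    return 0.5 * UP + 0.5 * DOWN


def time_average_growth_rate():
    """Per-step growth rate of log-wealth along one infinitely long trajectory."""
    return 0.5 * math.log(UP) + 0.5 * math.log(DOWN)


def simulate_log_wealth(steps, rng):
    """Log of final wealth after `steps` rounds, starting from wealth 1."""
    log_wealth = 0.0
    for _ in range(steps):
        # Increments of log-wealth are i.i.d., hence ergodic.
        log_wealth += math.log(UP if rng.random() < 0.5 else DOWN)
    return log_wealth


if __name__ == "__main__":
    rng = random.Random(0)
    steps = 10_000
    print("expected growth factor per step:", expected_growth_factor())
    print("time-average growth rate per step:", time_average_growth_rate())
    print("empirical growth rate:", simulate_log_wealth(steps, rng) / steps)
```

An agent that maximizes the expected value would accept this game at every step, even though almost every individual trajectory decays toward zero. Judging the game by the increments of log-wealth instead gives the verdict that matches what a single agent experiences over time.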

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2310.11335 [cs.LG]
	(or arXiv:2310.11335v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.11335

Submission history

From: Dominik Baumann
[v1] Tue, 17 Oct 2023 15:13:33 UTC (763 KB)
[v2] Wed, 10 Apr 2024 19:15:07 UTC (677 KB)
