Stateful Posted Pricing with Vanishing Regret via Dynamic Deterministic Markov Decision Processes

Emek, Yuval; Lavi, Ron; Niazadeh, Rad; Shi, Yangguang

Computer Science > Computer Science and Game Theory

arXiv:2005.01869 (cs)

[Submitted on 4 May 2020 (v1), last revised 28 Jun 2020 (this version, v2)]

Title:Stateful Posted Pricing with Vanishing Regret via Dynamic Deterministic Markov Decision Processes

Authors:Yuval Emek, Ron Lavi, Rad Niazadeh, Yangguang Shi

View PDF

Abstract:In this paper, a rather general online problem called dynamic resource allocation with capacity constraints (DRACC) is introduced and studied in the realm of posted price mechanisms. This problem subsumes several applications of stateful pricing, including but not limited to posted prices for online job scheduling and matching over a dynamic bipartite graph. As the existing online learning techniques do not yield vanishing-regret mechanisms for this problem, we develop a novel online learning framework defined over deterministic Markov decision processes with dynamic state transition and reward functions. We then prove that if the Markov decision process is guaranteed to admit an oracle that can simulate any given policy from any initial state with bounded loss -- a condition that is satisfied in the DRACC problem -- then the online learning problem can be solved with vanishing regret. Our proof technique is based on a reduction to online learning with switching cost, in which an online decision maker incurs an extra cost every time she switches from one arm to another. We formally demonstrate this connection and further show how DRACC can be used in our proposed applications of stateful pricing.

Comments:	24 pages
Subjects:	Computer Science and Game Theory (cs.GT)
Cite as:	arXiv:2005.01869 [cs.GT]
	(or arXiv:2005.01869v2 [cs.GT] for this version)
	https://doi.org/10.48550/arXiv.2005.01869

Submission history

From: Yangguang Shi [view email]
[v1] Mon, 4 May 2020 22:02:18 UTC (40 KB)
[v2] Sun, 28 Jun 2020 17:35:15 UTC (50 KB)

🚨2024-09-29: arxiv.org is experiencing DB issues.🚨

Computer Science > Computer Science and Game Theory

Title:Stateful Posted Pricing with Vanishing Regret via Dynamic Deterministic Markov Decision Processes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

🚨2024-09-29: arxiv.org is experiencing DB issues.🚨

Computer Science > Computer Science and Game Theory

Title:Stateful Posted Pricing with Vanishing Regret via Dynamic Deterministic Markov Decision Processes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators