Best-Arm Identification in Linear Bandits

Soare, Marta; Lazaric, Alessandro; Munos, Rémi

Computer Science > Machine Learning

arXiv:1409.6110 (cs)

[Submitted on 22 Sep 2014 (v1), last revised 4 Nov 2014 (this version, v2)]

Title:Best-Arm Identification in Linear Bandits

Authors:Marta Soare, Alessandro Lazaric, Rémi Munos

View PDF

Abstract:We study the best-arm identification problem in linear bandit, where the rewards of the arms depend linearly on an unknown parameter $\theta^*$ and the objective is to return the arm with the largest reward. We characterize the complexity of the problem and introduce sample allocation strategies that pull arms to identify the best arm with a fixed confidence, while minimizing the sample budget. In particular, we show the importance of exploiting the global linear structure to improve the estimate of the reward of near-optimal arms. We analyze the proposed strategies and compare their empirical performance. Finally, as a by-product of our analysis, we point out the connection to the $G$-optimality criterion used in optimal experimental design.

Comments:	In Advances in Neural Information Processing Systems 27 (NIPS), 2014
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1409.6110 [cs.LG]
	(or arXiv:1409.6110v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1409.6110

Submission history

From: Marta Soare [view email]
[v1] Mon, 22 Sep 2014 08:41:02 UTC (78 KB)
[v2] Tue, 4 Nov 2014 14:21:28 UTC (79 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2014-09

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Marta Soare
Alessandro Lazaric
Rémi Munos

export BibTeX citation

Computer Science > Machine Learning

Title:Best-Arm Identification in Linear Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Best-Arm Identification in Linear Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators