arXiv:2210.05431 Abstract | arXiv Analytics

arXiv:2210.05431 [stat.ML]

Non-Asymptotic Analysis of a UCB-based Top Two Algorithm

Published 2022-10-11, Version 1

A Top Two sampling rule for bandit identification is a method that selects the next arm to sample from among two candidate arms, a leader and a challenger. Due to their simplicity and good empirical performance, Top Two methods have received increased attention in recent years. For fixed-confidence best arm identification, theoretical guarantees for Top Two methods have only been obtained in the asymptotic regime, when the error level vanishes. We derive the first non-asymptotic upper bound on the expected sample complexity of a Top Two algorithm, holding for any error level. Our analysis highlights sufficient properties for a regret minimization algorithm to be used as leader. These properties are satisfied by the UCB algorithm, and our proposed UCB-based Top Two algorithm simultaneously enjoys non-asymptotic guarantees and competitive empirical performance.
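The leader/challenger mechanism described in the abstract can be illustrated with a short Python sketch. It is not the paper's algorithm. The UCB bonus form, the Gaussian-style challenger cost, the randomized choice with probability `beta`, and the fixed sampling budget (used here in place of a fixed-confidence stopping rule) are all assumptions made for illustration, and every function name is hypothetical.

```python
import math
import random


def ucb_leader(means, counts, t, bonus_scale=2.0):
    """Leader: arm with the largest upper confidence bound.

    The bonus sqrt(bonus_scale * log(t) / N) is an assumed, textbook-style form.
    """
    return max(
        range(len(means)),
        key=lambda i: means[i] + math.sqrt(bonus_scale * math.log(t) / counts[i]),
    )


def challenger(means, counts, leader):
    """Challenger: the arm that is hardest to separate from the leader.

    Uses an assumed Gaussian-style cost (gap scaled by the sampling counts);
    an arm whose empirical mean is at least the leader's has cost zero.
    """
    def cost(j):
        gap = max(means[leader] - means[j], 0.0)
        return gap / math.sqrt(1.0 / counts[leader] + 1.0 / counts[j])

    return min((j for j in range(len(means)) if j != leader), key=cost)


def top_two_step(means, counts, t, beta=0.5, rng=random):
    """Sample the leader with probability beta, otherwise the challenger."""
    leader = ucb_leader(means, counts, t)
    chal = challenger(means, counts, leader)
    return leader if rng.random() < beta else chal


def run(true_means, budget, beta=0.5, seed=0):
    """Simulate unit-variance Gaussian arms for a fixed budget (illustration only)."""
    rng = random.Random(seed)
    k = len(true_means)
    counts = [0] * k
    sums = [0.0] * k
    for t in range(1, budget + 1):
        if t <= k:
            arm = t - 1  # initialization: pull each arm once
        else:
            means = [s / n for s, n in zip(sums, counts)]
            arm = top_two_step(means, counts, t, beta, rng)
        sums[arm] += rng.gauss(true_means[arm], 1.0)
        counts[arm] += 1
    means = [s / n for s, n in zip(sums, counts)]
    best = max(range(k), key=lambda i: means[i])
    return best, counts
```

Calling `run([0.0, 1.0, 0.2], budget=2000)` returns the empirically best arm together with the per-arm sample counts. In a fixed-confidence setting the loop would instead end when a stopping rule fires, which this sketch deliberately omits.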

Comments: 32 pages, 5 figures, 3 tables

Categories: stat.ML, cs.LG

Keywords: non-asymptotic analysis, analysis highlights sufficient properties, fixed-confidence best arm identification, algorithm enjoys simultaneously non-asymptotic guarantees, first non-asymptotic upper bound

Related articles:

arXiv:2206.05979 [stat.ML] (Published 2022-06-13)

Top Two Algorithms Revisited

Marc Jourdan, Rémy Degenne, Dorian Baudry, Rianne de Heide, Emilie Kaufmann

arXiv:1707.03663 [stat.ML] (Published 2017-07-12)

Underdamped Langevin MCMC: A non-asymptotic analysis

Xiang Cheng, Niladri S. Chatterji, Peter L. Bartlett, Michael I. Jordan

arXiv:2105.02337 [stat.ML] (Published 2021-05-05)

Non-asymptotic analysis and inference for an outlyingness induced winsorized mean

Yijun Zuo
