arXiv:1904.11131 Abstract | arXiv Analytics

arXiv:1904.11131 [cs.LG]Abstract References Reviews Resources

Lipschitz Bandit Optimization with Improved Efficiency

Published 2019-04-25Version 1

We consider the Lipschitz bandit optimization problem with an emphasis on practical efficiency. Although there is rich literature on regret analysis of this type of problem, e.g., [Kleinberg et al. 2008, Bubeck et al. 2011, Slivkins 2014], their proposed algorithms suffer from serious practical problems including extreme time complexity and dependence on oracle implementations. With this motivation, we propose a novel algorithm with an Upper Confidence Bound (UCB) exploration, namely Tree UCB-Hoeffding, using adaptive partitions. Our partitioning scheme is easy to implement and does not require any oracle settings. With a tree-based search strategy, the total computational cost can be improved to $\mathcal{O}(T\log T)$ for the first $T$ iterations. In addition, our algorithm achieves the regret lower bound up to a logarithmic factor.

Comments: 12 pages, 0 figure

Categories: cs.LG, cs.AI, math.OC, stat.ML

Keywords: efficiency, lipschitz bandit optimization problem, regret lower bound, total computational cost, upper confidence bound

Related articles: Most relevant | Search more

arXiv:2501.13013 [cs.LG] (Published 2025-01-22)

The regret lower bound for communicating Markov Decision Processes

Victor Boone, Odalric-Ambrym Maillard

arXiv:2404.05155 [cs.LG] (Published 2024-04-08)

On the price of exact truthfulness in incentive-compatible online learning with bandit feedback: A regret lower bound for WSU-UX

Ali Mortazavi, Junhao Lin, Nishant A. Mehta

arXiv:2301.01824 [cs.LG] (Published 2023-01-04)

Privacy and Efficiency of Communications in Federated Split Learning