arXiv Analytics

Sign in

arXiv:1904.11131 [cs.LG]AbstractReferencesReviewsResources

Lipschitz Bandit Optimization with Improved Efficiency

Xu Zhu, David B. Dunson

Published 2019-04-25Version 1

We consider the Lipschitz bandit optimization problem with an emphasis on practical efficiency. Although there is rich literature on regret analysis of this type of problem, e.g., [Kleinberg et al. 2008, Bubeck et al. 2011, Slivkins 2014], their proposed algorithms suffer from serious practical problems including extreme time complexity and dependence on oracle implementations. With this motivation, we propose a novel algorithm with an Upper Confidence Bound (UCB) exploration, namely Tree UCB-Hoeffding, using adaptive partitions. Our partitioning scheme is easy to implement and does not require any oracle settings. With a tree-based search strategy, the total computational cost can be improved to $\mathcal{O}(T\log T)$ for the first $T$ iterations. In addition, our algorithm achieves the regret lower bound up to a logarithmic factor.

Related articles: Most relevant | Search more
arXiv:2501.13013 [cs.LG] (Published 2025-01-22)
The regret lower bound for communicating Markov Decision Processes
arXiv:2404.05155 [cs.LG] (Published 2024-04-08)
On the price of exact truthfulness in incentive-compatible online learning with bandit feedback: A regret lower bound for WSU-UX
arXiv:2301.01824 [cs.LG] (Published 2023-01-04)
Privacy and Efficiency of Communications in Federated Split Learning