arXiv:2401.05794 Abstract | arXiv Analytics

arXiv:2401.05794 [cs.LG]Abstract References Reviews Resources

Bounds on the price of feedback for mistake-bounded online learning

Published 2024-01-11Version 1

We improve several worst-case bounds for various online learning scenarios from (Auer and Long, Machine Learning, 1999). In particular, we sharpen an upper bound for delayed ambiguous reinforcement learning by a factor of 2, an upper bound for learning compositions of families of functions by a factor of 2.41, and an upper bound for agnostic learning by a factor of 1.09. We also improve a lower bound from the same paper for learning compositions of $k$ families of functions by a factor of $\Theta(\ln{k})$, matching the upper bound up to a constant factor. In addition, we solve a problem from (Long, Theoretical Computer Science, 2020) on the price of bandit feedback with respect to standard feedback for multiclass learning, and we improve an upper bound from (Feng et al., Theoretical Computer Science, 2023) on the price of $r$-input delayed ambiguous reinforcement learning by a factor of $r$, matching a lower bound from the same paper up to the leading term.

Categories: cs.LG, cs.DM, math.CO

Keywords: upper bound, mistake-bounded online learning, delayed ambiguous reinforcement learning, theoretical computer science, lower bound

Related articles: Most relevant | Search more

arXiv:2106.11692 [cs.LG] (Published 2021-06-22)

A Unified Framework for Conservative Exploration

Yunchang Yang et al.

arXiv:2102.04939 [cs.LG] (Published 2021-02-09)

RL for Latent MDPs: Regret Guarantees and a Lower Bound

Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor

arXiv:1806.02970 [cs.LG] (Published 2018-06-08)

PAC Ranking from Pairwise and Listwise Queries: Lower Bounds and Upper Bounds

Wenbo Ren, Jia Liu, Ness B. Shroff

arXiv Analytics

arXiv:2401.05794 [cs.LG]Abstract References Reviews Resources

Bounds on the price of feedback for mistake-bounded online learning

Links

Toolbox

arXiv:2401.05794 [cs.LG]AbstractReferencesReviewsResources

Bounds on the price of feedback for mistake-bounded online learning

Links

Toolbox

arXiv:2401.05794 [cs.LG]Abstract References Reviews Resources