arXiv:2107.02377 Abstract | arXiv Analytics

arXiv:2107.02377 [cs.LG]Abstract References Reviews Resources

A Short Note on the Relationship of Information Gain and Eluder Dimension

Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei

Published 2021-07-06Version 1

Eluder dimension and information gain are two widely used methods of complexity measures in bandit and reinforcement learning. Eluder dimension was originally proposed as a general complexity measure of function classes, but the common examples of where it is known to be small are function spaces (vector spaces). In these cases, the primary tool to upper bound the eluder dimension is the elliptic potential lemma. Interestingly, the elliptic potential lemma also features prominently in the analysis of linear bandits/reinforcement learning and their nonparametric generalization, the information gain. We show that this is not a coincidence -- eluder dimension and information gain are equivalent in a precise sense for reproducing kernel Hilbert spaces.

Categories: cs.LG, cs.AI, math.OC, stat.ML

Keywords: eluder dimension, information gain, short note, elliptic potential lemma, relationship

Related articles: Most relevant | Search more

arXiv:2409.13232 [cs.LG] (Published 2024-09-20)

Relationship between Uncertainty in DNNs and Adversarial Attacks

Abigail Adeniran, Adewale Adeyemo

arXiv:2007.10297 [cs.LG] (Published 2020-07-20)

A Short Note on Soft-max and Policy Gradients in Bandits Problems

Neil Walton

arXiv:2007.03742 [cs.LG] (Published 2020-07-07)

Meta-active Learning in Probabilistically-Safe Optimization

Mariah L. Schrum, Mark Connolly, Eric Cole, Mihir Ghetiya, Robert Gross, Matthew C. Gombolay

arXiv Analytics

arXiv:2107.02377 [cs.LG]Abstract References Reviews Resources

A Short Note on the Relationship of Information Gain and Eluder Dimension

Links

Toolbox

arXiv:2107.02377 [cs.LG]AbstractReferencesReviewsResources

A Short Note on the Relationship of Information Gain and Eluder Dimension

Links

Toolbox

arXiv:2107.02377 [cs.LG]Abstract References Reviews Resources