arXiv:2008.05523 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords bandit feedback, non-stochastic control, loss function, efficient sublinear regret algorithm, main algorithmic difficulty Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset