arXiv:1202.3079 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords bandit feedback, minimax policies, action set, computationally efficient algorithm, minimax optimal regret bounds Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset