arXiv:1004.5229 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords reinforcement learning, kullback-leibler divergence, guarantee near-optimal regret bounds, linear maximization problem, solving kl-optimistic extended value iteration Tags conference paper, journal article Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset