arXiv:1704.03926 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords multi-armed bandit, value directed exploration, structured priors, machine learning problem requiring, utilizes value-function-driven online planning techniques Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset