arXiv:1812.07544 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords deep reinforcement learning, information-directed exploration, upper confidence bound algorithms, heteroscedastic observation noise, resulting exploration strategy explicitly accounts Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset