arXiv:2201.08536 [stat.ML]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords reinforcement learning, instance-optimal algorithm, early stopping, optimal value estimation problem, obtaining sharp instance-dependent confidence regions Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset