arXiv:2311.05638 [stat.ML]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords online pac reinforcement learning, finite-horizon tabular markov decision processes, upper bounds, sample complexity, instance-optimality Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset