arXiv:1901.07839 [math.OC]
Keywords: Markov decision processes, peak constraints, reinforcement learning, time satisfy additional constraints, first time learning algorithms guarantee