arXiv:2312.01586 [math.OC]

Keywords: Markov decision processes, long-run reward CVaR, saddle point solution, history-dependent randomized policies, stationary randomized policy