arXiv:1512.07669 [math.OC]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords markov decision processes, stochastic approximation algorithms, reinforcement learning, concise description, suboptimal method Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset