arXiv:1706.04711 [cs.LG]
Keywords: reinforcement learning, model mismatch, stochastic gradient descent algorithms, robust approximate policy iteration, robust approximate value iteration