arXiv:1805.00869 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords gradient descent, approximate temporal difference learning, reversible policies, approximate value functions, approximate td Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset