arXiv:1810.12558 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords deep reinforcement learning, open ai gym benchmark problems, first relative importance sampling-off-policy actor-critic Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset