arXiv:1805.03359 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords deep reinforcement learning, reward estimation, advantage actor critic, variance reduction methods, help reduce variance Tags conference paper Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset