arXiv:1907.05634 [cs.LG]
Keywords: value functions, learning self-correctable policies, negative sampling, demonstrations, reinforcement learning algorithm