arXiv:1811.06521 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords human preferences, demonstrations, reward learning, deep neural network, reward function Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset