arXiv:2402.17747 [cs.LG]
Keywords: human feedback, reinforcement learning, ais deceive, challenges, return function