arXiv:2401.04056 [cs.LG]
Keywords: human feedback, reinforcement learning, minimaximalist approach, social choice theory literature, stochastic preferences