arXiv:2305.07036 [cs.LG]
GFlowNets with Human Feedback
Yinchuan Li, Shuang Luo, Yunfeng Shao, Jianye Hao
Published 2023-05-11 (Version 1)
We propose the GFlowNets with Human Feedback (GFlowHF) framework to improve exploration when training AI models. For tasks where the reward is unknown, we fit a reward function from human evaluations of different trajectories. The goal of GFlowHF is to learn a policy that samples in strict proportion to the human ratings, rather than focusing only on the highest-rated outcomes as RLHF does. Experiments show that GFlowHF achieves better exploration than RLHF.
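The distinction from RLHF is the training target: a GFlowNet is trained so that the probability of sampling a terminal object is proportional to its reward, here a reward model regressed from human ratings, rather than trained to maximize that reward. The following is a minimal sketch of that idea using a trajectory-balance-style GFlowNet objective in PyTorch; the names RatingRewardModel and trajectory_balance_loss, the softplus output, and the network sizes are illustrative assumptions, not the paper's actual implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class RatingRewardModel(nn.Module):
    """Regresses a positive reward R(x) from a terminal state, fitted to
    scalar human ratings of complete trajectories (illustrative only)."""
    def __init__(self, state_dim: int, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(), nn.Linear(hidden, 1)
        )

    def forward(self, terminal_state: torch.Tensor) -> torch.Tensor:
        # Softplus keeps the predicted reward strictly positive so that
        # "policy proportional to rating" is well defined.
        return F.softplus(self.net(terminal_state)).squeeze(-1)


def trajectory_balance_loss(log_z: torch.Tensor,
                            log_pf: torch.Tensor,
                            log_pb: torch.Tensor,
                            reward: torch.Tensor) -> torch.Tensor:
    """Trajectory balance: (log Z + sum log P_F - sum log P_B - log R(x))^2.
    At its optimum the sampler draws terminal objects x with probability
    proportional to R(x), instead of concentrating on the single
    highest-rated x as reward maximization would."""
    return (log_z + log_pf.sum(-1) - log_pb.sum(-1)
            - reward.clamp_min(1e-8).log()).pow(2).mean()
```

Under this objective, low- but nonzero-rated regions of the space are still sampled in proportion to their ratings, which is the source of the exploration advantage claimed over RLHF-style reward maximization.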
Related articles:
Reinforcement Learning from Human Feedback with Active Queries
The MineRL BASALT Competition on Learning from Human Feedback, Rohin Shah et al. (arXiv:2107.01969 [cs.LG], published 2021-07-05)
Adaptive Preference Scaling for Reinforcement Learning with Human Feedback (arXiv:2406.02764 [cs.LG], published 2024-06-04)