arXiv:2210.11348 [cs.LG]

Hypernetworks in Meta-Reinforcement Learning

Jacob Beck, Matthew Thomas Jackson, Risto Vuorio, Shimon Whiteson

Published 2022-10-20 (Version 1)

Training a reinforcement learning (RL) agent on a real-world robotics task remains generally impractical due to sample inefficiency. Multi-task RL and meta-RL aim to improve sample efficiency by generalizing over a distribution of related tasks. However, doing so is difficult in practice: in multi-task RL, state-of-the-art methods often fail to outperform a degenerate solution that simply learns each task separately. Hypernetworks are a promising path forward, since they can replicate the separate policies of the degenerate solution while still allowing for generalization across tasks, and they are also applicable to meta-RL. However, evidence from supervised learning suggests that hypernetwork performance is highly sensitive to initialization. In this paper, we 1) show that hypernetwork initialization is also a critical factor in meta-RL, and that naive initializations yield poor performance; 2) propose a novel hypernetwork initialization scheme that matches or exceeds the performance of a state-of-the-art approach proposed for the supervised setting, while being simpler and more general; and 3) use this method to show that hypernetworks can improve performance in meta-RL, evaluating on multiple simulated robotics benchmarks.
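To make the idea concrete, the sketch below shows what a hypernetwork looks like in this setting: a small network maps a task embedding to the full weight vector of a per-task policy MLP, so each task effectively gets its own policy while the hypernetwork itself is shared. This is an illustrative sketch only, not the paper's code or its proposed initialization scheme; the layer sizes, the task-embedding interface, and the simple down-scaling of the hypernetwork's output layer (one common way to keep generated weights at a sensible scale at the start of training) are all assumptions made for illustration.

# Illustrative sketch (assumed sizes and init, not the authors' method):
# a hypernetwork generating the parameters of a one-hidden-layer policy.
import torch
import torch.nn as nn

class HyperPolicy(nn.Module):
    def __init__(self, task_dim=8, obs_dim=17, act_dim=6, hidden=64, embed=128):
        super().__init__()
        self.obs_dim, self.act_dim, self.hidden = obs_dim, act_dim, hidden
        # Total parameters to generate: W1, b1, W2, b2 of the base policy.
        n_params = hidden * obs_dim + hidden + act_dim * hidden + act_dim
        self.hyper = nn.Sequential(
            nn.Linear(task_dim, embed), nn.ReLU(),
            nn.Linear(embed, n_params),
        )
        # A naive default init of the output head tends to produce generated
        # weights at the wrong scale; shrinking it is one simple (assumed) remedy.
        nn.init.uniform_(self.hyper[-1].weight, -1e-3, 1e-3)
        nn.init.zeros_(self.hyper[-1].bias)

    def forward(self, task_emb, obs):
        p = self.hyper(task_emb)                 # generated policy parameters
        h, o, a = self.hidden, self.obs_dim, self.act_dim
        i = 0
        W1 = p[i:i + h * o].view(h, o); i += h * o
        b1 = p[i:i + h]; i += h
        W2 = p[i:i + a * h].view(a, h); i += a * h
        b2 = p[i:i + a]
        x = torch.tanh(obs @ W1.T + b1)          # base policy forward pass
        return x @ W2.T + b2                     # action mean / logits

# Usage: one task embedding yields one task-specific policy.
policy = HyperPolicy()
action = policy(torch.randn(8), torch.randn(17))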

Related articles:
arXiv:2206.03271 [cs.LG] (Published 2022-06-07)
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning
arXiv:1901.08162 [cs.LG] (Published 2019-01-23)
Causal Reasoning from Meta-reinforcement Learning
arXiv:2301.08028 [cs.LG] (Published 2023-01-19)
A Survey of Meta-Reinforcement Learning