arXiv:2210.15515 Abstract | arXiv Analytics

arXiv:2210.15515 [cs.LG]Abstract References Reviews Resources

Meta-Reinforcement Learning Using Model Parameters

Published 2022-10-27Version 1

In meta-reinforcement learning, an agent is trained in multiple different environments and attempts to learn a meta-policy that can efficiently adapt to a new environment. This paper presents RAMP, a Reinforcement learning Agent using Model Parameters that utilizes the idea that a neural network trained to predict environment dynamics encapsulates the environment information. RAMP is constructed in two phases: in the first phase, a multi-environment parameterized dynamic model is learned. In the second phase, the model parameters of the dynamic model are used as context for the multi-environment policy of the model-free reinforcement learning agent.

Comments: 8 pages

Categories: cs.LG

Keywords: model parameters, meta-reinforcement learning, predict environment dynamics encapsulates, model-free reinforcement learning agent, multi-environment parameterized dynamic model

Related articles: Most relevant | Search more

arXiv:2206.03271 [cs.LG] (Published 2022-06-07)

On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning

Zhao Mandi, Pieter Abbeel, Stephen James

arXiv:1901.08162 [cs.LG] (Published 2019-01-23)

Causal Reasoning from Meta-reinforcement Learning

Ishita Dasgupta et al.

arXiv:2210.11348 [cs.LG] (Published 2022-10-20)

Hypernetworks in Meta-Reinforcement Learning

Jacob Beck, Matthew Thomas Jackson, Risto Vuorio, Shimon Whiteson

arXiv Analytics

arXiv:2210.15515 [cs.LG]Abstract References Reviews Resources

Meta-Reinforcement Learning Using Model Parameters

Links

Toolbox

arXiv:2210.15515 [cs.LG]AbstractReferencesReviewsResources

Meta-Reinforcement Learning Using Model Parameters

Links

Toolbox

arXiv:2210.15515 [cs.LG]Abstract References Reviews Resources