arXiv:2210.15515 [cs.LG]
Meta-Reinforcement Learning Using Model Parameters
Published 2022-10-27, Version 1
In meta-reinforcement learning, an agent is trained in multiple different environments and attempts to learn a meta-policy that can efficiently adapt to a new environment. This paper presents RAMP, a Reinforcement learning Agent using Model Parameters that utilizes the idea that a neural network trained to predict environment dynamics encapsulates the environment information. RAMP is constructed in two phases: in the first phase, a multi-environment parameterized dynamic model is learned. In the second phase, the model parameters of the dynamic model are used as context for the multi-environment policy of the model-free reinforcement learning agent.
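The two-phase idea can be illustrated with a deliberately simplified sketch. The abstract does not specify RAMP's architecture or training procedure, so everything below is an assumption made for illustration: the environments are hypothetical linear systems s' = A s + B a, the "dynamics model" is a least-squares linear fit rather than a neural network, and the policy is an untrained linear map. The function names (`collect_transitions`, `fit_dynamics_params`, `policy`) are invented here and are not taken from the paper.

```python
import numpy as np

def collect_transitions(A, B, n=200, seed=0):
    """Sample random transitions from a hypothetical linear environment s' = A s + B a."""
    rng = np.random.default_rng(seed)
    s = rng.normal(size=(n, A.shape[0]))
    a = rng.normal(size=(n, B.shape[1]))
    s_next = s @ A.T + a @ B.T
    return s, a, s_next

def fit_dynamics_params(s, a, s_next):
    """Phase 1 (simplified stand-in): fit s' ~ [s, a] @ W by least squares.

    The flattened W plays the role of the per-environment model parameters.
    """
    x = np.hstack([s, a])
    W, *_ = np.linalg.lstsq(x, s_next, rcond=None)
    return W.ravel()

def policy(state, context, theta):
    """Phase 2 (simplified stand-in): linear policy on the state and the parameter context."""
    return theta @ np.concatenate([state, context])

# Two hypothetical environments that differ only in their dynamics.
A1, B1 = np.array([[0.9, 0.0], [0.0, 0.9]]), np.array([[1.0], [0.0]])
A2, B2 = np.array([[0.5, 0.1], [0.0, 0.5]]), np.array([[0.0], [1.0]])

ctx1 = fit_dynamics_params(*collect_transitions(A1, B1))
ctx2 = fit_dynamics_params(*collect_transitions(A2, B2))

# Placeholder policy weights; a model-free RL algorithm would train these.
theta = np.zeros((1, 2 + ctx1.size))
action = policy(np.ones(2), ctx1, theta)
```

Because the two environments have different dynamics, `ctx1` and `ctx2` differ, which is what lets a single policy distinguish between environments. Training `theta` with a model-free algorithm is omitted from this sketch.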
Comments: 8 pages
Categories: cs.LG
Related articles:
arXiv:2206.03271 [cs.LG] (Published 2022-06-07)
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning
arXiv:1901.08162 [cs.LG] (Published 2019-01-23)
Causal Reasoning from Meta-reinforcement Learning
Ishita Dasgupta et al.
arXiv:2210.11348 [cs.LG] (Published 2022-10-20)
Hypernetworks in Meta-Reinforcement Learning