arXiv Analytics

Sign in

arXiv:2303.13489 [cs.LG]AbstractReferencesReviewsResources

Boosting Reinforcement Learning and Planning with Demonstrations: A Survey

Tongzhou Mu, Hao Su

Published 2023-03-23Version 1

Although reinforcement learning has seen tremendous success recently, this kind of trial-and-error learning can be impractical or inefficient in complex environments. The use of demonstrations, on the other hand, enables agents to benefit from expert knowledge rather than having to discover the best action to take through exploration. In this survey, we discuss the advantages of using demonstrations in sequential decision making, various ways to apply demonstrations in learning-based decision making paradigms (for example, reinforcement learning and planning in the learned models), and how to collect the demonstrations in various scenarios. Additionally, we exemplify a practical pipeline for generating and utilizing demonstrations in the recently proposed ManiSkill robot learning benchmark.

Related articles: Most relevant | Search more
arXiv:1809.05127 [cs.LG] (Published 2018-09-13)
IL-Net: Using Expert Knowledge to Guide the Design of Furcated Neural Networks
arXiv:2009.14108 [cs.LG] (Published 2020-09-29)
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
arXiv:1907.11105 [cs.LG] (Published 2019-07-24)
The Good, the Bad and the Ugly: Augmenting a black-box model with expert knowledge