arXiv Analytics

arXiv:1604.08153 [cs.LG]

Classifying Options for Deep Reinforcement Learning

Kai Arulkumaran, Nat Dilokthanakul, Murray Shanahan, Anil Anthony Bharath

Published 2016-04-27, Version 1

Deep reinforcement learning is the learning of multiple levels of hierarchical representations for reinforcement learning. Hierarchical reinforcement learning focuses on temporal abstractions in planning and learning, allowing temporally-extended actions to be transferred between tasks. In this paper we combine one method for hierarchical reinforcement learning - the options framework - with deep Q-networks (DQNs) through the use of different "option heads" on the policy network, and a supervisory network for choosing between the different options. We show that in a domain where we have prior knowledge of the mapping between states and options, our augmented DQN achieves a policy competitive with that of a standard DQN, but with much lower sample complexity. This is achieved through a straightforward architectural adjustment to the DQN, as well as an additional supervised neural network.
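
The architecture described in the abstract can be illustrated with a minimal sketch: a shared trunk feeds several "option heads", each producing its own set of Q-values, while a separate supervisory network maps the state to an option and so decides which head's Q-values drive the action. Everything named here (the class OptionHeadQNetwork, the layer sizes, the random untrained weights, the single dense layer standing in for the shared network) is an assumption made for illustration and is not taken from the paper; training of the heads and of the supervisory classifier is omitted.

```python
import random


def linear(weights, bias, x):
    """Dense layer: one output per weight row."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def relu(values):
    return [max(0.0, v) for v in values]


def init_layer(n_out, n_in, rng):
    """Random weights and zero biases (untrained, illustration only)."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def argmax(values):
    return max(range(len(values)), key=lambda i: values[i])


class OptionHeadQNetwork:
    """Hypothetical sketch: shared trunk, one Q-value head per option,
    and a separate supervisory network that picks the option."""

    def __init__(self, state_dim, hidden_dim, n_actions, n_options, seed=0):
        rng = random.Random(seed)
        self.trunk = init_layer(hidden_dim, state_dim, rng)
        self.heads = [init_layer(n_actions, hidden_dim, rng)
                      for _ in range(n_options)]
        # Supervisory network: maps the state directly to option scores.
        self.supervisor = init_layer(n_options, state_dim, rng)

    def q_values(self, state):
        """Return one list of Q-values per option head."""
        features = relu(linear(*self.trunk, state))
        return [linear(*head, features) for head in self.heads]

    def select_option(self, state):
        """Pick the highest-scoring option according to the supervisor."""
        return argmax(linear(*self.supervisor, state))

    def act(self, state):
        """Greedy action from the head chosen by the supervisor."""
        option = self.select_option(state)
        return option, argmax(self.q_values(state)[option])


net = OptionHeadQNetwork(state_dim=4, hidden_dim=8, n_actions=3, n_options=2)
option, action = net.act([0.1, -0.2, 0.3, 0.0])
```

In a setting like the one the abstract describes, where the mapping between states and options is known in advance, the supervisory network could be fit with ordinary supervised learning on (state, option) pairs, while each head is updated only on transitions belonging to its option. That split is one plausible reading of the abstract, not a confirmed detail of the authors' method.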

Related articles:
arXiv:1811.04407 [cs.LG] (Published 2018-11-11)
An Initial Attempt of Combining Visual Selective Attention with Deep Reinforcement Learning
arXiv:1806.01175 [cs.LG] (Published 2018-06-04)
TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning
arXiv:1803.11115 [cs.LG] (Published 2018-03-29)
Deep Reinforcement Learning for Traffic Light Control in Vehicular Networks