arXiv:1612.08810 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords end-to-end learning, conventional deep neural network architectures, true value function, predictron accumulates internal rewards, multiple planning depths Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset