arXiv:1804.08619 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords deep q-learning, state-action value, unnecessary td updates, novel state distribution-aware sampling method, transition Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset