arXiv:2210.07338 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords linear function approximation, unbiased policy evaluation, reinforcement learning, two-time-scale stochastic approximation algorithm, gradient descent Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset