arXiv:2111.06784 [cs.LG]
Keywords: partially observable Markov decision processes, confounded partially observable Markov decision, minimax learning approach, off-policy evaluation, permits general function approximation