arXiv:2310.19861 [cs.LG]
Keywords: function approximation, competitive rl, partial observation, mg models fitting mg, self-play posterior sampling method