arXiv:2006.11645 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords accelerating safe reinforcement learning, baseline policy, constraint-mismatched policies, times fewer constraint violations, first step performs Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset