arXiv Analytics

Sign in

arXiv:2102.11893 [cs.LG]AbstractReferencesReviewsResources

Good Actors can come in Smaller Sizes: A Case Study on the Value of Actor-Critic Asymmetry

Siddharth Mysore, Bassel Mabsout, Renato Mancuso, Kate Saenko

Published 2021-02-23Version 1

Actors and critics in actor-critic reinforcement learning algorithms are functionally separate, yet they often use the same network architectures. This case study explores the performance impact of network sizes when considering actor and critic architectures independently. By relaxing the assumption of architectural symmetry, it is often possible for smaller actors to achieve comparable policy performance to their symmetric counterparts. Our experiments show up to 97% reduction in the number of network weights with an average reduction of 64% over multiple algorithms on multiple tasks. Given the practical benefits of reducing actor complexity, we believe configurations of actors and critics are aspects of actor-critic design that deserve to be considered independently.

Related articles: Most relevant | Search more
arXiv:2005.12729 [cs.LG] (Published 2020-05-25)
Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO
arXiv:1806.07129 [cs.LG] (Published 2018-06-19)
Instance-Level Explanations for Fraud Detection: A Case Study
arXiv:1810.05524 [cs.LG] (Published 2018-10-10)
Introducing a hybrid model of DEA and data mining in evaluating efficiency. Case study: Bank Branches