arXiv:2102.11893 Abstract | arXiv Analytics

arXiv:2102.11893 [cs.LG]Abstract References Reviews Resources

Good Actors can come in Smaller Sizes: A Case Study on the Value of Actor-Critic Asymmetry

Siddharth Mysore, Bassel Mabsout, Renato Mancuso, Kate Saenko

Published 2021-02-23Version 1

Actors and critics in actor-critic reinforcement learning algorithms are functionally separate, yet they often use the same network architectures. This case study explores the performance impact of network sizes when considering actor and critic architectures independently. By relaxing the assumption of architectural symmetry, it is often possible for smaller actors to achieve comparable policy performance to their symmetric counterparts. Our experiments show up to 97% reduction in the number of network weights with an average reduction of 64% over multiple algorithms on multiple tasks. Given the practical benefits of reducing actor complexity, we believe configurations of actors and critics are aspects of actor-critic design that deserve to be considered independently.

Categories: cs.LG, cs.RO

Keywords: case study, actor-critic asymmetry, smaller sizes, actor-critic reinforcement learning algorithms, achieve comparable policy performance

Related articles: Most relevant | Search more

arXiv:2005.12729 [cs.LG] (Published 2020-05-25)

Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO

Logan Engstrom, Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Firdaus Janoos, Larry Rudolph, Aleksander Madry

arXiv:1806.07129 [cs.LG] (Published 2018-06-19)

Instance-Level Explanations for Fraud Detection: A Case Study

Dennis Collaris, Leo M. Vink, Jarke J. van Wijk

arXiv:1810.05524 [cs.LG] (Published 2018-10-10)

Introducing a hybrid model of DEA and data mining in evaluating efficiency. Case study: Bank Branches

Sara Hosseinzadeh Kassani, Peyman Hosseinzadeh Kassani, Seyed Esmaeel Najafi