arXiv Analytics

Sign in

arXiv:2010.13834 [cs.LG]AbstractReferencesReviewsResources

End-to-End Learning and Intervention in Games

Jiayang Li, Jing Yu, Yu, Nie, Zhaoran Wang

Published 2020-10-26Version 1

In a social system, the self-interest of agents can be detrimental to the collective good, sometimes leading to social dilemmas. To resolve such a conflict, a central designer may intervene by either redesigning the system or incentivizing the agents to change their behaviors. To be effective, the designer must anticipate how the agents react to the intervention, which is dictated by their often unknown payoff functions. Therefore, learning about the agents is a prerequisite for intervention. In this paper, we provide a unified framework for learning and intervention in games. We cast the equilibria of games as individual layers and integrate them into an end-to-end optimization framework. To enable the backward propagation through the equilibria of games, we propose two approaches, respectively based on explicit and implicit differentiation. Specifically, we cast the equilibria as the solutions to variational inequalities (VIs). The explicit approach unrolls the projection method for solving VIs, while the implicit approach exploits the sensitivity of the solutions to VIs. At the core of both approaches is the differentiation through a projection operator. Moreover, we establish the correctness of both approaches and identify the conditions under which one approach is more desirable than the other. The analytical results are validated using several real-world problems.

Comments: To be published in Advances in Neural Information Processing Systems 33 (NeurIPS 2020)
Categories: cs.LG, cs.GT
Related articles: Most relevant | Search more
arXiv:2409.07137 [cs.LG] (Published 2024-09-11)
Combined Optimization of Dynamics and Assimilation with End-to-End Learning on Sparse Observations
arXiv:2411.04430 [cs.LG] (Published 2024-11-07)
Towards Unifying Interpretability and Control: Evaluation via Intervention
arXiv:1612.08810 [cs.LG] (Published 2016-12-28)
The Predictron: End-To-End Learning and Planning
David Silver et al.