arXiv:1806.08301 Abstract | arXiv Analytics

arXiv:1806.08301 [stat.ML]Abstract References Reviews Resources

Online Saddle Point Problem with Applications to Constrained Online Convex Optimization

Published 2018-06-21Version 1

We study an online saddle point problem where at each iteration a pair of actions need to be chosen without knowledge of the future (convex-concave) payoff functions. The objective is to minimize the gap between the cumulative payoffs and the saddle point value of the aggregate payoff function, which we measure using a metric called "SP-regret". The problem generalizes the online convex optimization framework and can be interpreted as finding the Nash equilibrium for the aggregate of a sequence of two-player zero-sum games. We propose an algorithm that achieves $\tilde{O}(\sqrt{T})$ SP-regret in the general case, and $O(\log T)$ SP-regret for the strongly convex-concave case. We then consider a constrained online convex optimization problem motivated by a variety of applications in dynamic pricing, auctions, and crowdsourcing. We relate this problem to an online saddle point problem and establish $O(\sqrt{T})$ regret using a primal-dual algorithm.

Categories: stat.ML, cs.LG, math.OC

Keywords: online saddle point problem, applications, constrained online convex optimization problem, payoff function, online convex optimization framework

Related articles: Most relevant | Search more

arXiv:1209.3079 [stat.ML] (Published 2012-09-14)

Signal Recovery in Unions of Subspaces with Applications to Compressive Imaging

Nikhil Rao, Benjamin Recht, Robert Nowak

arXiv:1911.02728 [stat.ML] (Published 2019-11-07)

Auto-encoding graph-valued data with applications to brain connectomes

Meimei Liu, Zhengwu Zhang, David B. Dunson

arXiv:2211.14115 [stat.ML] (Published 2022-11-25)

Inverse Solvability and Security with Applications to Federated Learning

Tomasz Piotrowski, Matthias Frey, Renato L. G. Cavalcante, Rafail Ismailov