arXiv Analytics

Sign in

arXiv:1806.08301 [stat.ML]AbstractReferencesReviewsResources

Online Saddle Point Problem with Applications to Constrained Online Convex Optimization

Adrian Rivera, He Wang, Huan Xu

Published 2018-06-21Version 1

We study an online saddle point problem where at each iteration a pair of actions need to be chosen without knowledge of the future (convex-concave) payoff functions. The objective is to minimize the gap between the cumulative payoffs and the saddle point value of the aggregate payoff function, which we measure using a metric called "SP-regret". The problem generalizes the online convex optimization framework and can be interpreted as finding the Nash equilibrium for the aggregate of a sequence of two-player zero-sum games. We propose an algorithm that achieves $\tilde{O}(\sqrt{T})$ SP-regret in the general case, and $O(\log T)$ SP-regret for the strongly convex-concave case. We then consider a constrained online convex optimization problem motivated by a variety of applications in dynamic pricing, auctions, and crowdsourcing. We relate this problem to an online saddle point problem and establish $O(\sqrt{T})$ regret using a primal-dual algorithm.

Related articles: Most relevant | Search more
arXiv:1209.3079 [stat.ML] (Published 2012-09-14)
Signal Recovery in Unions of Subspaces with Applications to Compressive Imaging
arXiv:1911.02728 [stat.ML] (Published 2019-11-07)
Auto-encoding graph-valued data with applications to brain connectomes
arXiv:2211.14115 [stat.ML] (Published 2022-11-25)
Inverse Solvability and Security with Applications to Federated Learning