arXiv Analytics

Sign in

arXiv:1608.01713 [math.OC]AbstractReferencesReviewsResources

Global Convergence Rate of Proximal Incremental Aggregated Gradient Methods

Nuri Denizcan Vanli, Mert Gurbuzbalaban, Asu Ozdaglar

Published 2016-08-04Version 1

We focus on the problem of minimizing the sum of smooth component functions (where the sum is strongly convex) and a non-smooth convex function, which arises in regularized empirical risk minimization in machine learning and distributed constrained optimization in wireless sensor networks and smart grids. We consider solving this problem using the proximal incremental aggregated gradient (PIAG) method, which at each iteration moves along an aggregated gradient (formed by incrementally updating gradients of component functions according to a deterministic order) and taking a proximal step with respect to the non-smooth function. While the convergence properties of this method with randomized orders (in updating gradients of component functions) have been investigated, this paper, to the best of our knowledge, is the first study that establishes the convergence rate properties of the PIAG method for any deterministic order. In particular, we show that the PIAG algorithm is globally convergent with a linear rate provided that the step size is sufficiently small. We explicitly identify the rate of convergence and the corresponding step size to achieve this convergence rate. Our results improve upon the best known condition number dependence of the convergence rate of the incremental aggregated gradient methods used for minimizing a sum of smooth functions.

Related articles: Most relevant | Search more
arXiv:1711.05812 [math.OC] (Published 2017-11-15)
Global convergence rates of augmented Lagrangian methods for constrained convex programming
arXiv:1801.03600 [math.OC] (Published 2018-01-11)
Multi-Level Stochastic Gradient Methods for Nested Compositon Optimization
arXiv:2305.13082 [math.OC] (Published 2023-05-22)
Sketch-and-Project Meets Newton Method: Global $\mathcal O(k^{-2})$ Convergence with Low-Rank Updates