arXiv Analytics

arXiv:2402.03210 [math.OC]

Universal Gradient Methods for Stochastic Convex Optimization

Anton Rodomanov, Ali Kavis, Yongtao Wu, Kimon Antonakopoulos, Volkan Cevher

Published 2024-02-05, updated 2024-07-11 (Version 2)

We develop universal gradient methods for Stochastic Convex Optimization (SCO). Our algorithms automatically adapt not only to the oracle's noise but also to the Hölder smoothness of the objective function, without a priori knowledge of the particular setting. The key ingredient is a novel strategy for adjusting step-size coefficients in Stochastic Gradient Descent (SGD). Unlike AdaGrad, which accumulates gradient norms, our Universal Gradient Method accumulates appropriate combinations of gradient and iterate differences. The resulting algorithm has state-of-the-art worst-case convergence rate guarantees for the entire Hölder class, including, in particular, both nonsmooth functions and those with Lipschitz continuous gradients. We also present the Universal Fast Gradient Method for SCO, which enjoys optimal efficiency estimates.
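The abstract contrasts AdaGrad-style step sizes, which shrink with accumulated gradient norms, with a rule that accumulates combinations of gradient and iterate differences. The sketch below is only a schematic illustration of that distinction on a toy stochastic quadratic; the accumulator in `universal_style_sgd`, the test function, and all parameter names are assumptions for illustration and do not reproduce the paper's actual update rule.

```python
import numpy as np

rng = np.random.default_rng(0)

def stochastic_grad(x, noise=0.1):
    """Noisy gradient of f(x) = 0.5 * ||x||^2 (a simple convex test problem)."""
    return x + noise * rng.standard_normal(x.shape)

def adagrad_sgd(x0, n_steps=1000, eta=1.0, eps=1e-12):
    """AdaGrad-style SGD: step sizes shrink with accumulated squared gradient norms."""
    x, acc = x0.copy(), 0.0
    for _ in range(n_steps):
        g = stochastic_grad(x)
        acc += np.dot(g, g)                    # accumulate ||g_k||^2
        x = x - eta / np.sqrt(acc + eps) * g
    return x

def universal_style_sgd(x0, n_steps=1000, eta=1.0):
    """Hypothetical sketch: accumulate products of gradient- and iterate-difference
    norms instead of raw gradient norms (NOT the paper's exact coefficient rule)."""
    x, acc = x0.copy(), 1.0                    # initial regularizer (an assumption)
    g_prev, x_prev = stochastic_grad(x0), x0.copy()
    for _ in range(n_steps):
        g = stochastic_grad(x)
        # Illustrative accumulator built from successive differences.
        acc += np.linalg.norm(g - g_prev) * np.linalg.norm(x - x_prev)
        g_prev, x_prev = g, x.copy()
        x = x - eta / np.sqrt(acc) * g
    return x

x0 = np.ones(10)
print("AdaGrad-style final norm:   ", np.linalg.norm(adagrad_sgd(x0)))
print("Difference-based final norm:", np.linalg.norm(universal_style_sgd(x0)))
```

The intended point of the comparison is qualitative: difference-based accumulation lets the effective step size stop shrinking once the iterates and stochastic gradients stabilize, which is the kind of adaptivity to smoothness and noise the abstract describes.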

Related articles:
arXiv:1502.06259 [math.OC] (Published 2015-02-22)
Gradient and gradient-free methods for stochastic convex optimization with inexact oracle
arXiv:1107.1744 [math.OC] (Published 2011-07-08, updated 2011-10-08)
Stochastic convex optimization with bandit feedback
arXiv:2012.15636 [math.OC] (Published 2020-12-31)
Inexact Tensor Methods and Their Application to Stochastic Convex Optimization