arXiv Analytics

Sign in

arXiv:2207.02750 [math.OC]AbstractReferencesReviewsResources

An SDE perspective on stochastic convex optimization

Rodrigo Maulen Soto, Jalal Fadili, Hedy Attouch

Published 2022-07-06Version 1

We analyze the global and local behavior of gradient-like flows under stochastic errors towards the aim of solving convex optimization problems with noisy gradient input. We first study the unconstrained differentiable convex case, using a stochastic differential equation where the drift term is minus the gradient of the objective function and the diffusion term is either bounded or square-integrable. In this context, under Lipschitz continuity of the gradient, our first main result shows almost sure convergence of the objective and the trajectory process towards a minimizer of the objective function. We also provide a comprehensive complexity analysis by establishing several new pointwise and ergodic convergence rates in expectation for the convex, strongly convex, and (local) {\L}ojasiewicz case. The latter, which involves local analysis, is challenging and requires non-trivial arguments from measure theory. Then, we extend our study to the constrained case and more generally to certain nonsmooth situations. We show that several of our results have natural extensions obtained by replacing the gradient of the objective function by a cocoercive monotone operator. This makes it possible to obtain similar convergence results for optimization problems with an additively "smooth + non-smooth" convex structure. Finally, we consider another extension of our results to non-smooth optimization which is based on the Moreau envelope.

Related articles: Most relevant | Search more
arXiv:1212.4701 [math.OC] (Published 2012-12-19, updated 2014-09-25)
On Solving Convex Optimization Problems with Linear Ascending Constraints
arXiv:1802.01062 [math.OC] (Published 2018-02-04)
How to Characterize the Worst-Case Performance of Algorithms for Nonconvex Optimization
arXiv:1404.5100 [math.OC] (Published 2014-04-21, updated 2014-09-14)
Convergence of cyclic coordinatewise l1 minimization