arXiv:2207.04922 [stat.ML]

On uniform-in-time diffusion approximation for stochastic gradient descent

Lei Li, Yuliang Wang

Published 2022-07-11 (Version 1)

The diffusion approximation of stochastic gradient descent (SGD) in the current literature is valid only on a finite time interval. In this paper, we establish a uniform-in-time diffusion approximation of SGD, assuming only that the expected loss is strongly convex, together with some other mild conditions, and without assuming convexity of each individual random loss function. The main technique is to establish exponential decay rates for the derivatives of the solution to the backward Kolmogorov equation. The uniform-in-time approximation allows us to study the asymptotic behavior of SGD via the continuous stochastic differential equation (SDE), even when the random objective function $f(\cdot;\xi)$ is not strongly convex.
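For orientation, here is a minimal sketch of the standard first-order diffusion approximation of SGD, written with learning rate $\eta$, expected loss $F(x) = \mathbb{E}_\xi[f(x;\xi)]$, gradient-noise covariance $\Sigma(x)$, and Brownian motion $W_t$; this notation is assumed here for illustration, and the paper's precise SDE and conditions may differ:

% Sketch: standard first-order diffusion approximation of SGD
% (illustrative form; the paper's exact SDE and assumptions may differ).
\begin{align*}
  x_{k+1} &= x_k - \eta\,\nabla f(x_k;\xi_k)
      && \text{(SGD iteration, step size } \eta\text{)} \\
  \mathrm{d}X_t &= -\nabla F(X_t)\,\mathrm{d}t
      + \sqrt{\eta}\,\Sigma(X_t)^{1/2}\,\mathrm{d}W_t
      && \text{(approximating SDE, } X_{k\eta} \approx x_k\text{)}
\end{align*}

A uniform-in-time result controls the weak error between $x_k$ and $X_{k\eta}$ by a bound of order $\eta$ with a constant independent of $k$, rather than one that grows with the length of the time interval.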

Related articles
arXiv:2407.07670 [stat.ML] (Published 2024-07-10)
Stochastic Gradient Descent for Two-layer Neural Networks
arXiv:2209.08951 [stat.ML] (Published 2022-09-19)
Generalization Bounds for Stochastic Gradient Descent via Localized $\varepsilon$-Covers
arXiv:2502.06719 [stat.ML] (Published 2025-02-10)
Gaussian Approximation and Multiplier Bootstrap for Stochastic Gradient Descent