arXiv:2410.17297 [stat.ML]AbstractReferencesReviewsResources
Error estimates between SGD with momentum and underdamped Langevin diffusion
Arnaud Guillin, Yu Wang, Lihu Xu, Haoran Yang
Published 2024-10-22Version 1
Stochastic gradient descent with momentum is a popular variant of stochastic gradient descent, which has recently been reported to have a close relationship with the underdamped Langevin diffusion. In this paper, we establish a quantitative error estimate between them in the 1-Wasserstein and total variation distances.
Related articles: Most relevant | Search more
arXiv:2103.14350 [stat.ML] (Published 2021-03-26)
The convergence of the Stochastic Gradient Descent (SGD) : a self-contained proof
arXiv:2502.06719 [stat.ML] (Published 2025-02-10)
Gaussian Approximation and Multiplier Bootstrap for Stochastic Gradient Descent
Marina Sheshukova, Sergey Samsonov, Denis Belomestny, Eric Moulines, Qi-Man Shao, Zhuo-Song Zhang, Alexey Naumov
arXiv:2502.00885 [stat.ML] (Published 2025-02-02)
Algorithmic Stability of Stochastic Gradient Descent with Momentum under Heavy-Tailed Noise