arXiv Analytics

Sign in

arXiv:2406.10823 [math.PR]AbstractReferencesReviewsResources

Iterated Schrödinger bridge approximation to Wasserstein Gradient Flows

Medha Agarwal, Zaid Harchaoui, Garrett Mulcahy, Soumik Pal

Published 2024-06-16Version 1

We introduce a novel discretization scheme for Wasserstein gradient flows that involves successively computing Schr\"{o}dinger bridges with the same marginals. This is different from both the forward/geodesic approximation and the backward/Jordan-Kinderlehrer-Otto (JKO) approximations. The proposed scheme has two advantages: one, it avoids the use of the score function, and, two, it is amenable to particle-based approximations using the Sinkhorn algorithm. Our proof hinges upon showing that relative entropy between the Schr\"{o}dinger bridge with the same marginals at temperature $\epsilon$ and the joint distribution of a stationary Langevin diffusion at times zero and $\epsilon$ is of the order $o(\epsilon^2)$ with an explicit dependence given by Fisher information. Owing to this inequality, we can show, using a triangular approximation argument, that the interpolated iterated application of the Schr\"{o}dinger bridge approximation converge to the Wasserstein gradient flow, for a class of gradient flows, including the heat flow. The results also provide a probabilistic and rigorous framework for the convergence of the self-attention mechanisms in transformer networks to the solutions of heat flows, first observed in the inspiring work SABP22 in machine learning research.

Related articles: Most relevant | Search more
arXiv:2304.06665 [math.PR] (Published 2023-04-13)
The heat flow, GAF, and SL(2;R)
arXiv:2308.11685 [math.PR] (Published 2023-08-22)
Zeros of random polynomials undergoing the heat flow
arXiv:2101.06024 [math.PR] (Published 2021-01-15)
A probabilistic representation for heat flow of harmonic map on manifolds with time-dependent Riemannian metric