arXiv Analytics

Sign in

arXiv:1805.11897 [stat.ML]AbstractReferencesReviewsResources

Differential Properties of Sinkhorn Approximation for Learning with Wasserstein Distance

Giulia Luise, Alessandro Rudi, Massimiliano Pontil, Carlo Ciliberto

Published 2018-05-30Version 1

Applications of optimal transport have recently gained remarkable attention thanks to the computational advantages of entropic regularization. However, in most situations the Sinkhorn approximation of the Wasserstein distance is replaced by a regularized version that is less accurate but easy to differentiate. In this work we characterize the differential properties of the original Sinkhorn distance, proving that it enjoys the same smoothness as its regularized version and we explicitly provide an efficient algorithm to compute its gradient. We show that this result benefits both theory and applications: on one hand, high order smoothness confers statistical guarantees to learning with Wasserstein approximations. On the other hand, the gradient formula allows us to efficiently solve learning and optimization problems in practice. Promising preliminary experiments complement our analysis.

Related articles: Most relevant | Search more
arXiv:2006.10325 [stat.ML] (Published 2020-06-18)
When OT meets MoM: Robust estimation of Wasserstein Distance
arXiv:2401.11562 [stat.ML] (Published 2024-01-21)
Enhancing selectivity using Wasserstein distance based reweighing
arXiv:2109.14206 [stat.ML] (Published 2021-09-29, updated 2022-01-20)
Exact Statistical Inference for the Wasserstein Distance by Selective Inference