A sharp uniform-in-time error estimate for Stochastic Gradient Langevin Dynamics (original) (raw)

View PDF HTML (experimental)

Abstract:We establish a sharp uniform-in-time error estimate for the Stochastic Gradient Langevin Dynamics (SGLD), which is a widely-used sampling algorithm. Under mild assumptions, we obtain a uniform-in-time O(eta2)O(\eta^2)O(eta2) bound for the KL-divergence between the SGLD iteration and the Langevin diffusion, where eta\etaeta is the step size (or learning rate). Our analysis is also valid for varying step sizes. Consequently, we are able to derive an O(eta)O(\eta)O(eta) bound for the distance between the invariant measures of the SGLD iteration and the Langevin diffusion, in terms of Wasserstein or total variation distances. Our result can be viewed as a significant improvement compared with existing analysis for SGLD in related literature.

Submission history

From: Yuliang Wang [view email]
[v1] Tue, 19 Jul 2022 14:38:52 UTC (42 KB)
[v2] Sat, 22 Oct 2022 02:00:51 UTC (50 KB)
[v3] Wed, 19 Mar 2025 15:14:41 UTC (52 KB)