arXiv:2207.12738 Abstract | arXiv Analytics

arXiv:2207.12738 [math.OC]Abstract References Reviews Resources

Quantitative propagation of chaos for mean field Markov decision process with common noise

Published 2022-07-26Version 1

We investigate propagation of chaos for mean field Markov Decision Process with common noise (CMKV-MDP), and when the optimization is performed over randomized open-loop controls on infinite horizon. We first state a rate of convergence of order $M_N^\gamma$, where $M_N$ is the mean rate of convergence in Wasserstein distance of the empirical measure, and $\gamma \in (0,1]$ is an explicit constant, in the limit of the value functions of $N$-agent control problem with asymmetric open-loop controls, towards the value function of CMKV-MDP. Furthermore, we show how to explicitly construct $(\epsilon+\mathcal{O}(M_N^\gamma))$-optimal policies for the $N$-agent model from $\epsilon$-optimal policies for the CMKV-MDP. Our approach relies on sharp comparison between the Bellman operators in the $N$-agent problem and the CMKV-MDP, and fine coupling of empirical measures.

Categories: math.OC, math.PR

Keywords: mean field markov decision process, common noise, quantitative propagation, open-loop controls, optimal policies

Related articles: Most relevant | Search more

arXiv:2204.01185 [math.OC] (Published 2022-04-03)

Wasserstein Hamiltonian flow with common noise on graph

Jianbo Cui, Shu Liu, Haomin Zhou

arXiv:1708.06035 [math.OC] (Published 2017-08-20)

Quantile-based Mean-Field Games with Common Noise

Hamidou Tembine

arXiv:1912.07883 [math.OC] (Published 2019-12-17)

Mean-field Markov decision processes with common noise and open-loop controls

Médéric Motte, Huyên Pham

arXiv Analytics

arXiv:2207.12738 [math.OC]Abstract References Reviews Resources

Quantitative propagation of chaos for mean field Markov decision process with common noise

Links

Toolbox

arXiv:2207.12738 [math.OC]AbstractReferencesReviewsResources

Quantitative propagation of chaos for mean field Markov decision process with common noise

Links

Toolbox

arXiv:2207.12738 [math.OC]Abstract References Reviews Resources