arXiv:1712.00970 Abstract | arXiv Analytics

arXiv:1712.00970 [math.OC]Abstract References Reviews Resources

Convex and Lipschitz function approximations for Markov decision processes

Published 2017-12-04Version 1

This paper studies the use of convex Lipschitz continuous functions to approximate the value functions in Markov decision processes. Compact convergence is proved under various sampling schemes for the driving state disturbance. Under some assumptions, these approximations give a non-decreasing sequence of lower bounding or a non-increasing sequence of upper bounding functions. Numerical experiments involving piecewise linear approximations for a Bermudan put option demonstrate that tight bounds for its fair price can be obtained within fractions of a cpu second.

Comments: 24 pages

Categories: math.OC

Keywords: markov decision processes, lipschitz function approximations, convex lipschitz continuous functions, value functions, compact convergence

Related articles: Most relevant | Search more

arXiv:2406.05086 [math.OC] (Published 2024-06-07)

Robust Reward Design for Markov Decision Processes

Shuo Wu, Haoxiang Ma, Jie Fu, Shuo Han

arXiv:1202.6259 [math.OC] (Published 2012-02-28)

A distance for probability spaces, and long-term values in Markov Decision Processes and Repeated Games

Jérôme Renault, Xavier Venel

arXiv:math/0506489 [math.OC] (Published 2005-06-23, updated 2008-03-27)

Acceleration Operators in the Value Iteration Algorithms for Markov Decision Processes