arXiv Analytics

Sign in

arXiv:1712.00970 [math.OC]AbstractReferencesReviewsResources

Convex and Lipschitz function approximations for Markov decision processes

Jeremy Yee

Published 2017-12-04Version 1

This paper studies the use of convex Lipschitz continuous functions to approximate the value functions in Markov decision processes. Compact convergence is proved under various sampling schemes for the driving state disturbance. Under some assumptions, these approximations give a non-decreasing sequence of lower bounding or a non-increasing sequence of upper bounding functions. Numerical experiments involving piecewise linear approximations for a Bermudan put option demonstrate that tight bounds for its fair price can be obtained within fractions of a cpu second.

Related articles: Most relevant | Search more
arXiv:2406.05086 [math.OC] (Published 2024-06-07)
Robust Reward Design for Markov Decision Processes
arXiv:1202.6259 [math.OC] (Published 2012-02-28)
A distance for probability spaces, and long-term values in Markov Decision Processes and Repeated Games
arXiv:math/0506489 [math.OC] (Published 2005-06-23, updated 2008-03-27)
Acceleration Operators in the Value Iteration Algorithms for Markov Decision Processes