arXiv:1310.5770 Abstract | arXiv Analytics

arXiv:1310.5770 [math.OC]Abstract References Reviews Resources

Quantized Stationary Control Policies in Markov Decision Processes

Published 2013-10-22, updated 2014-04-26Version 2

For a large class of Markov Decision Processes, stationary (possibly randomized) policies are globally optimal. However, in Borel state and action spaces, the computation and implementation of even such stationary policies are known to be prohibitive. In addition, networked control applications require remote controllers to transmit action commands to an actuator with low information rate. These two problems motivate the study of approximating optimal policies by quantized (discretized) policies. To this end, we introduce deterministic stationary quantizer policies and show that such policies can approximate optimal deterministic stationary policies with arbitrary precision under mild technical conditions, thus demonstrating that one can search for $\varepsilon$-optimal policies within the class of quantized control policies. We also derive explicit bounds on the approximation error in terms of the rate of the approximating quantizers. We extend all these approximation results to randomized policies. These findings pave the way toward applications in optimal design of networked control systems where controller actions need to be quantized, as well as for new computational methods for generating approximately optimal decision policies in general (Polish) state and action spaces for both discounted cost and average cost.

Comments: 21 pages

Categories: math.OC, cs.SY

Subjects: 93E20

Keywords: quantized stationary control policies, markov decision processes, approximately optimal decision policies, approximate optimal deterministic stationary policies

Related articles: Most relevant | Search more

arXiv:math/0506489 [math.OC] (Published 2005-06-23, updated 2008-03-27)

Acceleration Operators in the Value Iteration Algorithms for Markov Decision Processes

Oleksandr Shlakhter, Chi-Guhn Lee, Dmitry Khmelev, Nasser Jaber

arXiv:1808.04478 [math.OC] (Published 2018-08-13)

Risk Sensitive Multiple Goal Stochastic Optimization, with application to Risk Sensitive Partially Observed Markov Decision Processes

Vaios Laschos, Robert Seidel, Klaus Obermayer

arXiv:2312.01586 [math.OC] (Published 2023-12-04)