arXiv:2001.01357 Abstract | arXiv Analytics

arXiv:2001.01357 [math.OC]Abstract References Reviews Resources

Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies

Published 2020-01-06Version 1

We consider average-cost Markov decision processes (MDPs) with Borel state and action spaces and universally measurable policies. For the nonnegative cost model and an unbounded cost model with a Lyapunov-type stability character, we introduce a set of new conditions under which we prove the average cost optimality inequality (ACOI) via the vanishing discount factor approach. Unlike most existing results on the ACOI, our result does not require any compactness and continuity conditions on the MDPs. Instead, the main idea is to use the almost-uniform-convergence property of a pointwise convergent sequence of measurable functions as asserted in Egoroff's theorem. Our conditions are formulated in order to exploit this property. Among others, we require that for each state, on selected subsets of actions at that state, the state transition stochastic kernel is majorized by finite measures. We combine this majorization property of the transition kernel with Egoroff's theorem to prove the ACOI.

Comments: 33 pages; submitted. This paper consists of (i) the author's results given previously in Section 3 of arXiv:1901.03374v1 and (ii) newly added discussion and examples

Categories: math.OC

Subjects: 90C39, 90C40, 93E20

Keywords: average cost optimality inequality, universally measurable policies, borel spaces, average-cost markov decision processes

Related articles: Most relevant | Search more

arXiv:1902.10685 [math.OC] (Published 2019-02-27)

On the Minimum Pair Approach for Average-Cost Markov Decision Processes with Countable Discrete Action Space and Strictly Unbounded Costs

Huizhen Yu

arXiv:2206.06492 [math.OC] (Published 2022-06-13)

On Strategic Measures and Optimality Properties in Discrete-Time Stochastic Control with Universally Measurable Policies

Huizhen Yu

arXiv:1202.4122 [math.OC] (Published 2012-02-19)

Average-Cost Markov Decision Processes with Weakly Continuous Transition Probabilities

Eugene A. Feinberg, Pavlo O. Kasyanov, Nina V. Zadoianchuk

arXiv Analytics

arXiv:2001.01357 [math.OC]Abstract References Reviews Resources

Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies

Links

Toolbox

arXiv:2001.01357 [math.OC]AbstractReferencesReviewsResources

Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies

Links

Toolbox

arXiv:2001.01357 [math.OC]Abstract References Reviews Resources