arXiv:1512.03873 Abstract | arXiv Analytics

arXiv:1512.03873 [math.OC]Abstract References Reviews Resources

Structural Results for Partially Observed Markov Decision Processes

Published 2015-12-12Version 1

This article provides an introductory tutorial on structural results in partially observed Markov decision processes (POMDPs). Typically, computing the optimal policy of a POMDP is computationally intractable. We use lattice program- ming methods to characterize the structure of the optimal policy of a POMDP without brute force computations.

Categories: math.OC

Keywords: markov decision processes, structural results, optimal policy, brute force computations, introductory tutorial

Related articles: Most relevant | Search more

arXiv:1404.3328 [math.OC] (Published 2014-04-12, updated 2015-11-14)

Myopic Bounds for Optimal Policy of POMDPs: An extension of Lovejoy's structural results

Vikram Krishnamurthy, Udit Pareek

arXiv:1202.6259 [math.OC] (Published 2012-02-28)

A distance for probability spaces, and long-term values in Markov Decision Processes and Repeated Games

Jérôme Renault, Xavier Venel

arXiv:1310.7906 [math.OC] (Published 2013-10-29, updated 2015-08-04)

Convergence Analysis of the Approximate Newton Method for Markov Decision Processes

Thomas Furmston, Guy Lever