arXiv:1512.03873 [math.OC]
Structural Results for Partially Observed Markov Decision Processes
Published 2015-12-12, Version 1
This article provides an introductory tutorial on structural results in partially observed Markov decision processes (POMDPs). Typically, computing the optimal policy of a POMDP is computationally intractable. We use lattice programming methods to characterize the structure of the optimal policy of a POMDP without brute-force computations.
Categories: math.OC
Related articles:
Myopic Bounds for Optimal Policy of POMDPs: An extension of Lovejoy's structural results
arXiv:1202.6259 [math.OC] (Published 2012-02-28)
A distance for probability spaces, and long-term values in Markov Decision Processes and Repeated Games
Convergence Analysis of the Approximate Newton Method for Markov Decision Processes