arXiv Analytics

Sign in

arXiv:1512.03873 [math.OC]AbstractReferencesReviewsResources

Structural Results for Partially Observed Markov Decision Processes

Vikram Krishnamurthy

Published 2015-12-12Version 1

This article provides an introductory tutorial on structural results in partially observed Markov decision processes (POMDPs). Typically, computing the optimal policy of a POMDP is computationally intractable. We use lattice program- ming methods to characterize the structure of the optimal policy of a POMDP without brute force computations.

Related articles: Most relevant | Search more
arXiv:1404.3328 [math.OC] (Published 2014-04-12, updated 2015-11-14)
Myopic Bounds for Optimal Policy of POMDPs: An extension of Lovejoy's structural results
arXiv:1202.6259 [math.OC] (Published 2012-02-28)
A distance for probability spaces, and long-term values in Markov Decision Processes and Repeated Games
arXiv:1310.7906 [math.OC] (Published 2013-10-29, updated 2015-08-04)
Convergence Analysis of the Approximate Newton Method for Markov Decision Processes