arXiv:1506.07291 [math.OC]
Two-Armed Restless Bandits with Imperfect Information: Stochastic Control and Indexability
Published 2015-06-24 (Version 1)
We present a two-armed bandit model of decision making under uncertainty where the expected return to investing in the "risky arm" increases when choosing that arm and decreases when choosing the "safe" arm. These dynamics are natural in applications such as human capital development, job search, and occupational choice. Using new insights from stochastic control, along with a monotonicity condition on the payoff dynamics, we show that optimal strategies in our model are stopping rules that can be characterized by an index which formally coincides with Gittins' index. Our result implies the indexability of a new class of restless bandit models.
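The stopping-rule structure described in the abstract can be illustrated with a toy simulation. The sketch below is not the paper's model; it is a minimal illustrative assumption in which the risky arm's expected return `mu` drifts up each time that arm is chosen and drifts down when the safe arm is chosen (the "restless" feature), and the agent follows an index-type rule: play the risky arm while `mu` exceeds the safe payoff, otherwise switch to the safe arm for good. All names and parameter values (`drift`, `safe_payoff`, etc.) are hypothetical.

```python
import random

def simulate(T=50, mu0=0.3, drift=0.02, safe_payoff=0.5, seed=0):
    """Toy two-armed restless bandit (illustrative only).

    The risky arm's expected return mu rises by `drift` each period it
    is chosen and falls by `drift` when the safe arm is chosen. The
    agent uses an index-type stopping rule: play risky while mu exceeds
    the safe payoff, then switch to the safe arm permanently.
    """
    rng = random.Random(seed)
    mu, total, stopped = mu0, 0.0, False
    for _ in range(T):
        if not stopped and mu > safe_payoff:
            total += rng.random() < mu   # Bernoulli reward from risky arm
            mu = min(1.0, mu + drift)    # return improves with use
        else:
            stopped = True               # absorb into the safe arm
            total += safe_payoff         # deterministic safe payoff
            mu = max(0.0, mu - drift)    # risky arm decays while idle
    return total, mu, stopped
```

Because choosing the risky arm only raises `mu`, the rule never switches back once `mu` exceeds the threshold, and once it switches to the safe arm it never returns; this monotonicity is what makes a simple stopping rule optimal in such dynamics.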
Comments: 46 pages
Categories: math.OC