arXiv:1407.7241 Abstract | arXiv Analytics

arXiv:1407.7241 [math.PR]Abstract References Reviews Resources

Bandit problems with Levy processes

Published 2014-07-27Version 1

Bandit problems model the trade-off between exploration and exploitation in various decision problems. We study two-armed bandit problems in continuous time, where the risky arm can have two types: High or Low; both types yield stochastic payoffs generated by a Levy process. We show that the optimal strategy is a cut-off strategy and we provide an explicit expression for the cut-off and for the optimal payoff.

Comments: arXiv admin note: text overlap with arXiv:0906.0835

Journal: Mathematics of Operations Research, Volume 38, Issue 1, February 2013, pp. 92-107

Categories: math.PR

Keywords: levy processes, types yield stochastic payoffs, bandit problems model, study two-armed bandit problems, optimal payoff

Tags: journal article

Related articles: Most relevant | Search more

arXiv:math/0507193 [math.PR] (Published 2005-07-10)

Levy processes: Hitting time, overshoot and undershoot II - Asymptotic behaviour

Bernard Roynette, Pierre Vallois, Agnes Volpi

arXiv:1507.06258 [math.PR] (Published 2015-07-22)

Optimal stopping for Levy processes with polynomial rewards

Ernesto Mordecki, Yuliya Mishura

arXiv:math/0608402 [math.PR] (Published 2006-08-15, updated 2015-09-04)

Levy Processes, Generators

Lev Sakhnovich

arXiv Analytics

arXiv:1407.7241 [math.PR]Abstract References Reviews Resources

Bandit problems with Levy processes

Links

Toolbox

arXiv:1407.7241 [math.PR]AbstractReferencesReviewsResources

Bandit problems with Levy processes

Links

Toolbox

arXiv:1407.7241 [math.PR]Abstract References Reviews Resources