arXiv:1811.01926 [cs.LG]

contextual: Evaluating Contextual Multi-Armed Bandit Problems in R

Robin van Emden, Maurits Kaptein

Published 2018-11-06 (Version 1)

Over the past decade, contextual bandit algorithms have gained popularity due to their effectiveness and flexibility in solving sequential decision problems, from online advertising and finance to clinical trial design and personalized medicine. At the same time, there are as yet surprisingly few options that enable researchers and practitioners to simulate and compare the wealth of new and existing bandit algorithms in a standardized way. To help close this gap between analytical research and empirical evaluation, the current paper introduces the object-oriented R package "contextual": a user-friendly framework whose object-oriented structure makes it easily extensible, and which facilitates parallelized comparison of contextual and context-free bandit policies through both simulation and offline analysis.
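As a rough illustration of the kind of workflow the abstract describes, a minimal simulation sketch in R is shown below. The class names (BasicBernoulliBandit, EpsilonGreedyPolicy, Agent, Simulator) follow the package's documented object-oriented structure, but the specific arguments and plot options here are an assumption for illustration, not verbatim package documentation.

    # Minimal sketch of a "contextual" simulation (API assumed from the package's documentation)
    library(contextual)

    # A three-armed Bernoulli bandit with one clearly better arm
    bandit    <- BasicBernoulliBandit$new(weights = c(0.9, 0.1, 0.1))

    # A context-free epsilon-greedy policy
    policy    <- EpsilonGreedyPolicy$new(epsilon = 0.1)

    # An Agent pairs a policy with a bandit; the Simulator runs repeated simulations in parallel
    agent     <- Agent$new(policy, bandit)
    simulator <- Simulator$new(agents = agent, horizon = 100, simulations = 100)
    history   <- simulator$run()

    # Summarize and plot cumulative reward over the simulated horizon
    plot(history, type = "cumulative")

Swapping in a contextual policy (for example a LinUCB-style policy paired with a contextual bandit class) would follow the same Agent/Simulator pattern, which is what the package's extensible structure is intended to support.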
