arXiv Analytics

Sign in

arXiv:2304.01840 [math.OC]AbstractReferencesReviewsResources

A $(2/3)n^3$ fast-pivoting algorithm for the Gittins index and optimal stopping of a Markov chain

José Niño-Mora

Published 2023-04-04Version 1

This paper presents a new \emph{fast-pivoting} algorithm that computes the $n$ Gittins index values of an $n$-state bandit -- in the discounted and undiscounted cases -- by performing $(2/3) n^3 + O(n^2)$ arithmetic operations, thus attaining better complexity than previous algorithms and matching that of solving a corresponding linear-equation system by Gaussian elimination. The algorithm further applies to the problem of optimal stopping of a Markov chain, for which a novel Gittins-index solution approach is introduced. The algorithm draws on Gittins and Jones' (1974) index definition via calibration, on Kallenberg's (1986) proposal of using parametric linear programming, on Dantzig's simplex method, on Varaiya et al.'s (1985) algorithm, and on the author's earlier work. The paper elucidates the structure of parametric simplex tableaux. Special structure is exploited to reduce the computational effort of pivot steps, decreasing the operation count by a factor of three relative to using conventional pivoting, and by a factor of $3/2$ relative to recent state-elimination algorithms. A computational study demonstrates significant time savings against alternative algorithms.

Related articles: Most relevant | Search more
arXiv:2203.13446 [math.OC] (Published 2022-03-25)
Randomized Policy Optimization for Optimal Stopping
arXiv:2403.11744 [math.OC] (Published 2024-03-18, updated 2024-11-06)
A First-Order Gradient Approach for the Connectivity Analysis of Markov Chains
arXiv:0902.2518 [math.OC] (Published 2009-02-15)
A Simulation Approach to Optimal Stopping Under Partial Information