arXiv:1801.09821 Abstract | arXiv Analytics

arXiv:1801.09821 [cs.LG]Abstract References Reviews Resources

Learning to Emulate an Expert Projective Cone Scheduler

Published 2018-01-30Version 1

Projective cone scheduling defines a large class of rate-stabilizing policies for queueing models relevant to several applications. While there exists considerable theory on the properties of projective cone schedulers, there is little practical guidance on choosing the parameters that define them. In this paper, we propose an algorithm for designing an automated projective cone scheduling system based on observations of an expert projective cone scheduler. We show that the estimated scheduling policy is able to emulate the expert in the sense that the average loss realized by the learned policy will converge to zero. Specifically, for a system with $n$ queues observed over a time horizon $T$, the average loss for the algorithm is $O(\ln(T)\sqrt{\ln(n)/T})$. This upper bound holds regardless of the statistical characteristics of the system. The algorithm uses the multiplicative weights update method and can be applied online so that additional observations of the expert scheduler can be used to improve an existing estimate of the policy. This provides a data-driven method for designing a scheduling policy based on observations of a human expert. We demonstrate the efficacy of the algorithm with a simple numerical example and discuss several extensions.

Comments: 6 pages, 3 figures

Categories: cs.LG

Keywords: expert projective cone scheduler, projective cone scheduling system, average loss, upper bound holds, observations

Related articles: Most relevant | Search more

arXiv:2406.02295 [cs.LG] (Published 2024-06-04)

How to Explore with Belief: State Entropy Maximization in POMDPs

Riccardo Zamboni, Duilio Cirino, Marcello Restelli, Mirco Mutti

arXiv:2401.10518 [cs.LG] (Published 2024-01-19)

Spatial-temporal Forecasting for Regions without Observations

Xinyu Su, Jianzhong Qi, Egemen Tanin, Yanchuan Chang, Majid Sarvi

arXiv:2501.09331 [cs.LG] (Published 2025-01-16)

Identifying Information from Observations with Uncertainty and Novelty