arXiv:1911.07409 Abstract | arXiv Analytics

arXiv:1911.07409 [math.OC]Abstract References Reviews Resources

Online Learning and Matching for Resource Allocation Problems

Andrea Boskovic, Qinyi Chen, Dominik Kufel, Zijie Zhou

Published 2019-11-18Version 1

In order for an e-commerce platform to maximize its revenue, it must recommend customers items they are most likely to purchase. However, the company often has business constraints on these items, such as the number of each item in stock. In this work, our goal is to recommend items to users as they arrive on a webpage sequentially, in an online manner, in order to maximize reward for a company, but also satisfy budget constraints. We first approach the simpler online problem in which the customers arrive as a stationary Poisson process, and present an integrated algorithm that performs online optimization and online learning together. We then make the model more complicated but more realistic, treating the arrival processes as non-stationary Poisson processes. To deal with heterogeneous customer arrivals, we propose a time segmentation algorithm that converts a non-stationary problem into a series of stationary problems. Experiments conducted on large-scale synthetic data demonstrate the effectiveness and efficiency of our proposed approaches on solving constrained resource allocation problems.

Comments: 22 pages, 9 figures

Categories: math.OC, cs.LG, stat.ML

Keywords: online learning, large-scale synthetic data demonstrate, non-stationary poisson processes, solving constrained resource allocation problems, time segmentation algorithm

Related articles: Most relevant | Search more

arXiv:1603.04136 [math.OC] (Published 2016-03-14)

On the Influence of Momentum Acceleration on Online Learning

Kun Yuan, Bicheng Ying, Ali H. Sayed

arXiv:1402.6361 [math.OC] (Published 2014-02-25)

Oracle-Based Robust Optimization via Online Learning

Aharon Ben-Tal, Elad Hazan, Tomer Koren, Shie Mannor

arXiv:1404.1592 [math.OC] (Published 2014-04-06, updated 2014-07-29)

The Power of Online Learning in Stochastic Network Optimization