arXiv:1911.07409 [math.OC]

Online Learning and Matching for Resource Allocation Problems

Andrea Boskovic, Qinyi Chen, Dominik Kufel, Zijie Zhou

Published 2019-11-18, Version 1

To maximize revenue, an e-commerce platform must recommend to each customer the items they are most likely to purchase. However, the company often faces business constraints on these items, such as the number of units of each item in stock. In this work, our goal is to recommend items to users as they arrive sequentially on a webpage, in an online manner, so as to maximize the company's reward while satisfying budget constraints. We first study the simpler online problem in which customers arrive according to a stationary Poisson process, and present an integrated algorithm that performs online optimization and online learning together. We then consider a more complex but more realistic model in which the arrival processes are non-stationary Poisson processes. To deal with heterogeneous customer arrivals, we propose a time segmentation algorithm that converts a non-stationary problem into a series of stationary problems. Experiments conducted on large-scale synthetic data demonstrate the effectiveness and efficiency of our proposed approaches in solving constrained resource allocation problems.
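The abstract does not spell out how the time segmentation works, so the following is only a minimal sketch of the general idea of splitting a non-stationary arrival stream into near-stationary pieces. It is not the authors' algorithm. The function name `segment_arrivals`, the per-bin count input, and the threshold `tol` are all assumptions introduced for illustration: a new segment is opened whenever a bin's arrival count differs from the current segment's running mean by more than `tol`.

```python
from typing import List, Tuple


def segment_arrivals(counts: List[int], tol: float) -> List[Tuple[int, int, float]]:
    """Greedily split per-bin arrival counts into near-stationary segments.

    Hypothetical heuristic (not taken from the paper): start a new segment
    when a bin's count deviates from the running mean of the current
    segment by more than `tol`.

    Returns a list of (start_bin, end_bin_exclusive, mean_rate) triples.
    """
    if not counts:
        return []

    segments: List[Tuple[int, int, float]] = []
    start, total = 0, counts[0]

    for i in range(1, len(counts)):
        mean = total / (i - start)  # running mean of the current segment
        if abs(counts[i] - mean) > tol:
            # Rate appears to have shifted: close the segment, open a new one.
            segments.append((start, i, mean))
            start, total = i, counts[i]
        else:
            total += counts[i]

    # Close the final segment.
    segments.append((start, len(counts), total / (len(counts) - start)))
    return segments


# Example: the arrival rate jumps from about 10 to about 30 per bin.
print(segment_arrivals([10, 11, 9, 30, 31, 29], tol=5))
# -> [(0, 3, 10.0), (3, 6, 30.0)]
```

Each returned segment could then be treated as a stationary Poisson problem with the estimated mean rate. A fixed threshold is the simplest possible choice here; a statistical change-point test would be a natural alternative.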

Related articles:
arXiv:1603.04136 [math.OC] (Published 2016-03-14)
On the Influence of Momentum Acceleration on Online Learning
arXiv:1402.6361 [math.OC] (Published 2014-02-25)
Oracle-Based Robust Optimization via Online Learning
arXiv:1404.1592 [math.OC] (Published 2014-04-06, updated 2014-07-29)
The Power of Online Learning in Stochastic Network Optimization