arXiv Analytics

Sign in

arXiv:1902.04741 [cs.LG]AbstractReferencesReviewsResources

Learning and Generalization for Matching Problems

Alon Cohen, Avinatan Hassidim, Haim Kaplan, Yishay Mansour, Shay Moran

Published 2019-02-13Version 1

We study a classic algorithmic problem through the lens of statistical learning. That is, we consider a matching problem where the input graph is sampled from some distribution. This distribution is unknown to the algorithm; however, an additional graph which is sampled from the same distribution is given during a training phase (preprocessing). More specifically, the algorithmic problem is to match $k$ out of $n$ items that arrive online to $d$ categories ($d\ll k \ll n$). Our goal is to design a two-stage online algorithm that retains a small subset of items in the first stage which contains an offline matching of maximum weight. We then compute this optimal matching in a second stage. The added statistical component is that before the online matching process begins, our algorithms learn from a training set consisting of another matching instance drawn from the same unknown distribution. Using this training set, we learn a policy that we apply during the online matching process. We consider a class of online policies that we term \emph{thresholds policies}. For this class, we derive uniform convergence results both for the number of retained items and the value of the optimal matching. We show that the number of retained items and the value of the offline optimal matching deviate from their expectation by $O(\sqrt{k})$. This requires usage of less-standard concentration inequalities (standard ones give deviations of $O(\sqrt{n})$). Furthermore, we design an algorithm that outputs the optimal offline solution with high probability while retaining only $O(k\log \log n)$ items in expectation.

Related articles: Most relevant | Search more
arXiv:1906.03574 [cs.LG] (Published 2019-06-09)
Transfer Learning by Modeling a Distribution over Policies
arXiv:2010.15100 [cs.LG] (Published 2020-10-28)
Evaluating Model Robustness to Dataset Shift
arXiv:1603.02501 [cs.LG] (Published 2016-03-08)
Mixture Proportion Estimation via Kernel Embedding of Distributions