arXiv:2305.06584 Abstract | arXiv Analytics

arXiv:2305.06584 [cs.LG]Abstract References Reviews Resources

Active Learning in the Predict-then-Optimize Framework: A Margin-Based Approach

Mo Liu, Paul Grigas, Heyuan Liu, Zuo-Jun Max Shen

Published 2023-05-11Version 1

We develop the first active learning method in the predict-then-optimize framework. Specifically, we develop a learning method that sequentially decides whether to request the "labels" of feature samples from an unlabeled data stream, where the labels correspond to the parameters of an optimization model for decision-making. Our active learning method is the first to be directly informed by the decision error induced by the predicted parameters, which is referred to as the Smart Predict-then-Optimize (SPO) loss. Motivated by the structure of the SPO loss, our algorithm adopts a margin-based criterion utilizing the concept of distance to degeneracy and minimizes a tractable surrogate of the SPO loss on the collected data. In particular, we develop an efficient active learning algorithm with both hard and soft rejection variants, each with theoretical excess risk (i.e., generalization) guarantees. We further derive bounds on the label complexity, which refers to the number of samples whose labels are acquired to achieve a desired small level of SPO risk. Under some natural low-noise conditions, we show that these bounds can be better than the naive supervised learning approach that labels all samples. Furthermore, when using the SPO+ loss function, a specialized surrogate of the SPO loss, we derive a significantly smaller label complexity under separability conditions. We also present numerical evidence showing the practical value of our proposed algorithms in the settings of personalized pricing and the shortest path problem.

Categories: cs.LG, math.OC, stat.ML

Keywords: predict-then-optimize framework, margin-based approach, spo loss, active learning method, shortest path problem

Related articles: Most relevant | Search more

arXiv:2012.13325 [cs.LG] (Published 2020-12-24)

An Active Learning Method for Diabetic Retinopathy Classification with Uncertainty Quantification

Muhammad Ahtazaz Ahsan, Adnan Qayyum, Junaid Qadir, Adeel Razi

arXiv:2407.11053 [cs.LG] (Published 2024-07-09)

Sampling and active learning methods for network reliability estimation using K-terminal spanning tree

Chen Ding, Pengfei Wei, Yan Shi, Jinxing Liu, Matteo Broggi, Michael Beer

arXiv:2003.00360 [cs.LG] (Published 2020-02-29)

Decision Trees for Decision-Making under the Predict-then-Optimize Framework

Adam N. Elmachtoub, Jason Cheuk Nam Liang, Ryan McNellis