arXiv Analytics

Sign in

arXiv:2210.15576 [math.OC]AbstractReferencesReviewsResources

Regret Bounds and Experimental Design for Estimate-then-Optimize

Samuel Tan, Peter I. Frazier

Published 2022-10-27Version 1

In practical applications, data is used to make decisions in two steps: estimation and optimization. First, a machine learning model estimates parameters for a structural model relating decisions to outcomes. Second, a decision is chosen to optimize the structural model's predicted outcome as if its parameters were correctly estimated. Due to its flexibility and simple implementation, this ``estimate-then-optimize'' approach is often used for data-driven decision-making. Errors in the estimation step can lead estimate-then-optimize to sub-optimal decisions that result in regret, i.e., a difference in value between the decision made and the best decision available with knowledge of the structural model's parameters. We provide a novel bound on this regret for smooth and unconstrained optimization problems. Using this bound, in settings where estimated parameters are linear transformations of sub-Gaussian random vectors, we provide a general procedure for experimental design to minimize the regret resulting from estimate-then-optimize. We demonstrate our approach on simple examples and a pandemic control application.

Related articles: Most relevant | Search more
arXiv:2006.04554 [math.OC] (Published 2020-06-03)
Batch greedy maximization of non-submodular functions: Guarantees and applications to experimental design
arXiv:2310.06194 [math.OC] (Published 2023-10-09)
Stability and Regret bounds on Distributed Truncated Predictive Control for Networked Dynamical Systems
arXiv:1101.3663 [math.OC] (Published 2011-01-19, updated 2012-02-01)
A robust optimization approach to experimental design for model discrimination of dynamical systems