arXiv Analytics


arXiv:2310.07672 [stat.ML]

Stabilizing Estimates of Shapley Values with Control Variates

Jeremy Goldwasser, Giles Hooker

Published 2023-10-11, Version 1

Shapley values are among the most popular tools for explaining predictions of blackbox machine learning models. However, their high computational cost motivates the use of sampling approximations, inducing a considerable degree of uncertainty. To stabilize these model explanations, we propose ControlSHAP, an approach based on the Monte Carlo technique of control variates. Our methodology is applicable to any machine learning model and requires virtually no extra computation or modeling effort. On several high-dimensional datasets, we find it can produce dramatic reductions in the Monte Carlo variability of Shapley estimates.
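
As a rough illustration of the control variates idea named in the abstract, the sketch below applies it to a toy Monte Carlo problem rather than to Shapley values. It is not the authors' ControlSHAP implementation; the target function `exp(U)`, the control variate `U`, and the helper names `control_variate_estimate` and `compare_variability` are all assumptions chosen for illustration. The general recipe is to subtract from the plain estimate a correlated quantity whose true mean is known, scaled by a coefficient estimated from the samples.

```python
import numpy as np


def control_variate_estimate(f_vals, g_vals, g_mean):
    """Return (plain_mean, adjusted_mean, beta) for a sample of f and g.

    f_vals: Monte Carlo draws of the quantity of interest.
    g_vals: draws of a correlated control variate on the same samples.
    g_mean: the exactly known expectation of the control variate.
    """
    f_vals = np.asarray(f_vals, dtype=float)
    g_vals = np.asarray(g_vals, dtype=float)
    cov = np.cov(f_vals, g_vals, ddof=1)   # 2x2 sample covariance matrix
    beta = cov[0, 1] / cov[1, 1]           # variance-minimizing coefficient
    plain = f_vals.mean()
    # Correct the plain mean by the control variate's observed sampling error.
    adjusted = plain - beta * (g_vals.mean() - g_mean)
    return plain, adjusted, beta


def compare_variability(n_trials=200, n_samples=500, seed=0):
    """Repeat the toy experiment and return (plain_std, cv_std, cv_mean).

    Toy target (hypothetical, for illustration only): E[exp(U)] with
    U ~ Uniform(0, 1), using g(U) = U with known mean 0.5 as the control.
    """
    rng = np.random.default_rng(seed)
    plain_estimates, cv_estimates = [], []
    for _ in range(n_trials):
        u = rng.uniform(0.0, 1.0, size=n_samples)
        plain, adjusted, _ = control_variate_estimate(np.exp(u), u, 0.5)
        plain_estimates.append(plain)
        cv_estimates.append(adjusted)
    return (float(np.std(plain_estimates)),
            float(np.std(cv_estimates)),
            float(np.mean(cv_estimates)))


if __name__ == "__main__":
    plain_std, cv_std, cv_mean = compare_variability()
    print(f"std of plain estimates:           {plain_std:.5f}")
    print(f"std of control-variate estimates: {cv_std:.5f}")
    print(f"mean of control-variate estimates: {cv_mean:.5f}")
```

In this toy setting the control variate is almost perfectly correlated with the target, so the adjusted estimates should vary far less across repetitions than the plain ones. How much reduction is achievable in general depends on how strongly the chosen control variate correlates with the quantity being estimated.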

Related articles:
arXiv:1903.10464 [stat.ML] (Published 2019-03-25)
Explaining individual predictions when features are dependent: More accurate approximations to Shapley values
arXiv:1909.03495 [stat.ML] (Published 2019-09-08)
Shapley Values of Reconstruction Errors of PCA for Explaining Anomaly Detection
arXiv:2106.12228 [stat.ML] (Published 2021-06-23)
groupShapley: Efficient prediction explanation with Shapley values for feature groups