arXiv:2106.02597 Abstract | arXiv Analytics

arXiv:2106.02597 [cs.LG]Abstract References Reviews Resources

Model-agnostic and Scalable Counterfactual Explanations via Reinforcement Learning

Robert-Florian Samoilescu, Arnaud Van Looveren, Janis Klaise

Published 2021-06-04Version 1

Counterfactual instances are a powerful tool to obtain valuable insights into automated decision processes, describing the necessary minimal changes in the input space to alter the prediction towards a desired target. Most previous approaches require a separate, computationally expensive optimization procedure per instance, making them impractical for both large amounts of data and high-dimensional data. Moreover, these methods are often restricted to certain subclasses of machine learning models (e.g. differentiable or tree-based models). In this work, we propose a deep reinforcement learning approach that transforms the optimization procedure into an end-to-end learnable process, allowing us to generate batches of counterfactual instances in a single forward pass. Our experiments on real-world data show that our method i) is model-agnostic (does not assume differentiability), relying only on feedback from model predictions; ii) allows for generating target-conditional counterfactual instances; iii) allows for flexible feature range constraints for numerical and categorical attributes, including the immutability of protected features (e.g. gender, race); iv) is easily extended to other data modalities such as images.

Comments: 18 pages

Categories: cs.LG, stat.ML

Keywords: scalable counterfactual explanations, model-agnostic, optimization procedure, necessary minimal changes, deep reinforcement learning approach

Related articles: Most relevant | Search more

arXiv:1805.07297 [cs.LG] (Published 2018-05-13)

General solutions for nonlinear differential equations: a deep reinforcement learning approach

Shiyin Wei, Xiaowei Jin, Hui Li

arXiv:1906.08809 [cs.LG] (Published 2019-06-20)

A Deep Reinforcement Learning Approach for Global Routing

Haiguang Liao, Wentai Zhang, Xuliang Dong, Barnabas Poczos, Kenji Shimada, Levent Burak Kara

arXiv:2007.03313 [cs.LG] (Published 2020-07-07)

Predictive Maintenance for Edge-Based Sensor Networks: A Deep Reinforcement Learning Approach