arXiv:2212.06925 Abstract | arXiv Analytics

arXiv:2212.06925 [cs.LG]Abstract References Reviews Resources

On the Relationship Between Explanation and Prediction: A Causal View

Amir-Hossein Karimi, Krikamol Muandet, Simon Kornblith, Bernhard Schölkopf, Been Kim

Published 2022-12-13Version 1

Explainability has become a central requirement for the development, deployment, and adoption of machine learning (ML) models and we are yet to understand what explanation methods can and cannot do. Several factors such as data, model prediction, hyperparameters used in training the model, and random initialization can all influence downstream explanations. While previous work empirically hinted that explanations (E) may have little relationship with the prediction (Y), there is a lack of conclusive study to quantify this relationship. Our work borrows tools from causal inference to systematically assay this relationship. More specifically, we measure the relationship between E and Y by measuring the treatment effect when intervening on their causal ancestors (hyperparameters) (inputs to generate saliency-based Es or Ys). We discover that Y's relative direct influence on E follows an odd pattern; the influence is higher in the lowest-performing models than in mid-performing models, and it then decreases in the top-performing models. We believe our work is a promising first step towards providing better guidance for practitioners who can make more informed decisions in utilizing these explanations by knowing what factors are at play and how they relate to their end task.

Categories: cs.LG, stat.ME, stat.ML

Keywords: relationship, causal view, ys relative direct influence, influence downstream explanations, work borrows tools

Related articles: Most relevant | Search more

arXiv:2005.01095 [cs.LG] (Published 2020-05-03)

A Causal View on Robustness of Neural Networks

Cheng Zhang, Kun Zhang, Yingzhen Li

arXiv:2409.13232 [cs.LG] (Published 2024-09-20)

Relationship between Uncertainty in DNNs and Adversarial Attacks

Abigail Adeniran, Adewale Adeyemo

arXiv:2007.04440 [cs.LG] (Published 2020-07-08)

On the relationship between class selectivity, dimensionality, and robustness

Matthew L. Leavitt, Ari S. Morcos