arXiv Analytics

Sign in

arXiv:2206.01295 [cs.LG]AbstractReferencesReviewsResources

Rashomon Capacity: A Metric for Predictive Multiplicity in Probabilistic Classification

Hsiang Hsu, Flavio du Pin Calmon

Published 2022-06-02Version 1

Predictive multiplicity occurs when classification models with nearly indistinguishable average performances assign conflicting predictions to individual samples. When used for decision-making in applications of consequence (e.g., lending, education, criminal justice), models developed without regard for predictive multiplicity may result in unjustified and arbitrary decisions for specific individuals. We introduce a new measure of predictive multiplicity in probabilistic classification called Rashomon Capacity. Prior metrics for predictive multiplicity focus on classifiers that output thresholded (i.e., 0-1) predicted classes. In contrast, Rashomon Capacity applies to probabilistic classifiers, capturing more nuanced score variations for individual samples. We provide a rigorous derivation for Rashomon Capacity, argue its intuitive appeal, and demonstrate how to estimate it in practice. We show that Rashomon Capacity yields principled strategies for disclosing conflicting models to stakeholders. Our numerical experiments illustrate how Rashomon Capacity captures predictive multiplicity in various datasets and learning models, including neural networks. The tools introduced in this paper can help data scientists measure, report, and ultimately resolve predictive multiplicity prior to model deployment.

Related articles: Most relevant | Search more
arXiv:2001.06089 [cs.LG] (Published 2020-01-16)
Fairness Measures for Regression via Probabilistic Classification
arXiv:1904.03063 [cs.LG] (Published 2019-04-05)
Bayesian Heatmaps: Probabilistic Classification with Multiple Unreliable Information Sources
arXiv:1304.3345 [cs.LG] (Published 2013-04-11)
Probabilistic Classification using Fuzzy Support Vector Machines