arXiv:2206.01131 Abstract | arXiv Analytics

arXiv:2206.01131 [cs.LG]Abstract References Reviews Resources

Predictive Multiplicity in Probabilistic Classification

Jamelle Watson-Daniels, David C. Parkes, Berk Ustun

Published 2022-06-02Version 1

For a prediction task, there may exist multiple models that perform almost equally well. This multiplicity complicates how we typically develop and deploy machine learning models. We study how multiplicity affects predictions -- i.e., predictive multiplicity -- in probabilistic classification. We introduce new measures for this setting and present optimization-based methods to compute these measures for convex empirical risk minimization problems like logistic regression. We apply our methodology to gain insight into why predictive multiplicity arises. We study the incidence and prevalence of predictive multiplicity in real-world risk assessment tasks. Our results emphasize the need to report multiplicity more widely.

Categories: cs.LG

Keywords: predictive multiplicity, probabilistic classification, real-world risk assessment tasks, convex empirical risk minimization problems, deploy machine learning models

Related articles: Most relevant | Search more

arXiv:2001.06089 [cs.LG] (Published 2020-01-16)

Fairness Measures for Regression via Probabilistic Classification

Daniel Steinberg, Alistair Reid, Simon O'Callaghan

arXiv:1901.05350 [cs.LG] (Published 2019-01-16)

TensorFlow.js: Machine Learning for the Web and Beyond

Daniel Smilkov et al.

arXiv:2206.01295 [cs.LG] (Published 2022-06-02)

Rashomon Capacity: A Metric for Predictive Multiplicity in Probabilistic Classification

Hsiang Hsu, Flavio du Pin Calmon

arXiv Analytics

arXiv:2206.01131 [cs.LG]Abstract References Reviews Resources

Predictive Multiplicity in Probabilistic Classification

Links

Toolbox

arXiv:2206.01131 [cs.LG]AbstractReferencesReviewsResources

Predictive Multiplicity in Probabilistic Classification

Links

Toolbox

arXiv:2206.01131 [cs.LG]Abstract References Reviews Resources