arXiv Analytics

Sign in

arXiv:2206.01131 [cs.LG]AbstractReferencesReviewsResources

Predictive Multiplicity in Probabilistic Classification

Jamelle Watson-Daniels, David C. Parkes, Berk Ustun

Published 2022-06-02Version 1

For a prediction task, there may exist multiple models that perform almost equally well. This multiplicity complicates how we typically develop and deploy machine learning models. We study how multiplicity affects predictions -- i.e., predictive multiplicity -- in probabilistic classification. We introduce new measures for this setting and present optimization-based methods to compute these measures for convex empirical risk minimization problems like logistic regression. We apply our methodology to gain insight into why predictive multiplicity arises. We study the incidence and prevalence of predictive multiplicity in real-world risk assessment tasks. Our results emphasize the need to report multiplicity more widely.

Related articles: Most relevant | Search more
arXiv:2001.06089 [cs.LG] (Published 2020-01-16)
Fairness Measures for Regression via Probabilistic Classification
arXiv:1901.05350 [cs.LG] (Published 2019-01-16)
TensorFlow.js: Machine Learning for the Web and Beyond
arXiv:2206.01295 [cs.LG] (Published 2022-06-02)
Rashomon Capacity: A Metric for Predictive Multiplicity in Probabilistic Classification