arXiv Analytics

arXiv:2012.02312 [cs.LG]

ReMix: Calibrated Resampling for Class Imbalance in Deep learning

Colin Bellinger, Roberto Corizzo, Nathalie Japkowicz

Published 2020-12-03, Version 1

Class imbalance is a significant problem in applied deep learning, where trained models are used for decision support and automated decisions in critical areas such as health and medicine, transportation, and finance. Learning deep models from imbalanced training data remains challenging, and state-of-the-art solutions are typically data dependent and primarily focused on image data. Real-world imbalanced classification problems, however, are much more diverse, necessitating a general solution that can be applied to tabular, image, and text data. In this paper, we propose ReMix, a training technique that leverages batch resampling, instance mixing, and soft labels to enable the induction of robust deep models for imbalanced learning. Our results show that dense nets and CNNs trained with ReMix generally outperform the alternatives according to the g-mean and are better calibrated according to the balanced Brier score.
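The abstract names three ingredients (batch resampling, instance mixing, and soft labels) without giving the exact procedure. The following is a minimal sketch of how such a batch could be assembled, assuming class-uniform sampling and a mixup-style convex combination with a Beta-distributed weight; it is not the authors' algorithm. The names `balanced_mixed_batch`, `g_mean`, and the parameter `alpha` are hypothetical. The g-mean is computed here as the geometric mean of per-class recalls, which is a common definition and may differ in detail from the one used in the paper.

```python
import math
import random


def balanced_mixed_batch(X, y, n_classes, batch_size, alpha=0.2, rng=None):
    """Hypothetical helper: build one batch of (mixed features, soft label) pairs.

    Assumes X is a list of equal-length float lists, y holds integer labels
    in range(n_classes), and every class has at least one instance.
    """
    rng = rng or random.Random()
    # Index the training set by class so minority classes can be oversampled.
    by_class = [[i for i, label in enumerate(y) if label == c]
                for c in range(n_classes)]

    def draw():
        # Batch resampling (assumed form): pick a class uniformly,
        # then pick one of its instances uniformly.
        c = rng.randrange(n_classes)
        return rng.choice(by_class[c])

    batch = []
    for _ in range(batch_size):
        i, j = draw(), draw()
        # Instance mixing (assumed mixup-style): convex combination of two
        # resampled instances with weight lam drawn from Beta(alpha, alpha).
        lam = rng.betavariate(alpha, alpha)
        x_mix = [lam * a + (1 - lam) * b for a, b in zip(X[i], X[j])]
        # Soft label: the same convex combination applied to one-hot labels.
        y_soft = [0.0] * n_classes
        y_soft[y[i]] += lam
        y_soft[y[j]] += 1 - lam
        batch.append((x_mix, y_soft))
    return batch


def g_mean(y_true, y_pred, n_classes):
    """Geometric mean of per-class recalls.

    Assumes every class appears at least once in y_true.
    """
    recalls = []
    for c in range(n_classes):
        preds = [p for t, p in zip(y_true, y_pred) if t == c]
        recalls.append(sum(p == c for p in preds) / len(preds))
    return math.prod(recalls) ** (1.0 / n_classes)
```

Because the g-mean multiplies the per-class recalls, a model that ignores the minority class scores zero regardless of its overall accuracy, which is why the metric suits imbalanced evaluation.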

Related articles:
arXiv:1810.12513 [cs.LG] (Published 2018-10-30)
Weak-supervision for Deep Representation Learning under Class Imbalance
arXiv:1506.06472 [cs.LG] (Published 2015-06-22)
The Ebb and Flow of Deep Learning: a Theory of Local Learning
arXiv:1708.06046 [cs.LG] (Published 2017-08-21)
nuts-flow/ml: data pre-processing for deep learning