arXiv:1907.10982 [cs.LG]

Overfitting of neural nets under class imbalance: Analysis and improvements for segmentation

Zeju Li, Konstantinos Kamnitsas, Ben Glocker

Published 2019-07-25 (Version 1)

Overfitting in deep learning has been the focus of a number of recent works, yet its exact impact on the behavior of neural networks is not well understood. This study analyzes overfitting by examining how the distribution of logits alters in relation to how much the model overfits. Specifically, we find that when training with few data samples, the distribution of logit activations when processing unseen test samples of an under-represented class tends to shift towards and even across the decision boundary, while the over-represented class seems unaffected. In image segmentation, foreground samples are often heavily under-represented. We observe that sensitivity of the model drops as a result of overfitting, while precision remains mostly stable. Based on our analysis, we derive asymmetric modifications of existing loss functions and regularizers including a large margin loss, focal loss, adversarial training and mixup, which specifically aim at reducing the shift observed when embedding unseen samples of the under-represented class. We study the case of binary segmentation of brain tumor core and show that our proposed simple modifications lead to significantly improved segmentation performance over the symmetric variants.
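To make the idea of an asymmetric loss modification concrete, the sketch below shows one plausible form of an asymmetric focal loss for binary segmentation in PyTorch: the focal down-weighting factor is applied only to background (over-represented) pixels, while foreground (under-represented) pixels keep the plain cross-entropy term, so that confident foreground predictions are not suppressed. The function name, parameter choices, and exact formulation are illustrative assumptions based on the abstract, not the authors' implementation.

    import torch
    import torch.nn.functional as F

    def asymmetric_focal_loss(logits, targets, gamma=2.0):
        # logits, targets: tensors of shape (N, H, W); targets in {0, 1}.
        # Illustrative sketch only.
        p = torch.sigmoid(logits)
        ce = F.binary_cross_entropy_with_logits(
            logits, targets.float(), reduction="none"
        )
        # Background pixels (target == 0): apply the focal factor
        # (1 - p_t)^gamma = p^gamma to down-weight easy examples.
        bg_weight = (1.0 - targets.float()) * p.pow(gamma)
        # Foreground pixels (target == 1): keep full cross-entropy weight,
        # i.e. no focal down-weighting for the under-represented class.
        fg_weight = targets.float()
        return ((bg_weight + fg_weight) * ce).mean()

The same asymmetric principle could be applied analogously to the other modifications mentioned above (large margin loss, adversarial training, mixup), restricting the regularizing effect to the over-represented class.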

Comments: Accepted at MICCAI 2019
Categories: cs.LG, cs.CV, stat.ML
Related articles:
arXiv:1806.06850 [cs.LG] (Published 2018-06-13)
Polynomial Regression As an Alternative to Neural Nets
arXiv:2006.14606 [cs.LG] (Published 2020-06-25)
Global Convergence and Induced Kernels of Gradient-Based Meta-Learning with Neural Nets
arXiv:2008.06217 [cs.LG] (Published 2020-08-14)
Towards Class Imbalance in Federated Learning