arXiv Analytics

Sign in

arXiv:1906.10343 [cs.LG]AbstractReferencesReviewsResources

Semi-Supervised Learning with Self-Supervised Networks

Phi Vu Tran

Published 2019-06-25Version 1

Recent advances in semi-supervised learning have shown tremendous potential in overcoming a major barrier to the success of modern machine learning algorithms: access to vast amounts of human-labeled training data. Algorithms based on self-ensemble learning and virtual adversarial training can harness the abundance of unlabeled data to produce impressive state-of-the-art results on a number of semi-supervised benchmarks, approaching the performance of strong supervised baselines using only a fraction of the available labeled data. However, these methods often require careful tuning of many hyper-parameters and are usually not easy to implement in practice. In this work, we present a conceptually simple yet effective semi-supervised algorithm based on self-supervised learning to combine semantic feature representations from unlabeled data. Our models are efficiently trained end-to-end for the joint, multi-task learning of labeled and unlabeled data in a single stage. Striving for simplicity and practicality, our approach requires no additional hyper-parameters to tune for optimal performance beyond the standard set for training convolutional neural networks. We conduct a comprehensive empirical evaluation of our models for semi-supervised image classification on SVHN, CIFAR-10 and CIFAR-100, and demonstrate results competitive with, and in some cases exceeding, prior state of the art. Reference code and data are available at https://github.com/vuptran/sesemi

Related articles: Most relevant | Search more
arXiv:2007.01293 [cs.LG] (Published 2020-07-02)
Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning
arXiv:1905.02249 [cs.LG] (Published 2019-05-06)
MixMatch: A Holistic Approach to Semi-Supervised Learning
arXiv:1908.09574 [cs.LG] (Published 2019-08-26)
Improvability Through Semi-Supervised Learning: A Survey of Theoretical Results