arXiv Analytics

Sign in

arXiv:2006.07589 [cs.LG]AbstractReferencesReviewsResources

Adversarial Self-Supervised Contrastive Learning

Minseon Kim, Jihoon Tack, Sung Ju Hwang

Published 2020-06-13Version 1

Existing adversarial learning approaches mostly use class labels to generate adversarial samples that lead to incorrect predictions, which are then used to augment the training of the model for improved robustness. While some recent works propose semi-supervised adversarial learning methods that utilize unlabeled data, they still require class labels. However, do we really need class labels at all, for adversarially robust training of deep neural networks? In this paper, we propose a novel adversarial attack for unlabeled data, which makes the model confuse the instance-level identities of the perturbed data samples. Further, we present a self-supervised contrastive learning framework to adversarially train a robust neural network without labeled data, which aims to maximize the similarity between a random augmentation of a data sample and its instance-wise adversarial perturbation. We validate our method, Robust Contrastive Learning (RoCL), on multiple benchmark datasets, on which it obtains comparable robust accuracy over state-of-the-art supervised adversarial learning methods, and significantly improved robustness against the black box and unseen types of attacks. Moreover, with further joint fine-tuning with supervised adversarial loss, RoCL obtains even higher robust accuracy over using self-supervised learning alone. Notably, RoCL also demonstrate impressive results in robust transfer learning.

Related articles: Most relevant | Search more
arXiv:2006.13319 [cs.LG] (Published 2020-06-23)
Classification Performance Metric for Imbalance Data Based on Recall and Selectivity Normalized in Class Labels
arXiv:1301.0565 [cs.LG] (Published 2012-12-12)
An Information-Theoretic External Cluster-Validity Measure
arXiv:1312.7258 [cs.LG] (Published 2013-12-27, updated 2014-03-18)
Active Discovery of Network Roles for Predicting the Classes of Network Nodes