arXiv Analytics

arXiv:2006.07733 [cs.LG]

Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

Jean-Bastien Grill, Florian Strub, Florent Altché, Corentin Tallec, Pierre H. Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Avila Pires, Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Koray Kavukcuoglu, Rémi Munos, Michal Valko

Published 2020-06-13 (Version 1)

We introduce Bootstrap Your Own Latent (BYOL), a new approach to self-supervised image representation learning. BYOL relies on two neural networks, referred to as online and target networks, that interact and learn from each other. From an augmented view of an image, we train the online network to predict the target network representation of the same image under a different augmented view. At the same time, we update the target network with a slow-moving average of the online network. While state-of-the-art methods intrinsically rely on negative pairs, BYOL achieves a new state of the art without them. BYOL reaches $74.3\%$ top-1 classification accuracy on ImageNet using the standard linear evaluation protocol with a ResNet-50 architecture and $79.6\%$ with a larger ResNet. We show that BYOL performs on par with or better than the current state of the art on both transfer and semi-supervised benchmarks.
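The mechanism the abstract describes is compact enough to sketch in code. Below is a minimal, hypothetical PyTorch rendering, not the authors' implementation: `encoder`, `projector`, and `predictor` stand in for the paper's online networks $f_\theta$, $g_\theta$, and $q_\theta$, the default decay `tau=0.996` matches the paper's base target-decay rate, and the two augmented views are assumed to be produced by an external augmentation pipeline.

```python
# Hypothetical sketch of BYOL's core update (not the authors' code).
import copy
import torch
import torch.nn.functional as F

class BYOL(torch.nn.Module):
    def __init__(self, encoder, projector, predictor, tau=0.996):
        super().__init__()
        # Online network: encoder, projector, predictor (trained by gradient descent).
        self.online_encoder = encoder
        self.online_projector = projector
        self.predictor = predictor
        # Target network: a copy of the online encoder/projector, updated only
        # as a slow-moving (exponential) average of the online weights.
        self.target_encoder = copy.deepcopy(encoder)
        self.target_projector = copy.deepcopy(projector)
        for p in list(self.target_encoder.parameters()) + \
                 list(self.target_projector.parameters()):
            p.requires_grad = False
        self.tau = tau  # EMA decay rate

    def loss(self, v1, v2):
        """Symmetrized BYOL loss for two augmented views of the same image."""
        # The online branch predicts the target projection of the *other* view.
        p1 = self.predictor(self.online_projector(self.online_encoder(v1)))
        p2 = self.predictor(self.online_projector(self.online_encoder(v2)))
        with torch.no_grad():  # no gradients flow into the target network
            z1 = self.target_projector(self.target_encoder(v1))
            z2 = self.target_projector(self.target_encoder(v2))
        # Mean squared error between L2-normalized vectors, equivalent to
        # 2 - 2 * cosine_similarity; note that no negative pairs are used.
        def regression(p, z):
            return 2 - 2 * F.cosine_similarity(p, z.detach(), dim=-1).mean()
        return regression(p1, z2) + regression(p2, z1)

    @torch.no_grad()
    def update_target(self):
        """EMA update: target <- tau * target + (1 - tau) * online."""
        for online, target in [(self.online_encoder, self.target_encoder),
                               (self.online_projector, self.target_projector)]:
            for po, pt in zip(online.parameters(), target.parameters()):
                pt.mul_(self.tau).add_(po, alpha=1 - self.tau)
```

A training step would compute `loss(v1, v2)` on two augmentations of a batch, take an optimizer step on the online parameters and predictor only, then call `update_target()`; the paper additionally anneals the decay rate toward 1 over the course of training.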

Related articles:
arXiv:2403.05726 [cs.LG] (Published 2024-03-08)
Augmentations vs Algorithms: What Works in Self-Supervised Learning
arXiv:2307.08913 [cs.LG] (Published 2023-07-18)
Towards the Sparseness of Projection Head in Self-Supervised Learning
arXiv:2305.16789 [cs.LG] (Published 2023-05-26)
Modulate Your Spectrum in Self-Supervised Learning
Xi Weng et al.