arXiv:2006.12480 Abstract | arXiv Analytics

arXiv:2006.12480 [cs.CV]Abstract References Reviews Resources

Self-supervised Video Object Segmentation

Fangrui Zhu, Li Zhang, Yanwei Fu, Guodong Guo, Weidi Xie

Published 2020-06-22Version 1

The objective of this paper is self-supervised representation learning, with the goal of solving semi-supervised video object segmentation (a.k.a. dense tracking). We make the following contributions: (i) we propose to improve the existing self-supervised approach, with a simple, yet more effective memory mechanism for long-term correspondence matching, which resolves the challenge caused by the dis-appearance and reappearance of objects; (ii) by augmenting the self-supervised approach with an online adaptation module, our method successfully alleviates tracker drifts caused by spatial-temporal discontinuity, e.g. occlusions or dis-occlusions, fast motions; (iii) we explore the efficiency of self-supervised representation learning for dense tracking, surprisingly, we show that a powerful tracking model can be trained with as few as 100 raw video clips (equivalent to a duration of 11mins), indicating that low-level statistics have already been effective for tracking tasks; (iv) we demonstrate state-of-the-art results among the self-supervised approaches on DAVIS-2017 and YouTube-VOS, as well as surpassing most of methods trained with millions of manual segmentation annotations, further bridging the gap between self-supervised and supervised learning. Codes are released to foster any further research (https://github.com/fangruizhu/self_sup_semiVOS).

Categories: cs.CV, cs.LG

Keywords: self-supervised video object segmentation, self-supervised approach, method successfully alleviates tracker drifts, solving semi-supervised video object segmentation, demonstrate state-of-the-art results

Related articles: Most relevant | Search more

arXiv:2104.07658 [cs.CV] (Published 2021-04-15)

Self-supervised Video Object Segmentation by Motion Grouping

Charig Yang, Hala Lamdouar, Erika Lu, Andrew Zisserman, Weidi Xie

arXiv:2401.13937 [cs.CV] (Published 2024-01-25)

Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention

Quang-Trung Truong, Duc Thanh Nguyen, Binh-Son Hua, Sai-Kit Yeung

arXiv:2311.17893 [cs.CV] (Published 2023-11-29)

Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation

Shuangrui Ding, Rui Qian, Haohang Xu, Dahua Lin, Hongkai Xiong