arXiv Analytics

Sign in

arXiv:1907.01430 [cs.CV]AbstractReferencesReviewsResources

Where are the Masks: Instance Segmentation with Image-level Supervision

Issam H. Laradji, David Vazquez, Mark Schmidt

Published 2019-07-02Version 1

A major obstacle in instance segmentation is that existing methods often need many per-pixel labels in order to be effective. These labels require large human effort and for certain applications, such labels are not readily available. To address this limitation, we propose a novel framework that can effectively train with image-level labels, which are significantly cheaper to acquire. For instance, one can do an internet search for the term "car" and obtain many images where a car is present with minimal effort. Our framework consists of two stages: (1) train a classifier to generate pseudo masks for the objects of interest; (2) train a fully supervised Mask R-CNN on these pseudo masks. Our two main contribution are proposing a pipeline that is simple to implement and is amenable to different segmentation methods; and achieves new state-of-the-art results for this problem setup. Our results are based on evaluating our method on PASCAL VOC 2012, a standard dataset for weakly supervised methods, where we demonstrate major performance gains compared to existing methods with respect to mean average precision.

Related articles: Most relevant | Search more
arXiv:2007.00047 [cs.CV] (Published 2020-06-28)
A Survey on Instance Segmentation: State of the art
arXiv:2003.08429 [cs.CV] (Published 2020-03-18)
STEm-Seg: Spatio-temporal Embeddings for Instance Segmentation in Videos
arXiv:1810.10327 [cs.CV] (Published 2018-10-15)
Instance Segmentation and Object Detection with Bounding Shape Masks