arXiv Analytics

Sign in

arXiv:1511.06449 [cs.CV]AbstractReferencesReviewsResources

Learning to decompose for object detection and instance segmentation

Eunbyung Park, Alexander C. Berg

Published 2015-11-19Version 1

Although deep convolutional neural networks(CNNs) have achieved remarkable results on object detection and segmentation, pre- and post-processing steps such as region proposals and non-maximum suppression(NMS), have been required. These steps result in high computational complexity and sensitivity to hyperparameters, e.g. thresholds for NMS. In this work, we propose a novel end-to-end trainable deep neural network architecture that generates the correct number of object instances and their bounding boxes (or segmentation masks) given an image, using only a single network evaluation without any pre- or post-processing steps. We have tested on detecting digits in multi-digit images synthesized using MNIST, automatically segmenting digits in these images, and detecting cars in the KITTI benchmark dataset. The proposed approach outperforms a strong CNN baseline on the synthesized digits datasets and shows promising results on KITTI car detection.

Related articles: Most relevant | Search more
arXiv:2007.02846 [cs.CV] (Published 2020-07-06)
Point-Set Anchors for Object Detection, Instance Segmentation and Pose Estimation
arXiv:2007.00047 [cs.CV] (Published 2020-06-28)
A Survey on Instance Segmentation: State of the art
arXiv:1810.10327 [cs.CV] (Published 2018-10-15)
Instance Segmentation and Object Detection with Bounding Shape Masks