arXiv Analytics

Sign in

arXiv:1312.6229 [cs.CV]AbstractReferencesReviewsResources

OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks

Pierre Sermanet, David Eigen, Xiang Zhang, Michael Mathieu, Rob Fergus, Yann LeCun

Published 2013-12-21, updated 2014-02-24Version 4

We present an integrated framework for using Convolutional Networks for classification, localization and detection. We show how a multiscale and sliding window approach can be efficiently implemented within a ConvNet. We also introduce a novel deep learning approach to localization by learning to predict object boundaries. Bounding boxes are then accumulated rather than suppressed in order to increase detection confidence. We show that different tasks can be learned simultaneously using a single shared network. This integrated framework is the winner of the localization task of the ImageNet Large Scale Visual Recognition Challenge 2013 (ILSVRC2013) and obtained very competitive results for the detection and classifications tasks. In post-competition work, we establish a new state of the art for the detection task. Finally, we release a feature extractor from our best model called OverFeat.

Related articles: Most relevant | Search more
arXiv:1409.0575 [cs.CV] (Published 2014-09-01)
ImageNet Large Scale Visual Recognition Challenge
arXiv:1707.00755 [cs.CV] (Published 2017-07-03)
Appearance invariance in convolutional networks with neighborhood similarity
arXiv:1905.01658 [cs.CV] (Published 2019-05-05)
Drone Path-Following in GPS-Denied Environments using Convolutional Networks