arXiv Analytics

Sign in

arXiv:1505.06798 [cs.CV]AbstractReferencesReviewsResources

Accelerating Very Deep Convolutional Networks for Classification and Detection

Xiangyu Zhang, Jianhua Zou, Kaiming He, Jian Sun

Published 2015-05-26Version 1

This paper aims to accelerate the test-time computation of convolutional neural networks (CNNs), especially very deep CNNs that have substantially impacted the computer vision community. Unlike existing methods that are designed for approximating linear filters or linear responses, our method takes the nonlinear units into account. We develop an effective solution to the resulting nonlinear optimization problem without the need of stochastic gradient descent (SGD). More importantly, while current methods mainly focus on optimizing one or two layers, our nonlinear method enables an asymmetric reconstruction that reduces the rapidly accumulated error when multiple (e.g., >=10) layers are approximated. For the widely used very deep VGG-16 model, our method achieves a whole-model speedup of 4x with merely a 0.3% increase of top-5 error in ImageNet classification. Our 4x accelerated VGG-16 model also shows a graceful accuracy degradation for object detection when plugged into the latest Fast R-CNN detector.

Comments: Technical report. arXiv admin note: substantial text overlap with arXiv:1411.4229
Categories: cs.CV, cs.LG, cs.NE
Related articles: Most relevant | Search more
arXiv:1804.10167 [cs.CV] (Published 2018-04-26)
fMRI: preprocessing, classification and pattern recognition
arXiv:1803.01356 [cs.CV] (Published 2018-03-04)
Classification based Grasp Detection using Spatial Transformer Network
arXiv:1309.2074 [cs.CV] (Published 2013-09-09, updated 2014-03-09)
Learning Transformations for Clustering and Classification