arXiv:1604.04053 Abstract | arXiv Analytics

arXiv:1604.04053 [cs.CV]Abstract References Reviews Resources

Object Detection from Video Tubelets with Convolutional Neural Networks

Kai Kang, Wanli Ouyang, Hongsheng Li, Xiaogang Wang

Published 2016-04-14Version 1

Deep Convolution Neural Networks (CNNs) have shown impressive performance in various vision tasks such as image classification, object detection and semantic segmentation. For object detection, particularly in still images, the performance has been significantly increased last year thanks to powerful deep networks (e.g. GoogleNet) and detection frameworks (e.g. Regions with CNN features (R-CNN)). The lately introduced ImageNet task on object detection from video (VID) brings the object detection task into the video domain, in which objects' locations at each frame are required to be annotated with bounding boxes. In this work, we introduce a complete framework for the VID task based on still-image object detection and general object tracking. Their relations and contributions in the VID task are thoroughly studied and evaluated. In addition, a temporal convolution network is proposed to incorporate temporal information to regularize the detection results and shows its effectiveness for the task.

Comments: Accepted in CVPR 2016 as a Spotlight paper

Categories: cs.CV

Keywords: convolutional neural networks, video tubelets, deep convolution neural networks, vid task, object detection task

Related articles: Most relevant | Search more

arXiv:1606.04189 [cs.CV] (Published 2016-06-14)

Inverting face embeddings with convolutional neural networks

Andrey Zhmoginov, Mark Sandler

arXiv:1604.02532 [cs.CV] (Published 2016-04-09)

T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos

Kai Kang et al.

arXiv:1604.03168 [cs.CV] (Published 2016-04-11)

Hardware-oriented Approximation of Convolutional Neural Networks

Philipp Gysel, Mohammad Motamedi, Soheil Ghiasi

arXiv Analytics

arXiv:1604.04053 [cs.CV]Abstract References Reviews Resources

Object Detection from Video Tubelets with Convolutional Neural Networks

Links

Toolbox

arXiv:1604.04053 [cs.CV]AbstractReferencesReviewsResources

Object Detection from Video Tubelets with Convolutional Neural Networks

Links

Toolbox

arXiv:1604.04053 [cs.CV]Abstract References Reviews Resources