arXiv Analytics

Sign in

arXiv:1912.00535 [cs.CV]AbstractReferencesReviewsResources

Deep Learning for Visual Tracking: A Comprehensive Survey

Seyed Mojtaba Marvasti-Zadeh, Li Cheng, Hossein Ghanei-Yakhdan, Shohreh Kasaei

Published 2019-12-02Version 1

Visual target tracking is one of the most sought-after yet challenging research topics in computer vision. Given the ill-posed nature of the problem and its popularity in a broad range of real-world scenarios, a number of large-scale benchmark datasets have been established, on which considerable methods have been developed and demonstrated with significant progress in recent years -- predominantly by recent deep learning (DL)-based methods. This survey aims to systematically investigate the current DL-based visual tracking methods, benchmark datasets, and evaluation metrics. It also extensively evaluates and analyzes the leading visual tracking methods. First, the fundamental characteristics, primary motivations, and contributions of DL-based methods are summarized from six key aspects of: network architecture, network exploitation, network training for visual tracking, network objective, network output, and the exploitation of correlation filter advantages. Second, popular visual tracking benchmarks and their respective properties are compared, and their evaluation metrics are summarized. Third, the state-of-the-art DL-based methods are comprehensively examined on a set of well-established benchmarks of OTB2013, OTB2015, VOT2018, and LaSOT. Finally, by conducting critical analyses of these state-of-the-art methods both quantitatively and qualitatively, their pros and cons under various common scenarios are investigated. It may serve as a gentle use guide for practitioners to weigh on when and under what conditions to choose which method(s). It also facilitates a discussion on ongoing issues and sheds light on promising research directions.

Comments: 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Categories: cs.CV, cs.LG, eess.IV
Related articles: Most relevant | Search more
arXiv:1709.00308 [cs.CV] (Published 2017-09-01)
A Comprehensive Survey of Deep Learning in Remote Sensing: Theories, Tools and Challenges for the Community
arXiv:2106.03323 [cs.CV] (Published 2021-06-07)
A Comprehensive Survey on Image Dehazing Based on Deep Learning
arXiv:1412.7725 [cs.CV] (Published 2014-12-24)
Automatic Photo Adjustment Using Deep Learning