arXiv:1909.09803 Abstract | arXiv Analytics

arXiv:1909.09803 [cs.CV]Abstract References Reviews Resources

Visual Odometry Revisited: What Should Be Learnt?

Huangying Zhan, Chamara Saroj Weerasekera, Jiawang Bian, Ian Reid

Published 2019-09-21Version 1

In this work we present a monocular visual odometry (VO) algorithm which leverages geometry-based methods and deep learning. Most existing VO/SLAM systems with superior performance are based on geometry and have to be carefully designed for different application scenarios. Moreover, most monocular systems suffer from scale-drift issue. Some recent deep learning works learn VO in an end-to-end manner but the performance of these deep systems is still not comparable to geometry-based methods. In this work, we revisit the basics of VO and explore the right way for integrating deep learning with epipolar geometry and Perspective-n-Point (PnP) method. Specifically, we train two convolutional neural networks (CNNs) for estimating single-view depths and two-view optical flows as intermediate outputs. With the deep predictions, we design a simple but robust frame-to-frame VO algorithm (DF-VO) which outperforms pure deep learning-based and geometry-based methods. More importantly, our system does not suffer from the scale-drift issue being aided by a scale consistent single-view depth CNN. Extensive experiments on KITTI dataset shows the robustness of our system and a detailed ablation study shows the effect of different factors in our system.

Comments: Demo video: https://youtu.be/Nl8mFU4SJKY

Categories: cs.CV

Keywords: visual odometry, geometry-based methods, scale consistent single-view depth cnn, robust frame-to-frame vo algorithm, scale-drift issue

Related articles: Most relevant | Search more

arXiv:1803.02380 [cs.CV] (Published 2018-03-06)

Fast Cylinder and Plane Extraction from Depth Cameras for Visual Odometry

Pedro F. Proença, Yang Gao

arXiv:1803.03893 [cs.CV] (Published 2018-03-11)

Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction

Huangying Zhan, Ravi Garg, Chamara Saroj Weerasekera, Kejie Li, Harsh Agarwal, Ian Reid

arXiv:1903.04253 [cs.CV] (Published 2019-03-11)

A Unified Formulation for Visual Odometry