arXiv Analytics

arXiv:2108.04281 [cs.CV]

Visual SLAM with Graph-Cut Optimized Multi-Plane Reconstruction

Fangwen Shu, Yaxu Xie, Jason Rambach, Alain Pagani, Didier Stricker

Published 2021-08-09, Version 1

This paper presents a semantic planar SLAM system that improves pose estimation and mapping using cues from an instance planar segmentation network. While mainstream approaches use RGB-D sensors, employing a monocular camera with such a system still faces challenges such as robust data association and precise geometric model fitting. In the majority of existing work, geometric model estimation problems such as homography estimation and piece-wise planar reconstruction (PPR) are usually solved separately and sequentially by standard (greedy) RANSAC. However, setting the inlier-outlier threshold is difficult in the absence of information about the scene (i.e. the scale). In this work, we revisit these problems and argue that the two geometric models mentioned above (homographies and 3D planes) can be estimated by minimizing an energy function that exploits spatial coherence, i.e. with graph-cut optimization, which also tackles the practical issue of inaccurate output from a trained CNN. Moreover, we propose an adaptive parameter setting strategy based on our experiments, and report a comprehensive evaluation on various open-source datasets.
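
To make the idea of an energy that "exploits spatial coherence" more concrete, the following is a minimal sketch of a generic multi-model labeling energy: a data term (squared residual of each point to its assigned model) plus a Potts smoothness term that penalizes neighboring points with different labels. This is an illustration under assumptions, not the paper's formulation: the 2D line models, the chain neighborhood, the weight `smooth_weight`, and all function names are hypothetical, and the sketch only evaluates the energy rather than minimizing it with graph cuts.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]
# Hypothetical model type: a 2D line a*x + b*y + c = 0 with a^2 + b^2 = 1.
Line = Tuple[float, float, float]


def residual(p: Point, line: Line) -> float:
    """Distance from point p to a normalized line."""
    a, b, c = line
    return abs(a * p[0] + b * p[1] + c)


def labeling_energy(
    points: Sequence[Point],
    models: Sequence[Line],
    labels: Sequence[int],
    edges: Sequence[Tuple[int, int]],
    smooth_weight: float,
) -> float:
    """Data term plus Potts smoothness term for one candidate labeling."""
    # Data term: squared residual of each point to its assigned model.
    data = sum(residual(points[i], models[labels[i]]) ** 2
               for i in range(len(points)))
    # Potts term: count neighboring pairs that carry different labels.
    cuts = sum(1 for i, j in edges if labels[i] != labels[j])
    return data + smooth_weight * cuts


# Toy data (assumed): two lines y = 0 and y = 1, one ambiguous point at y = 0.5.
points: List[Point] = [(0, 0), (1, 0), (2, 0.5), (3, 0), (4, 1), (5, 1)]
models: List[Line] = [(0.0, 1.0, 0.0), (0.0, 1.0, -1.0)]
edges = [(i, i + 1) for i in range(len(points) - 1)]  # chain neighborhood

coherent = [0, 0, 0, 0, 1, 1]  # ambiguous point follows its neighbors
noisy = [0, 0, 1, 0, 1, 1]     # ambiguous point flips label on its own

e_coherent = labeling_energy(points, models, coherent, edges, smooth_weight=0.5)
e_noisy = labeling_energy(points, models, noisy, edges, smooth_weight=0.5)
print(e_coherent, e_noisy)
```

In this toy case the ambiguous point has the same residual to both lines, so the data term alone cannot decide its label; the smoothness term prefers the labeling that agrees with its neighbors. A per-point inlier threshold, as in greedy RANSAC, has no such tie-breaker.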

Related articles:
arXiv:2207.06738 [cs.CV] (Published 2022-07-14)
Semi-supervised Vector-Quantization in Visual SLAM using HGCN
arXiv:2008.00072 [cs.CV] (Published 2020-07-31)
Dynamic Object Tracking and Masking for Visual SLAM
arXiv:1902.03747 [cs.CV] (Published 2019-02-11)
Visual SLAM: Why Bundle Adjust?