
arXiv:2108.04281 [cs.CV]

Visual SLAM with Graph-Cut Optimized Multi-Plane Reconstruction

Fangwen Shu, Yaxu Xie, Jason Rambach, Alain Pagani, Didier Stricker

Published 2021-08-09, Version 1

This paper presents a semantic planar SLAM system that improves pose estimation and mapping using cues from an instance planar segmentation network. While mainstream approaches use RGB-D sensors, employing a monocular camera with such a system still faces challenges such as robust data association and precise geometric model fitting. In the majority of existing work, geometric model estimation problems such as homography estimation and piece-wise planar reconstruction (PPR) are usually solved separately and sequentially by standard (greedy) RANSAC. However, setting the inlier-outlier threshold is difficult in the absence of information about the scene (i.e., the scale). In this work, we revisit these problems and argue that the two geometric models mentioned above (homographies and 3D planes) can be estimated by minimizing an energy function that exploits spatial coherence, i.e., with graph-cut optimization, which also addresses the practical issue of inaccurate output from a trained CNN. Moreover, we propose an adaptive parameter setting strategy based on our experiments and report a comprehensive evaluation on various open-source datasets.
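To illustrate the kind of energy the abstract refers to, here is a minimal sketch of a labeling energy that combines a data term (how well each point fits its assigned plane) with a spatial-coherence term (a penalty when neighboring points carry different labels). It is not the authors' implementation: the function names, the Potts-style smoothness penalty, the weight `lam`, and the toy data are all assumptions made for illustration, and the sketch only evaluates the energy rather than minimizing it with graph cuts.

```python
import math

def point_plane_distance(point, plane):
    """Unsigned distance from a 3D point to a plane ((a, b, c), d) with a*x + b*y + c*z + d = 0."""
    (a, b, c), d = plane
    x, y, z = point
    return abs(a * x + b * y + c * z + d) / math.sqrt(a * a + b * b + c * c)

def labeling_energy(points, planes, labels, edges, lam):
    """Data term (fit residuals) plus a Potts smoothness term over neighbor edges.

    labels[i] is the index of the plane assigned to points[i];
    edges lists pairs (i, j) of neighboring points (hypothetical neighborhood).
    """
    data = sum(point_plane_distance(p, planes[l]) for p, l in zip(points, labels))
    smooth = sum(1 for i, j in edges if labels[i] != labels[j])
    return data + lam * smooth

# Toy scene (assumed data): two candidate planes z = 0 and z = 1,
# and four points in a row, the third of which is noisy.
planes = [((0.0, 0.0, 1.0), 0.0), ((0.0, 0.0, 1.0), -1.0)]
points = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.6), (3.0, 0.0, 0.0)]
edges = [(0, 1), (1, 2), (2, 3)]

# Greedy, per-point assignment: each point takes its closest plane.
greedy = [min(range(len(planes)), key=lambda k: point_plane_distance(p, planes[k]))
          for p in points]
# Spatially coherent assignment: all points on the first plane.
coherent = [0, 0, 0, 0]

e_greedy = labeling_energy(points, planes, greedy, edges, lam=0.5)
e_coherent = labeling_energy(points, planes, coherent, edges, lam=0.5)
print(greedy, e_greedy)
print(coherent, e_coherent)
```

In this toy case the per-point assignment sends the noisy point to the second plane and ends up with the higher energy (about 1.4 versus about 0.6), because the smoothness term penalizes the isolated label. A graph-cut method such as alpha-expansion would search for a low-energy labeling of this form instead of evaluating hand-picked candidates.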

Comments: accepted to ISMAR-Adjunct 2021

Categories: cs.CV, cs.RO

Keywords: graph-cut optimized multi-plane reconstruction, visual slam, semantic planar slam system, instance planar segmentation network, geometric model estimation problems

Related articles:

arXiv:2207.06738 [cs.CV] (Published 2022-07-14)

Semi-supervised Vector-Quantization in Visual SLAM using HGCN

Amir Zarringhalam, Saeed Shiry Ghidary, Ali Mohades Khorasani

arXiv:2008.00072 [cs.CV] (Published 2020-07-31)

Dynamic Object Tracking and Masking for Visual SLAM

Jonathan Vincent, Mathieu Labbé, Jean-Samuel Lauzon, François Grondin, Pier-Marc Comtois-Rivet, François Michaud

arXiv:1902.03747 [cs.CV] (Published 2019-02-11)

Visual SLAM: Why Bundle Adjust?