arXiv:2209.15056 Abstract | arXiv Analytics

arXiv:2209.15056 [cs.CV]

Graph Attention Network for Camera Relocalization on Dynamic Scenes

Mohamed Amine Ouali, Mohamed Bouguessa, Riadh Ksantini

Published 2022-09-29, Version 1

We devise a graph attention network-based approach that learns a representation of the scene triangle mesh in order to estimate the camera position of an image in a dynamic environment. Previous approaches build a scene-dependent model that explicitly or implicitly embeds the structure of the scene, using convolutional neural networks or decision trees to establish 2D/3D-3D correspondences. Such a mapping overfits the target scene and does not generalize well to dynamic changes in the environment. Our work instead solves the camera relocalization problem by using the available triangle mesh. Our 3D-3D matching framework consists of three blocks: (1) a graph neural network that computes the embedding of mesh vertices, (2) a convolutional neural network that computes the embedding of grid cells defined on the RGB-D image, and (3) a neural network model that establishes the correspondence between the two embeddings. These three components are trained end-to-end. To predict the final pose, we run the RANSAC algorithm to generate camera pose hypotheses, and we refine the prediction using the point-cloud representation. Our approach significantly improves camera pose accuracy over the state-of-the-art method, from $0.358$ to $0.506$, on the RIO10 benchmark for dynamic indoor camera relocalization.
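The pose-hypothesis step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the matching network has already produced 3D-3D correspondences (camera-frame points paired with mesh points in world coordinates), and it uses the standard Kabsch algorithm as the minimal solver inside a plain RANSAC loop. The function names `kabsch` and `ransac_pose`, the iteration count, and the inlier threshold are hypothetical choices for illustration. The final refit on all inliers is a generic least-squares step and stands in for, but is not, the point-cloud refinement the abstract refers to.

```python
import numpy as np


def kabsch(src, dst):
    """Least-squares rigid transform (R, t) with dst ~= src @ R.T + t."""
    src_c, dst_c = src.mean(axis=0), dst.mean(axis=0)
    H = (src - src_c).T @ (dst - dst_c)        # 3x3 cross-covariance
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))     # guard against reflections
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_c - R @ src_c
    return R, t


def ransac_pose(cam_pts, world_pts, iters=200, thresh=0.05, seed=0):
    """Estimate a camera-to-world pose from noisy 3D-3D correspondences.

    cam_pts, world_pts: (N, 3) arrays of putative matches (assumed input).
    iters, thresh: illustrative values, not taken from the paper.
    """
    rng = np.random.default_rng(seed)
    n = len(cam_pts)
    best_inliers = np.zeros(n, dtype=bool)
    for _ in range(iters):
        # Three correspondences are the minimal sample for a rigid transform.
        idx = rng.choice(n, size=3, replace=False)
        R, t = kabsch(cam_pts[idx], world_pts[idx])
        err = np.linalg.norm(cam_pts @ R.T + t - world_pts, axis=1)
        inliers = err < thresh
        if inliers.sum() > best_inliers.sum():
            best_inliers = inliers
    if best_inliers.sum() < 3:
        raise ValueError("RANSAC found no usable pose hypothesis")
    # Refit on every inlier of the best hypothesis.
    R, t = kabsch(cam_pts[best_inliers], world_pts[best_inliers])
    return R, t, best_inliers
```

Calling `ransac_pose(cam_pts, world_pts)` returns a rotation matrix, a translation vector, and a boolean inlier mask. Correspondences that land on moved or removed objects in a dynamic scene would be expected to show up as outliers in that mask.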

Categories: cs.CV, cs.AI, cs.LG

Keywords: dynamic scenes, convolution neural network, generate camera pose hypotheses, dynamic indoor camera relocalization, scene triangle mesh representation

Related articles:

arXiv:1610.01925 [cs.CV] (Published 2016-10-06)

Metaheuristic Algorithms for Convolution Neural Network

L. M. Rasdi Rere, Mohamad Ivan Fanany, Aniati Murni Arymurthy

arXiv:1903.03731 [cs.CV] (Published 2019-03-09)

Sparse Representations for Object and Ego-motion Estimation in Dynamic Scenes

Hirak J Kashyap, Charless Fowlkes, Jeffrey L Krichmar

arXiv:2207.03489 [cs.CV] (Published 2022-07-08)

Convolution Neural Network based Mode Decomposition for Degenerated Modes via Multiple Images from Polarizers

Hyuntai Kim
