arXiv:2403.08885 Abstract | arXiv Analytics

arXiv:2403.08885 [cs.CV]Abstract References Reviews Resources

SLCF-Net: Sequential LiDAR-Camera Fusion for Semantic Scene Completion using a 3D Recurrent U-Net

Published 2024-03-13Version 1

We introduce SLCF-Net, a novel approach for the Semantic Scene Completion (SSC) task that sequentially fuses LiDAR and camera data. It jointly estimates missing geometry and semantics in a scene from sequences of RGB images and sparse LiDAR measurements. The images are semantically segmented by a pre-trained 2D U-Net and a dense depth prior is estimated from a depth-conditioned pipeline fueled by Depth Anything. To associate the 2D image features with the 3D scene volume, we introduce Gaussian-decay Depth-prior Projection (GDP). This module projects the 2D features into the 3D volume along the line of sight with a Gaussian-decay function, centered around the depth prior. Volumetric semantics is computed by a 3D U-Net. We propagate the hidden 3D U-Net state using the sensor motion and design a novel loss to ensure temporal consistency. We evaluate our approach on the SemanticKITTI dataset and compare it with leading SSC approaches. The SLCF-Net excels in all SSC metrics and shows great temporal consistency.

Comments: 2024 IEEE International Conference on Robotics and Automation (ICRA2024), Yokohama, Japan, May 2024

Categories: cs.CV, cs.AI, cs.RO

Keywords: semantic scene completion, 3d recurrent u-net, sequential lidar-camera fusion, hidden 3d u-net state, depth prior

Tags: conference paper

Related articles: Most relevant | Search more

arXiv:2303.09977 [cs.CV] (Published 2023-03-17)

Semantic Scene Completion with Cleaner Self

Fengyun Wang, Dong Zhang, Hanwang Zhang, Jinhui Tang, Qianru Sun

arXiv:2307.07938 [cs.CV] (Published 2023-07-16)

CVSformer: Cross-View Synthesis Transformer for Semantic Scene Completion

Haotian Dong et al.

arXiv:1908.02893 [cs.CV] (Published 2019-08-08)

EdgeNet: Semantic Scene Completion from RGB-D images

Aloisio Dourado, Teofilo Emidio de Campos, Hansung Kim, Adrian Hilton

arXiv Analytics

arXiv:2403.08885 [cs.CV]Abstract References Reviews Resources

SLCF-Net: Sequential LiDAR-Camera Fusion for Semantic Scene Completion using a 3D Recurrent U-Net

Links

Toolbox

arXiv:2403.08885 [cs.CV]AbstractReferencesReviewsResources

SLCF-Net: Sequential LiDAR-Camera Fusion for Semantic Scene Completion using a 3D Recurrent U-Net

Links

Toolbox

arXiv:2403.08885 [cs.CV]Abstract References Reviews Resources