arXiv:2403.15705 Abstract | arXiv Analytics

arXiv:2403.15705 [cs.CV]Abstract References Reviews Resources

UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation

Yuliang Guo, Abhinav Kumar, Cheng Zhao, Ruoyu Wang, Xinyu Huang, Liu Ren

Published 2024-03-23Version 1

Monocular 3D reconstruction for categorical objects heavily relies on accurately perceiving each object's pose. While gradient-based optimization within a NeRF framework updates initially given poses, this paper highlights that such a scheme fails when the initial pose even moderately deviates from the true pose. Consequently, existing methods often depend on a third-party 3D object to provide an initial object pose, leading to increased complexity and generalization issues. To address these challenges, we present UPNeRF, a Unified framework integrating Pose estimation and NeRF-based reconstruction, bringing us closer to real-time monocular 3D object reconstruction. UPNeRF decouples the object's dimension estimation and pose refinement to resolve the scale-depth ambiguity, and introduces an effective projected-box representation that generalizes well cross different domains. While using a dedicated pose estimator that smoothly integrates into an object-centric NeRF, UPNeRF is free from external 3D detectors. UPNeRF achieves state-of-the-art results in both reconstruction and pose estimation tasks on the nuScenes dataset. Furthermore, UPNeRF exhibits exceptional Cross-dataset generalization on the KITTI and Waymo datasets, surpassing prior methods with up to 50% reduction in rotation and translation error.

Categories: cs.CV

Keywords: unified framework, framework integrating pose estimation, real-time monocular 3d object reconstruction, upnerf achieves state-of-the-art results, external 3d detectors

Related articles: Most relevant | Search more

arXiv:1604.04573 [cs.CV] (Published 2016-04-15)

CNN-RNN: A Unified Framework for Multi-label Image Classification

Jiang Wang, Yi Yang, Junhua Mao, Zhiheng Huang, Chang Huang, Wei Xu

arXiv:2309.16126 [cs.CV] (Published 2023-09-28)

UVL: A Unified Framework for Video Tampering Localization

Pengfei Pei, Xianfeng Zhao, Jinchuan Li, Yun Cao

arXiv:2307.02402 [cs.CV] (Published 2023-07-05)

Unbalanced Optimal Transport: A Unified Framework for Object Detection

Henri De Plaen, Pierre-François De Plaen, Johan A. K. Suykens, Marc Proesmans, Tinne Tuytelaars, Luc Van Gool