arXiv Analytics

Sign in

arXiv:1802.10367 [cs.CV]AbstractReferencesReviewsResources

Deep-6DPose: Recovering 6D Object Pose from a Single RGB Image

Thanh-Toan Do, Ming Cai, Trung Pham, Ian Reid

Published 2018-02-28Version 1

Detecting objects and their 6D poses from only RGB images is an important task for many robotic applications. While deep learning methods have made significant progress in visual object detection and segmentation, the object pose estimation task is still challenging. In this paper, we introduce an end-toend deep learning framework, named Deep-6DPose, that jointly detects, segments, and most importantly recovers 6D poses of object instances from a single RGB image. In particular, we extend the recent state-of-the-art instance segmentation network Mask R-CNN with a novel pose estimation branch to directly regress 6D object poses without any post-refinements. Our key technical contribution is the decoupling of pose parameters into translation and rotation so that the rotation can be regressed via a Lie algebra representation. The resulting pose regression loss is differential and unconstrained, making the training tractable. The experiments on two standard pose benchmarking datasets show that our proposed approach compares favorably with the state-of-the-art RGB-based multi-stage pose estimation methods. Importantly, due to the end-to-end architecture, Deep-6DPose is considerably faster than competing multi-stage methods, offers an inference speed of 10 fps that is well suited for robotic applications.

Related articles: Most relevant | Search more
arXiv:1810.03758 [cs.CV] (Published 2018-10-09)
A Summary of the 4th International Workshop on Recovering 6D Object Pose
Tomas Hodan et al.
arXiv:1909.04349 [cs.CV] (Published 2019-09-10)
FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape from Single RGB Images
arXiv:2204.02445 [cs.CV] (Published 2022-04-05)
CHORE: Contact, Human and Object REconstruction from a single RGB image