arXiv Analytics

Sign in

arXiv:1712.05277 [cs.CV]AbstractReferencesReviewsResources

Face-from-Depth for Head Pose Estimation on Depth Images

Guido Borghi, Matteo Fabbri, Roberto Vezzani, Simone Calderara, Rita Cucchiara

Published 2017-12-12Version 1

Depth cameras allow to setup reliable solutions for people monitoring and behavior understanding, specially when unstable or poor illumination conditions make unusable common RGB sensors. Therefore, we propose a complete framework for the estimation of the head and shoulder pose based on depth images only. A head detection and localization module is also included, in order to develop a complete end-to-end system. The core element of the framework is a Convolutional Neural Network, called POSEidon+, that receives as input three types of images and provides the 3D angles of the pose as output. Moreover, a Face-from-Depth component based on a Deterministic Conditional GAN model is able to hallucinate a face from the corresponding depth image and we empirically demonstrate that this positively impacts the system performances. We test the proposed framework on two public datasets, namely Biwi Kinect Head Pose and ICT-3DHP, and on Pandora, a new challenging dataset mainly inspired by the automotive setup. Experimental results show that our method overcomes all recent state-of-art works based on both intensity and depth input data, running in real time at more than 30 frames per second.

Comments: Submitted to IEEE Transactions on PAMI. arXiv admin note: substantial text overlap with arXiv:1611.10195
Categories: cs.CV
Related articles: Most relevant | Search more
arXiv:1611.10195 [cs.CV] (Published 2016-11-30)
POSEidon: Face-from-Depth for Driver Pose Estimation
arXiv:2407.05357 [cs.CV] (Published 2024-07-07)
On the power of data augmentation for head pose estimation
arXiv:2302.00592 [cs.CV] (Published 2022-12-28)
Comparative Study of Parameter Selection for Enhanced Edge Inference for a Multi-Output Regression model for Head Pose Estimation