Title
Realheponet: A Robust Single-Stage Convnet For Head Pose Estimation In The Wild
Abstract
Human head pose estimation in images has applications in many fields such as human-computer interaction or video surveillance tasks. In this work, we address this problem, defined here as the estimation of both vertical (tilt/pitch) and horizontal (pan/yaw) angles, through the use of a single Convolutional Neural Network (ConvNet) model, trying to balance precision and inference speed in order to maximize its usability in real-world applications. Our model is trained over the combination of two datasets: 'Pointing'04' (aiming at covering a wide range of poses) and 'Annotated Facial Landmarks in the Wild' (in order to improve robustness of our model for its use on real-world images). Three different partitions of the combined dataset are defined and used for training, validation and testing purposes. As a result of this work, we have obtained a trained ConvNet model, coined RealHePoNet, that given a low-resolution grayscale input image, and without the need of using facial landmarks, is able to estimate with low error both tilt and pan angles (4.4 degrees average error on the test partition). Also, given its low inference time (6 ms per head), we consider our model usable even when paired with medium-spec hardware (i.e. GTX 1060 GPU). Code available at: https://github.com/rafabs97/headpose_final Demo video at: https://www.youtube.com/watch?v=2UeuXh5DjAE.
Year
DOI
Venue
2021
10.1007/s00521-020-05511-4
NEURAL COMPUTING & APPLICATIONS
Keywords
DocType
Volume
Human head pose estimation, ConvNets, Human-computer interaction, Deep Learning
Journal
33
Issue
ISSN
Citations 
13
0941-0643
0
PageRank 
References 
Authors
0.34
0
4
Name
Order
Citations
PageRank
Rafael Berral-Soler100.34
F. J. Madrid-Cuevas249825.10
Rafael Muñoz-Salinas335325.03
Manuel J. Marín-Jiménez472041.48