Abstract | ||
---|---|---|
Head pose estimation from images has recently attracted much attention in computer vision due to its diverse applications in face recognition, driver monitoring and human computer interaction. Most successful approaches to head pose estimation formulate the problem as a nonlinear regression between image features and continuous 3D angles (i.e. yaw, pitch and roll). However, regression-like methods suffer from three main drawbacks: (1) They typically lack generalization and overfit when trained using a few samples. (2) They fail to get reliable estimates over some regions of the output space (angles) when the training set is not uniformly sampled. For instance, if the training data contains under-sampled areas for some angles. (3) They are not robust to image noise or occlusion. To address these problems, this paper presents Supervised Local Subspace Learning (SL2), a method that learns a local linear model from a sparse and non-uniformly sampled training set. SL2 learns a mixture of local tangent spaces that is robust to under-sampled regions, and due to its regularization properties it is also robust to over-fitting. Moreover, because SL2 is a generative model, it can deal with image noise. Experimental results on the CMU Multi-PIE and BU-3DFE database show the effectiveness of our approach in terms of accuracy and computational complexity. |
Year | DOI | Venue |
---|---|---|
2011 | 10.1109/CVPR.2011.5995683 | CVPR |
Keywords | Field | DocType |
local tangent spaces,supervised local subspace learning,driver monitoring,face recognition,nonuniformly sampled training set,human computer interaction,cmu multi-pie database,learning (artificial intelligence),nonlinear regression,regression analysis,supervised local subspace,image occlusion,image features,continuous head,image noise,sparse sampled training set,pose estimation,image feature,under-sampled area,generative model,roll,computational complexity,pitch,local linear model,computer vision,local tangent space,yaw,training set,continuous head pose estimation,continuous 3d angles,training data,bu-3dfe database,linear model,learning artificial intelligence | Computer vision,Facial recognition system,Pattern recognition,Subspace topology,Feature (computer vision),Computer science,Image noise,Pose,Artificial intelligence,Overfitting,Generative model,Computational complexity theory | Conference |
Volume | Issue | ISSN |
2011 | 1 | 1063-6919 |
ISBN | Citations | PageRank |
978-1-4577-0394-2 | 24 | 0.84 |
References | Authors | |
14 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Dong Huang | 1 | 163 | 14.20 |
M. Storer | 2 | 38 | 2.97 |
Fernando De La Torre | 3 | 3832 | 181.17 |
Horst Bischof | 4 | 8751 | 541.43 |