Title | ||
---|---|---|
Convolutional Neural Networks with Generalized Attentional Pooling for Action Recognition |
Abstract | ||
---|---|---|
Inspired by the recent advance in attentional pooling techniques in image classification and action recognition tasks, we propose the Generalized Attentional Pooling (GAP) based Convolutional Neural Network (CNN) algorithm for action recognition in still images. The proposed GAP-CNN can be formulated as a new approximation of the second-order/bilinear pooling techniques widely used in fine-grained image classification. Unlike the existing rank-1 approximation, a generalized factoring (with non-linear functions) is introduced to exploit the intrinsic structural information of the sample covariance matrices of convolutional layer outputs. Without requiring preprocessing steps such as object (e.g., human body) bounding boxes detection, the proposed GAP-CNN automatically focuses on the most informative part in still images. With the additional guidance of keypoints of human pose, the proposed GAP-CNN algorithm achieves the state-of-the-art action recognition accuracy on the large-scale MPII still image dataset. |
Year | DOI | Venue |
---|---|---|
2018 | 10.1109/VCIP.2018.8698720 | 2018 IEEE Visual Communications and Image Processing (VCIP) |
Keywords | Field | DocType |
Action Recognition,Generalized Attentional Pooling,Convolutional Neural Network | Computer vision,Pattern recognition,Convolutional neural network,Matrix (mathematics),Computer science,Pooling,Exploit,Preprocessor,Artificial intelligence,Contextual image classification,Bilinear interpolation,Bounding overwatch | Conference |
ISBN | Citations | PageRank |
978-1-5386-4458-4 | 0 | 0.34 |
References | Authors | |
0 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yunfeng Wang | 1 | 4 | 1.79 |
Wengang Zhou | 2 | 1226 | 79.31 |
Qilin Zhang | 3 | 41 | 3.30 |
Houqiang Li | 4 | 2090 | 172.30 |