Convolutional Neural Networks with Generalized Attentional Pooling for Action Recognition - Citegraph

Paper Info

Title
Convolutional Neural Networks with Generalized Attentional Pooling for Action Recognition

Abstract
Inspired by the recent advance in attentional pooling techniques in image classification and action recognition tasks, we propose the Generalized Attentional Pooling (GAP) based Convolutional Neural Network (CNN) algorithm for action recognition in still images. The proposed GAP-CNN can be formulated as a new approximation of the second-order/bilinear pooling techniques widely used in fine-grained image classification. Unlike the existing rank-1 approximation, a generalized factoring (with non-linear functions) is introduced to exploit the intrinsic structural information of the sample covariance matrices of convolutional layer outputs. Without requiring preprocessing steps such as object (e.g., human body) bounding boxes detection, the proposed GAP-CNN automatically focuses on the most informative part in still images. With the additional guidance of keypoints of human pose, the proposed GAP-CNN algorithm achieves the state-of-the-art action recognition accuracy on the large-scale MPII still image dataset.

Year	DOI	Venue
2018	10.1109/VCIP.2018.8698720	2018 IEEE Visual Communications and Image Processing (VCIP)
Keywords	Field	DocType
Action Recognition,Generalized Attentional Pooling,Convolutional Neural Network	Computer vision,Pattern recognition,Convolutional neural network,Matrix (mathematics),Computer science,Pooling,Exploit,Preprocessor,Artificial intelligence,Contextual image classification,Bilinear interpolation,Bounding overwatch	Conference
ISBN	Citations	PageRank
978-1-5386-4458-4	0	0.34
References	Authors
0	4

Authors (4 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Yunfeng Wang	1	4	1.79
Wengang Zhou	2	1226	79.31
Qilin Zhang	3	41	3.30
Houqiang Li	4	2090	172.30

1