Title
Convolutional Neural Networks with Generalized Attentional Pooling for Action Recognition
Abstract
Inspired by the recent advance in attentional pooling techniques in image classification and action recognition tasks, we propose the Generalized Attentional Pooling (GAP) based Convolutional Neural Network (CNN) algorithm for action recognition in still images. The proposed GAP-CNN can be formulated as a new approximation of the second-order/bilinear pooling techniques widely used in fine-grained image classification. Unlike the existing rank-1 approximation, a generalized factoring (with non-linear functions) is introduced to exploit the intrinsic structural information of the sample covariance matrices of convolutional layer outputs. Without requiring preprocessing steps such as object (e.g., human body) bounding boxes detection, the proposed GAP-CNN automatically focuses on the most informative part in still images. With the additional guidance of keypoints of human pose, the proposed GAP-CNN algorithm achieves the state-of-the-art action recognition accuracy on the large-scale MPII still image dataset.
Year
DOI
Venue
2018
10.1109/VCIP.2018.8698720
2018 IEEE Visual Communications and Image Processing (VCIP)
Keywords
Field
DocType
Action Recognition,Generalized Attentional Pooling,Convolutional Neural Network
Computer vision,Pattern recognition,Convolutional neural network,Matrix (mathematics),Computer science,Pooling,Exploit,Preprocessor,Artificial intelligence,Contextual image classification,Bilinear interpolation,Bounding overwatch
Conference
ISBN
Citations 
PageRank 
978-1-5386-4458-4
0
0.34
References 
Authors
0
4
Name
Order
Citations
PageRank
Yunfeng Wang141.79
Wengang Zhou2122679.31
Qilin Zhang3413.30
Houqiang Li42090172.30