Title
Attention with structure regularization for action recognition.
Abstract
Recognizing human action in video is an important task with a wide range of applications. Recently, motivated by the findings in human visual perception, there have been numerous attempts on introducing attention mechanisms to action recognition systems. However, it is empirically observed that an implementation of attention mechanism using attention mask of free form often generates ineffective distracted attention regions caused by overfitting, which limits the benefit of attention mechanisms for action recognition. By exploiting block-structured sparsity prior on attention regions, this paper proposed an ℓ2,1-norm group sparsity regularization for learning structured attention masks. Built upon such a regularized attention module, an attention-based recurrent network is developed for action recognition. The experimental results on two benchmark datasets showed that, the proposed method can noticeably improve the accuracy of attention masks, which results in performance gain in action recognition.
Year
DOI
Venue
2019
10.1016/j.cviu.2019.102794
Computer Vision and Image Understanding
Keywords
Field
DocType
Action recognition,Attention,Block-wise sparsity,Deep recurrent network
Human visual perception,Action recognition,Regularization (mathematics),Artificial intelligence,Overfitting,Free form,Mathematics,Machine learning
Journal
Volume
Issue
ISSN
187
1
1077-3142
Citations 
PageRank 
References 
0
0.34
0
Authors
4
Name
Order
Citations
PageRank
Yuhui Quan127021.69
Yixin Chen24326299.19
Ruotao Xu321.37
Hui Ji4110549.31