Title
Selecting Informative Frames For Action Recognition With Partial Observations
Abstract
Given a video clip that contains only one type of action (e.g., golfing), the goal of action recognition is to recognize this action category from a given set of action types. To deliver fast response for practical video applications, existing works have been endevouring on processing the leading frames of the input video. In our view, only the informative key frames extracted from this 'partial video' should be used for performing action recognition task. This will not only further speed up action recognition process due to less amount of data to be processed but also achieve higher recognition accuracy owing to more distinctive features presented to the learning network. For that, a novel a two-stage learning network architecture is proposed in this paper that consists of a selection network (S-net) and a recognition network (R-net). The S-net is a relatively-shallow network designed to efficiently identify informative key frames, while the R-net is a deep network to perform the final action recognition. In the S-net, a key frame selection criterion is further proposed for identifying informative key frames. Extensive experiments based on two benchmark datasets, UCF101 and HMDB51, have been conducted and clearly shown that our approach significantly outperforms existing state-of-the-art methods.
Year
Venue
Keywords
2018
2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP)
Action recognition, key frames, two-stream convolutional networks
Field
DocType
ISSN
Facial recognition system,Architecture,Pattern recognition,Task analysis,Computer science,Action recognition,Network architecture,Feature extraction,Artificial intelligence,Key frame,Speedup
Conference
1522-4880
Citations 
PageRank 
References 
0
0.34
0
Authors
4
Name
Order
Citations
PageRank
Yanjun Zhu1243.98
Gang Yu238219.85
Junsong Yuan33703187.68
Kai-Kuang Ma42309180.29