Title
Efficient human action recognition using histograms of motion gradients and VLAD with descriptor shape information.
Abstract
Feature extraction and encoding are two of the most crucial steps in an action recognition system. To build a powerful action recognition pipeline, both steps must be efficient and at the same time provide reliable performance. This work proposes a new approach for feature extraction and encoding that allows us to obtain real-time frame rate processing for an action recognition system. Motion is an important source of information within a video. The common approach to extracting motion information is to compute the optical flow. However, the estimation of optical flow is very demanding in terms of computational cost, and in many cases it is the most significant processing step within the overall pipeline of the target video analysis application. In this work we propose an efficient approach to capture the motion information within the video. Our proposed descriptor, Histograms of Motion Gradients (HMG), is based on a simple temporal and spatial derivation, which captures the changes between two consecutive frames. For the encoding step, a widely adopted method is the Vector of Locally Aggregated Descriptors (VLAD). VLAD is an efficient encoding method, but it considers only the difference between local descriptors and their centroids. In this work we propose Shape Difference VLAD (SD-VLAD), an encoding method which brings complementary information by using the shape information within the encoding process. We validated our proposed pipeline for action recognition on three challenging datasets (UCF50, UCF101 and HMDB51), and we also propose a real-time framework for action recognition.
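The two building blocks named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The first function assumes that "temporal and spatial derivation" means a frame difference followed by spatial gradients of that difference. The second implements standard VLAD (sum of residuals to the nearest centroid), not SD-VLAD, whose shape term is not specified in the abstract. The function names `motion_gradients` and `vlad_encode` and the final L2 normalization are illustrative choices.

```python
import numpy as np


def motion_gradients(prev_frame, next_frame):
    """Frame difference followed by spatial gradients (assumed reading of HMG's first step)."""
    # Temporal derivative: change between two consecutive grayscale frames.
    temporal = next_frame.astype(np.float64) - prev_frame.astype(np.float64)
    # Spatial derivatives of the temporal difference (axis 0 = rows, axis 1 = columns).
    gy, gx = np.gradient(temporal)
    magnitude = np.hypot(gx, gy)
    orientation = np.arctan2(gy, gx)
    return magnitude, orientation


def vlad_encode(descriptors, centroids):
    """Standard VLAD: per-centroid sum of residuals, flattened and L2-normalized."""
    # Squared Euclidean distance from every descriptor to every centroid, shape (n, k).
    d2 = ((descriptors[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    nearest = d2.argmin(axis=1)
    k, d = centroids.shape
    vlad = np.zeros((k, d))
    for i in range(k):
        assigned = descriptors[nearest == i]
        if len(assigned):
            # Accumulate the differences between descriptors and their centroid.
            vlad[i] = (assigned - centroids[i]).sum(axis=0)
    vlad = vlad.ravel()
    norm = np.linalg.norm(vlad)
    return vlad / norm if norm > 0 else vlad
```

In a full pipeline the gradient magnitudes would be quantized by orientation into block-wise histograms before encoding. That step is omitted here because the abstract does not describe it.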
Year: 2017
DOI: 10.1007/s11042-017-4795-6
Venue: Multimedia Tools Appl.
Keywords: Video classification, Action recognition, Histograms of motion gradients (HMG), Shape difference VLAD (SD-VLAD), Computational efficiency, Real-time processing
Field: Computer vision, Histogram, Pattern recognition, Computer science, Action recognition, Feature extraction, Artificial intelligence, Frame rate, Optical flow, Centroid, Encoding (memory)
DocType: Journal
Volume: 76
Issue: 21
ISSN: 1380-7501
Citations: 9
PageRank: 0.47
References: 36
Authors: 6
Name                     Order  Citations  PageRank
I. C. Duta               1      79         3.71
J. R. R. Uijlings        2      1252       95.20
Bogdan Ionescu           3      458        56.67
Kiyoharu Aizawa          4      1836       292.43
Alexander G. Hauptmann   5      7472       558.23
Nicu Sebe                6      7013       403.03