Title | ||
---|---|---|
Efficient human action recognition using histograms of motion gradients and VLAD with descriptor shape information. |
Abstract | ||
---|---|---|
Feature extraction and encoding represent two of the most crucial steps in an action recognition system. For building a powerful action recognition pipeline it is important that both steps are efficient and in the same time provide reliable performance. This work proposes a new approach for feature extraction and encoding that allows us to obtain real-time frame rate processing for an action recognition system. The motion information represents an important source of information within the video. The common approach to extract the motion information is to compute the optical flow. However, the estimation of optical flow is very demanding in terms of computational cost, in many cases being the most significant processing step within the overall pipeline of the target video analysis application. In this work we propose an efficient approach to capture the motion information within the video. Our proposed descriptor, Histograms of Motion Gradients (HMG), is based on a simple temporal and spatial derivation, which captures the changes between two consecutive frames. For the encoding step a widely adopted method is the Vector of Locally Aggregated Descriptors (VLAD), which is an efficient encoding method, however, it considers only the difference between local descriptors and their centroids. In this work we propose Shape Difference VLAD (SD-VLAD), an encoding method which brings complementary information by using the shape information within the encoding process. We validated our proposed pipeline for action recognition on three challenging datasets UCF50, UCF101 and HMDB51, and we propose also a real-time framework for action recognition. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1007/s11042-017-4795-6 | Multimedia Tools Appl. |
Keywords | Field | DocType |
Video classification, Action recognition, Histograms of motion gradients (HMG), Shape difference VLAD (SD-VLAD), Computational efficiency, Real-time processing | Computer vision,Histogram,Pattern recognition,Computer science,Action recognition,Feature extraction,Artificial intelligence,Frame rate,Optical flow,Centroid,Encoding (memory) | Journal |
Volume | Issue | ISSN |
76 | 21 | 1380-7501 |
Citations | PageRank | References |
9 | 0.47 | 36 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
I. C. Duta | 1 | 79 | 3.71 |
J. R. R. Uijlings | 2 | 1252 | 95.20 |
Bogdan Ionescu | 3 | 458 | 56.67 |
Kiyoharu Aizawa | 4 | 1836 | 292.43 |
Alexander G. Hauptmann | 5 | 7472 | 558.23 |
Nicu Sebe | 6 | 7013 | 403.03 |