Title | ||
---|---|---|
Fast And Robust Key Frame Extraction Method For Gesture Video Based On High-Level Feature Representation |
Abstract | ||
---|---|---|
In gesture video, the inner-frame difference is too subtle to be projected via low-level features, and the gesture frames, expressing semantic information, are distributed only among the tiny part of the whole video frame. This paper introduces a fast and robust key frame extraction method for gesture video, founded upon high-level feature representation to extract the gesture key frame precisely without affecting the semantic information. Firstly, a gesture video segmentation model is designed by employing SSD, which classify gesture video into the semantic scene and the static scene. And then, the 2D-DWT-based perceptual hash algorithm is studied to extract candidate static key frames. Afterward, the multi-channel gradient magnitude frequency histogram (HGMF-MC) based on improved VGG16 is developed as a new image descriptor. Finally, a key frame extraction mechanism based on HGMF-MC is proposed to generate gesture video summary of two scenes, respectively. Experiments consistently show the superiority of the proposed method on Chinese sign language, Cambridge, ChaLearn and CVRR-Hands gesture datasets. The results demonstrate that the method proposed is effective, which improves the video compression ratio and outperforms the state-of-the-art methods. |
Year | DOI | Venue |
---|---|---|
2021 | 10.1007/s11760-020-01783-4 | SIGNAL IMAGE AND VIDEO PROCESSING |
Keywords | DocType | Volume |
Gesture video classification, Improved VGG16, The histogram of gradient magnitude frequency, 2D-DWT-based perceptual hash | Journal | 15 |
Issue | ISSN | Citations |
3 | 1863-1703 | 0 |
PageRank | References | Authors |
0.34 | 0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Huimin Yang | 1 | 0 | 0.34 |
Qiuhong Tian | 2 | 0 | 0.34 |
Qiaoli Zhuang | 3 | 0 | 0.34 |
Linye Li | 4 | 0 | 0.34 |
Qinglong Liang | 5 | 0 | 0.34 |