Title
3D convolutional neural networks for human action recognition.
Abstract
We consider the automated recognition of human actions in surveillance videos. Most current methods build classifiers based on complex handcrafted features computed from the raw inputs. Convolutional neural networks (CNNs) are a type of deep model that can act directly on the raw inputs. However, such models are currently limited to handling 2D inputs. In this paper, we develop a novel 3D CNN model for action recognition. This model extracts features from both the spatial and the temporal dimensions by performing 3D convolutions, thereby capturing the motion information encoded in multiple adjacent frames. The developed model generates multiple channels of information from the input frames, and the final feature representation combines information from all channels. To further boost the performance, we propose regularizing the outputs with high-level features and combining the predictions of a variety of different models. We apply the developed models to recognize human actions in the real-world environment of airport surveillance videos, and they achieve superior performance in comparison to baseline methods.
Year
DOI
Venue
2013
10.1109/TPAMI.2012.59
IEEE Transactions on Pattern Analysis and Machine Intelligence
Keywords
DocType
Volume
motion information,cnn model,raw input,airport surveillance video,model extracts feature,developed model,human action recognition,deep model,action recognition,convolutional neural networks,human action,different model,neural network
Journal
35
Issue
ISSN
Citations 
1
1939-3539
502
PageRank 
References 
Authors
18.64
37
4
Search Limit
100502
Name
Order
Citations
PageRank
Shuiwang Ji12579122.25
wei xu23533207.17
Ming Yang33471162.50
Yu, Kai44799255.21