Title
An Enhanced 3dcnn-Convlstm For Spatiotemporal Multimedia Data Analysis
Abstract
At present, human action recognition is a challenging and complex task in the field of computer vision. The combination of CNN and RNN is a common and effective network structure for this task. Especially, we use 3DCNN in CNN part and ConvLSTM in RNN part. We divide the video into multiple temporal segments by average and compress each segment into one feature map by pooling layer. Adding the pooling layer, dropout layer, and batch normalization layer into ConvLSTM is our groundbreaking work. We test our model on KTH, UCF-11, and HMDB51 datasets and achieve a high accuracy of action recognition.
Year
DOI
Venue
2021
10.1002/cpe.5302
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE
Keywords
DocType
Volume
action recognition, ConvLSTM, 3DCNN
Journal
33
Issue
ISSN
Citations 
2
1532-0626
0
PageRank 
References 
Authors
0.34
0
6
Name
Order
Citations
PageRank
Tian Wang1216.47
Jiakun Li211.36
Mengyi Zhang301.01
Aichun Zhu4168.10
Hichem Snoussi550962.19
Chang Choi626139.04