An Enhanced 3dcnn-Convlstm For Spatiotemporal Multimedia Data Analysis - Citegraph

Paper Info

Title
An Enhanced 3dcnn-Convlstm For Spatiotemporal Multimedia Data Analysis

Abstract
At present, human action recognition is a challenging and complex task in the field of computer vision. The combination of CNN and RNN is a common and effective network structure for this task. Especially, we use 3DCNN in CNN part and ConvLSTM in RNN part. We divide the video into multiple temporal segments by average and compress each segment into one feature map by pooling layer. Adding the pooling layer, dropout layer, and batch normalization layer into ConvLSTM is our groundbreaking work. We test our model on KTH, UCF-11, and HMDB51 datasets and achieve a high accuracy of action recognition.

Year	DOI	Venue
2021	10.1002/cpe.5302	CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE
Keywords	DocType	Volume
action recognition, ConvLSTM, 3DCNN	Journal	33
Issue	ISSN	Citations
2	1532-0626	0
PageRank	References	Authors
0.34	0	6

Authors (6 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Tian Wang	1	21	6.47
Jiakun Li	2	1	1.36
Mengyi Zhang	3	0	1.01
Aichun Zhu	4	16	8.10
Hichem Snoussi	5	509	62.19
Chang Choi	6	261	39.04

1