Multimodal activity recognition with local block CNN and attention-based spatial weighted CNN - Citegraph

Paper Info

Title
Multimodal activity recognition with local block CNN and attention-based spatial weighted CNN

Abstract
Deep learning based human activity recognition approach combines spatial and temporal information to complete the recognition task. The temporal information is extracted by optical flow, which is always compensated by the warping method in order to achieve better performance. However, these methods usually take the global feature as the starting point, only consider global information of video frames, and ignore local information that reflects the changes of human behavior, causing the algorithm to be sensitive to the external environment such as occlusion, illumination change. In view of the above problems, this paper fuses the local spatial features of video frames, global spatial features and temporal features to recognize different actions, and further extracts the visual attention weight to make constraint on the global spatial features. Experiments show that the algorithm proposed in this paper has better accuracy compared with the existing methods.

Year	DOI	Venue
2019	10.1016/j.jvcir.2018.12.026	Journal of Visual Communication and Image Representation
Keywords	Field	DocType
Activity recognition,Multimodal,Visual attention	Computer vision,Image warping,Activity recognition,Pattern recognition,Global information,Visual attention,Artificial intelligence,Deep learning,Fuse (electrical),Optical flow,Mathematics	Journal
Volume	ISSN	Citations
60	1047-3203	0
PageRank	References	Authors
0.34	13	5

Authors (5 rows)

Cited by (0 rows)

References (13 rows)

Name	Order	Citations	PageRank
Suguo Zhu	1	30	4.63
Zhenying Fang	2	0	0.68
Yi Wang	3	3	4.12
Jun Yu	4	2597	105.69
Junping Du	5	789	91.80

1