Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition - Citegraph

Paper Info

Title
Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition

Abstract
ABSTRACTThe task of skeleton-based action recognition remains a core challenge in human-centred scene understanding due to the multiple granularities and large variation in human motion. Existing approaches typically employ a single neural representation for different motion patterns, which has difficulty in capturing fine-grained action classes given limited training data. To address the aforementioned problems, we propose a novel multi-granular spatio-temporal graph network for skeleton-based action classification that jointly models the coarse- and fine-grained skeleton motion patterns. To this end, we develop a dual-head graph network consisting of two interleaved branches, which enables us to extract features at two spatio-temporal resolutions in an effective and efficient manner. Moreover, our network utilises a cross-head communication strategy to mutually enhance the representations of both heads. We conducted extensive experiments on three large-scale datasets, namely NTU RGB+D 60, NTU RGB+D 120, and Kinetics-Skeleton, and achieves the state-of-the-art performance on all the benchmarks, which validates the effectiveness of our method1.

Year	DOI	Venue
2021	10.1145/3474085.3475574	International Multimedia Conference
DocType	Citations	PageRank
Conference	0	0.34
References	Authors
0	7

Authors (7 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Tailin Chen	1	0	0.34
Desen Zhou	2	64	2.90
Jian Wang	3	25	7.10
Shidong Wang	4	0	0.68
Yu Guan	5	195	22.59
Xuming He	6	697	67.54
Er-rui Ding	7	142	29.31

1