Abstract |
---|
Long-range and short-range temporal modeling are two complementary and crucial aspects of video recognition. Most state-of-the-art methods focus on short-range spatio-temporal modeling and then average multiple snippet-level predictions to yield the final video-level prediction. Thus, their video-level prediction does not consider how spatio-temporal features evolve along the temporal dimension. In this paper, we introduce a novel Dynamic Segment Aggregation (DSA) module to capture relationships among snippets. More specifically, we generate a dynamic kernel for a convolutional operation to adaptively aggregate long-range temporal information among adjacent snippets. The DSA module is an efficient plug-and-play module that can be combined with off-the-shelf clip-based models (e.g., TSM, I3D) to perform powerful long-range modeling with minimal overhead. We coin the final video architecture DSANet. We conduct extensive experiments on several video recognition benchmarks (i.e., Mini-Kinetics-200, Kinetics-400, Something-Something V1, and ActivityNet) to demonstrate its superiority. Our proposed DSA module significantly benefits various video recognition models. For example, equipped with DSA modules, the top-1 accuracy of I3D ResNet-50 improves from 74.9% to 78.2% on Kinetics-400. Code is available at https://github.com/whwu95/DSANet. |
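The abstract describes the DSA idea only at a high level: a dynamic kernel is generated from the video content and then used to convolve snippet-level features along the temporal axis. The sketch below illustrates that mechanism in NumPy; the function names, shapes, and kernel-generation details are assumptions for illustration, not the authors' actual implementation (see the linked repository for that).

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over the last axis."""
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def dynamic_segment_aggregation(snippet_feats, w_kernel, kernel_size=3):
    """Hypothetical DSA sketch.

    snippet_feats: (T, C) features, one row per snippet, from a clip-based
                   backbone (e.g., TSM or I3D).
    w_kernel:      (C, kernel_size) projection that maps a video-level
                   context vector to an input-conditioned 1-D kernel.
    """
    # 1) Summarize the whole video by average-pooling over snippets.
    video_ctx = snippet_feats.mean(axis=0)          # (C,)
    # 2) Generate a dynamic kernel conditioned on this video context;
    #    softmax keeps it a convex combination of neighboring snippets.
    kernel = softmax(video_ctx @ w_kernel)          # (kernel_size,)
    # 3) Convolve snippet features along the temporal (snippet) axis;
    #    zero padding keeps the number of snippets unchanged.
    pad = kernel_size // 2
    padded = np.pad(snippet_feats, ((pad, pad), (0, 0)))
    out = np.stack([
        sum(kernel[k] * padded[t + k] for k in range(kernel_size))
        for t in range(snippet_feats.shape[0])
    ])
    return out                                      # (T, C)

rng = np.random.default_rng(0)
feats = rng.standard_normal((8, 16))   # 8 snippets, 16-dim features
w = rng.standard_normal((16, 3))
agg = dynamic_segment_aggregation(feats, w)
print(agg.shape)
```

Because the kernel depends on the pooled video context, two different videos aggregate their snippets with different weights, which is the "adaptive long-range" behavior the abstract claims, in contrast to plain prediction averaging.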
Year | DOI | Venue |
---|---|---|
2021 | 10.1145/3474085.3475344 | International Multimedia Conference |
DocType | Citations | PageRank
---|---|---|
Conference | 0 | 0.34
References | Authors
---|---|
0 | 11
Name | Order | Citations | PageRank |
---|---|---|---|
Wenhao Wu | 1 | 10 | 4.87 |
Yuxiang Zhao | 2 | 0 | 0.68 |
Yanwu Xu | 3 | 56 | 6.59 |
Xiao Tan | 4 | 47 | 16.40 |
He, D. | 5 | 33 | 13.67 |
Zhikang Zou | 6 | 8 | 3.92 |
Jin Ye | 7 | 0 | 1.01 |
Yingying Li | 8 | 0 | 4.06 |
Mingde Yao | 9 | 0 | 0.68 |
Zichao Dong | 10 | 0 | 0.34 |
Yifeng Shi | 11 | 0 | 1.01 |