AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition - Citegraph

Paper Info

Title
AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition

Abstract
Temporal modelling is the key for efficient video action recognition. While understanding temporal information can improve recognition accuracy for dynamic actions, removing temporal redundancy and reusing past features can significantly save computation leading to efficient action recognition. In this paper, we introduce an adaptive temporal fusion network, called AdaFuse, that dynamically fuses channels from current and past feature maps for strong temporal modelling. Specifically, the necessary information from the historical convolution feature maps is fused with current pruned feature maps with the goal of improving both recognition accuracy and efficiency. In addition, we use a skipping operation to further reduce the computation cost of action recognition. Extensive experiments on SomethingV1 \u0026 V2, Jester and Mini-Kinetics show that our approach can achieve about 40% computation savings with comparable accuracy to state-of-the-art methods. The project page can be found at https://mengyuest.github.io/AdaFuse/

Year	Venue	DocType
2021	ICLR	Conference
Citations	PageRank	References
0	0.34	0
Authors
8

Authors (8 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Meng Yue	1	8	2.20
Rameswar Panda	2	85	14.02
Chung-Ching Lin	3	45	9.19
Prasanna Sattigeri	4	85	17.23
Karlinsky, Leonid	5	102	11.33
kate saenko	6	4478	202.48
Aude Oliva	7	5121	298.19
Rogério Feris	8	1529	89.95

1