Paying More Attention to Motion: Attention Distillation for Learning Video Representations. - Citegraph

Paper Info

Title
Paying More Attention to Motion: Attention Distillation for Learning Video Representations.

Abstract
We address the challenging problem of learning motion representations using deep models for video recognition. To this end, we make use of attention modules that learn to highlight regions in the video and aggregate features for recognition. Specifically, we propose to leverage output attention maps as a vehicle to transfer the learned representation from a motion (flow) network to an RGB network. We systematically study the design of attention modules, and develop a novel method for attention distillation. Our method is evaluated on major action benchmarks, and consistently improves the performance of the baseline RGB network by a significant margin. Moreover, we demonstrate that our attention maps can leverage motion cues in learning to identify the location of actions in video frames. We believe our method provides a step towards learning motion-aware representations in deep models.

Year	Venue	DocType
2019	arXiv: Computer Vision and Pattern Recognition	Journal
Volume	Citations	PageRank
abs/1904.03249	0	0.34
References	Authors
0	5

Authors (5 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Miao Liu	1	0	2.70
Xin Chen	2	478	31.81
Yun Zhang	3	0	1.01
Yin Li	4	797	35.85
James M. Rehg	5	5259	474.66

1