Learning Discriminative Motion Features Through Detection. - Citegraph

Paper Info

Title
Learning Discriminative Motion Features Through Detection.

Abstract
Despite huge success in the image domain, modern detection models such as Faster R-CNN have not been used nearly as much for video analysis. This is arguably due to the fact that detection models are designed to operate on single frames and as a result do not have a mechanism for learning motion representations directly from video. We propose a learning procedure that allows detection models such as Faster R-CNN to learn motion features directly from the RGB video data while being optimized with respect to a pose estimation task. Given a pair of video frames---Frame A and Frame B---we force our model to predict human pose in Frame A using the features from Frame B. We do so by leveraging deformable convolutions across space and time. Our network learns to spatially sample features from Frame B in order to maximize pose detection accuracy in Frame A. This naturally encourages our network to learn motion offsets encoding the spatial correspondences between the two frames. We refer to these motion offsets as DiMoFs (Discriminative Motion Features). In our experiments we show that our training scheme helps learn effective motion cues, which can be used to estimate and localize salient human motion. Furthermore, we demonstrate that as a byproduct, our model also learns features that lead to improved pose detection in still-images, and better keypoint tracking. Finally, we show how to leverage our learned model for the tasks of spatiotemporal action localization and fine-grained action recognition.

Year	Venue	DocType
2018	arXiv: Computer Vision and Pattern Recognition	Journal
Volume	Citations	PageRank
abs/1812.04172	2	0.37
References	Authors
0	5

Authors (5 rows)

Cited by (2 rows)

References (0 rows)

Name	Order	Citations	PageRank
Gedas Bertasius	1	169	10.38
Christoph Feichtenhofer	2	519	20.44
Du Tran	3	1289	38.35
Jianbo Shi	4	10207	1031.66
Lorenzo Torresani	5	2756	120.63

1