Paper Info

Title
A Feature Pair Fusion And Hierarchical Learning Framework For Video Re-Localization

Abstract
Video re-localization is an emerging research topic, but existing methods still have notable deficiencies. These mainly lie in the interference caused by irrelevant information in the input reference video and the neglect of the correlation between query and reference video features. To address these shortcomings, we present a novel framework named Semantic Relevance Learning Network. First, we extract effective proposals from the reference video as new inputs to reduce interference from irrelevant video frames. Second, two key components of our model, the Attention-based Fusion Tensor and Semantic Relevance Measurement, jointly explore the intrinsic correlation between video feature pairs and produce a score that measures their relevance. To better evaluate our model, we reorganize Thumos14 to obtain a new dataset for the video re-localization task. On both ActivityNet and Thumos14, our model achieves the best performance reported so far.

Year	2020
DOI	10.1109/ICIP40778.2020.9190869
Venue	2020 IEEE International Conference on Image Processing (ICIP)
Keywords	Video re-localization, attention-based fusion tensor, semantic relevance measurement
DocType	Conference
ISSN	1522-4880
Citations	0
PageRank	0.34
References	0
Authors	2

Authors (2 rows)

Name	Order	Citations	PageRank
Ruolin Wang	1	2	1.71
Yuan Zhou	2	44	9.82
