Title
VSRNet: End-to-end video segment retrieval with text query
Abstract
•We propose a novel framework that combines video retrieval and segment localization in one network; joint training improves the performance of each task.
•We introduce a text-aligned attention mechanism to efficiently generate temporal proposals and a collaborative ranking strategy to improve the performance of video segment retrieval.
•Extensive experiments on DiDeMo and ActivityNet Captions demonstrate the superiority of our method on the video segment retrieval (VSR) task.
Year
2021
DOI
10.1016/j.patcog.2021.108027
Venue
Pattern Recognition
Keywords
Video segment retrieval, Video retrieval, Description localization
DocType
Journal
Volume
119
Issue
1
ISSN
0031-3203
Citations
0
PageRank
0.34
References
5
Authors
5
Name | Order | Citations | PageRank
Xiao Sun | 1 | 12 | 5.88
Xiang Long | 2 | 30 | 10.70
He, D. | 3 | 33 | 13.67
Shilei Wen | 4 | 79 | 13.59
Zhou-hui Lian | 5 | 475 | 32.27