Abstract | ||
---|---|---|
• We propose a novel framework that combines video retrieval and segment localization in one network; joint training improves performance on both tasks. • We introduce a text-aligned attention mechanism to efficiently generate temporal proposals and a collaborative ranking strategy to improve video segment retrieval. • Extensive experiments on DiDeMo and ActivityNet Captions demonstrate the superiority of our method on the video segment retrieval (VSR) task. |
Year | DOI | Venue |
---|---|---|
2021 | 10.1016/j.patcog.2021.108027 | Pattern Recognition |
Keywords | DocType | Volume |
Video segment retrieval, Video retrieval, Description localization | Journal | 119 |
Issue | ISSN | Citations |
1 | 0031-3203 | 0 |
PageRank | References | Authors |
0.34 | 5 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Xiao Sun | 1 | 12 | 5.88 |
Xiang Long | 2 | 30 | 10.70 |
He, D. | 3 | 33 | 13.67 |
Shilei Wen | 4 | 79 | 13.59 |
Zhou-hui Lian | 5 | 475 | 32.27 |