Title | ||
---|---|---|
A Two-Stream Network With Joint Spatial-Temporal Distance For Video-Based Person Re-Identification |
Abstract | ||
---|---|---|
Video-based person re-identification aims to match videos of pedestrians captured by non-overlapping cameras. Video provides spatial information and temporal information. However, most existing methods do not combine these two types of information well and ignore that they are of different importance in most cases. To address the above issues, we propose a two-stream network with a joint distance metric for measuring the similarity of two videos. The proposed two-stream network has several appealing properties. First, the spatial stream focuses on multiple parts of a person and outputs robust local spatial features. Second, a lightweight and effective temporal information extraction block is introduced in video-based person re-identification. In the inference stage, the distance of two videos is measured by the weighted sum of spatial distance and temporal distance. We conduct extensive experiments on four public datasets, i.e., MARS, PRID2011, iLIDS-VID and DukeMTMC-VideoReID to show that our proposed approach outperforms existing methods in video-based person re-ID. |
Year | DOI | Venue |
---|---|---|
2020 | 10.3233/JIFS-192067 | JOURNAL OF INTELLIGENT & FUZZY SYSTEMS |
Keywords | DocType | Volume |
Person re-identification, two-stream network, local information, temporal information, similarity measurement | Journal | 39 |
Issue | ISSN | Citations |
3 | 1064-1246 | 0 |
PageRank | References | Authors |
0.34 | 0 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Zhisong Han | 1 | 0 | 0.34 |
Yaling Liang | 2 | 0 | 0.34 |
Zengqun Chen | 3 | 0 | 1.69 |
Zhiheng Zhou | 4 | 43 | 23.53 |