Abstract | ||
---|---|---|
Recent studies show that aggregating activations of convolutional layers from CNN models together as a global descriptor leads to promising performance for instance retrieval. However, due to the global pooling strategy adopted, the generated feature representation is lack of discriminative local structure information and is degraded by irrelevant image patterns or background clutter. In this paper, we propose a novel Bag-of-Deep-Visual-Words (BoDVW) model for instance retrieval. Activations of convolutional feature maps are extracted as a set of individual semantic-aware local features. An energy-based feature selection is adopted to filter out features on homogeneous background with poor distinction. To achieve the scalability of local feature-level cross matching, the local deep CNN features are quantized to adapt to the inverted index structure. A new cross-matching metric is defined to measure image similarity. Our approach achieves respectable performance in comparison to other state-of-the-art methods. Especially, it is proved to be more effective and efficient on large scale datasets. |
Year | DOI | Venue |
---|---|---|
2018 | 10.1007/978-3-319-73600-6_21 | Lecture Notes in Computer Science |
Keywords | Field | DocType |
Instance retrieval,Local deep features,Feature selection,Bag-of-Deep-Visual-Words | Inverted index,Computer vision,Feature selection,Pattern recognition,Computer science,Homogeneous,Clutter,Pooling,Local structure,Artificial intelligence,Discriminative model,Scalability | Conference |
Volume | ISSN | Citations |
10705 | 0302-9743 | 0 |
PageRank | References | Authors |
0.34 | 18 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yue Lv | 1 | 0 | 0.34 |
Wengang Zhou | 2 | 22 | 12.93 |
Qi Tian | 3 | 6443 | 331.75 |
Houqiang Li | 4 | 2090 | 172.30 |