Title
MapReduce-based clustering for near-duplicate image identification.
Abstract
In this paper, an effective algorithm is developed for tackling the problem of near-duplicate image identification from large-scale image sets, where the LLC (locality-constrained linear coding) method is seamlessly integrated with the maxIDF cut model to achieve more discriminative representations of images. By incorporating MapReduce framework for image clustering and pairwise merging, the near duplicates of images can be identified effectively from large-scale image sets. An intuitive strategy is also introduced to guide the process for parameter selection. Our experimental results on large-scale image sets have revealed that our algorithm can achieve significant improvement on both the accuracy rates and the computation efficiency as compared with other baseline methods.
Year
DOI
Venue
2017
https://doi.org/10.1007/s11042-016-4060-4
Multimedia Tools Appl.
Keywords
Field
DocType
Near-duplicate identification,Image clustering,Representative image,MapReduce,Large-scale photos
Image identification,Data mining,Pairwise comparison,Pattern recognition,Feature detection (computer vision),Computer science,Linear coding,Artificial intelligence,Cluster analysis,Merge (version control),Discriminative model,Computation
Journal
Volume
Issue
ISSN
76
22
1380-7501
Citations 
PageRank 
References 
0
0.34
22
Authors
4
Name
Order
Citations
PageRank
Wanqing Zhao1157.07
Hangzai Luo271843.92
Jinye Peng328440.93
Jianping Fan42677192.33