Title
Factors affecting rocchio-based pseudorelevance feedback in image retrieval
Abstract
AbstractPseudorelevance feedback PRF was proposed to solve the limitation of relevance feedback RF, which is based on the user-in-the-loop process. In PRF, the top-k retrieved images are regarded as PRF. Although the PRF set contains noise, PRF has proven effective for automatically improving the overall retrieval result. To implement PRF, the Rocchio algorithm has been considered as a reasonable and well-established baseline. However, the performance of Rocchio-based PRF is subject to various representation choices or factors. In this article, we examine these factors that affect the performance of Rocchio-based PRF, including image-feature representation, the number of top-ranked images, the weighting parameters of Rocchio, and similarity measure. We offer practical insights on how to optimize the performance of Rocchio-based PRF by choosing appropriate representation choices. Our extensive experiments on NUS-WIDE-LITE and Caltech 101+Corel 5000 data sets show that the optimal feature representation is color moment+wavelet texture in terms of retrieval efficiency and effectiveness. Other representation choices are that using top-20 ranked images as pseudopositive and pseudonegative feedback sets with the equal weight i.e., 0.5 by the correlation and cosine distance functions can produce the optimal retrieval result.
Year
DOI
Venue
2015
10.1002/asi.23154
Periodicals
Keywords
Field
DocType
image retrieval
Data mining,Weighting,Data set,Relevance feedback,Similarity measure,Ranking,Information retrieval,Computer science,Image retrieval,Correlation,Rocchio algorithm
Journal
Volume
Issue
ISSN
66
1
2330-1635
Citations 
PageRank 
References 
2
0.36
36
Authors
3
Name
Order
Citations
PageRank
Chih-fong Tsai1125554.93
Ya-Han Hu232324.17
Zong-Yao Chen320.36