Title
A decentralized search engine for dynamic Web communities
Abstract
Currently, most Web search engines perform search on corpus comprising nearly entire content of the Web. The same centralized search service can be performed on a single site as well. Nonetheless, there is little research on community-wide search. This paper presents a peer-to-peer search engine ComSearch. ComSearch is designed to provide small- and middle-scale online communities—the ability to perform text search within the community. Communities are formed in a self-organizing style. P2P IR system may suffer unnecessary internal traffic in answering a multi-term query. In this paper, we propose several techniques to optimize the multi-term query process. The simulation results show that our proposed algorithms have good scalability. Compared with baseline approach, our improved algorithm can reduce the communication cost by about two orders of magnitude in the best case. We also deploy the system in a small-scale network and conduct a series of experiments to estimate the actual query response time as well as to investigate the data movement effect caused by node joining. Experimental results show that multiple data movements are quite common during network expansion. However, the percentage of multiple data movements decreases when a network is getting stable after the initial frequent joining activities. This provides possibilities for improvement on P2P data movement management.
Year
DOI
Venue
2011
10.1007/s10115-009-0270-7
Knowl. Inf. Syst.
Keywords
Field
DocType
multi-term query,multiple data movement,peer-to-peer search engine,p2p data movement management,distributed hash table · bloom filter · information retrieval · community level search · web feed,community-wide search,centralized search service,data movement effect,decentralized search engine,web search engine,text search,dynamic web community,actual query response time,distributed hash table,bloom filter,information retrieval,self organization,p2p,search engine
Data mining,Search engine,Query expansion,Computer science,Full text search,Web query classification,Beam search,Web feed,Search analytics,Dynamic web page
Journal
Volume
Issue
ISSN
26
1
0219-3116
Citations 
PageRank 
References 
7
0.50
11
Authors
3
Name
Order
Citations
PageRank
Daze Wang170.84
Quincy Chi Kwan Tse270.50
Ying Zhou3394.81