Title
Making Search Efficient on Gnutella-Like P2P Systems
Abstract
Leveraging the state-of-the-art information retrieval (IR) algorithms like VSM and relevance ranking algorithm, we present GES, an efficient IR system built on top of Gnutellalike P2P networks. The key idea is that GES employs a distributed, content-based, and capacity-aware topology adaptation algorithm to organize nodes (each of which is represented by a node vector) into semantic groups. The intuition behind this design is that semantically associated nodes within a semantic group tend to be relevant to the same queries. Given a query, GES uses a capacity-aware search protocol based on semantic groups and selective one-hop node vector replication, to direct the query to the most relevant nodes which are responsible for the query, thereby achieving high recall with probing only a small faction of nodes. Moreover, GES adopts automatic query expansion techniques to improve quality of search results, and it is the first work to show that node vector size plays a very important role in system performance. The experimental results show that GES is very efficient, and even outperforms the centralized node clustering system like SETS.
Year
DOI
Venue
2005
10.1109/IPDPS.2005.273
IPDPS
Keywords
Field
DocType
information retrieval algorithm,relevance ranking algorithm,selective one-hop node vector,semantic networks,centralized node,automatic query expansion technique,ir system,semantic group,node vector size,system performance,peer-to-peer systems,ges,gnutella-like p2p systems,search protocol,gnutella-like p2p networks,capacity-aware search protocol,node vector,relevant node,capacity-aware topology adaptation algorithm,search efficient,peer-to-peer computing,efficient ir system,selective one-hop node vector replication,vsm,information retrieval systems,query formulation,protocols,information retrieval,network topology,routing
Data mining,Ranking,Query expansion,Computer science,Keyword search,Peer to peer computing,Intuition,Network topology,Semantic network,Node clustering
Conference
ISBN
Citations 
PageRank 
0-7695-2312-9
29
1.68
References 
Authors
15
3
Name
Order
Citations
PageRank
Yingwu Zhu136223.69
Xiaoyu Yang2555.21
Yiming Hu363944.91