Title
Efficient semantic search on DHT overlays
Abstract
Distributed hash tables (DHTs) excel at exact-match lookups, but they do not directly support complex queries such as semantic search that is based on content. In this paper, we propose a novel approach to efficient semantic search on DHT overlays. The basic idea is to place indexes of semantically close files into same peer nodes with high probability by exploiting information retrieval algorithms and locality sensitive hashing. A query for retrieving semantically close files is answered with high recall by consulting only a small number (e.g., 10-20) of nodes that stores the indexes of the files semantically close to the query. Our approach adds only index information to peer nodes, imposing only a small storage overhead. Via detailed simulations, we show that our approach achieves high recall for queries at very low cost, i.e., the number of nodes visited for a query is about 10-20, independent of the overlay size.
Year
DOI
Venue
2007
10.1016/j.jpdc.2007.01.005
J. Parallel Distrib. Comput.
Keywords
Field
DocType
high probability,index information,high recall,semantic locating,information retrieval algorithm,top term,peer-to-peer,dht overlay,semantically close file,semantic search,locality sensitive hashing,small number,recall,novel approach,semantic indexing,efficient semantic search,vector space model,complex query,distributed hash table,indexation,information retrieval
Locality-sensitive hashing,Locality,Information retrieval,Semantic search,Peer-to-peer,Computer science,Search engine indexing,Hash function,Vector space model,Hash table
Journal
Volume
Issue
ISSN
67
5
Journal of Parallel and Distributed Computing
Citations 
PageRank 
References 
25
0.91
24
Authors
2
Name
Order
Citations
PageRank
Yingwu Zhu136223.69
Yiming Hu263944.91