Title
Assigning documents to master sites in distributed search
Abstract
An appealing solution to scale Web search with the growth of the Internet is the use of distributed architectures. Distributed search engines rely on multiple sites deployed in distant regions across the world, where each site is specialized to serve queries issued by the users of its region. This paper investigates the problem of assigning each document to a master site. We show that by leveraging similarities between a document and the activity of the users, we can accurately detect which site is the most relevant to place a document. We conduct various experiments using two document assignment approaches, showing performance improvements of up to 20.8% over a baseline technique which assigns the documents to search sites based on their language.
Year
DOI
Venue
2011
10.1145/2063576.2063591
CIKM
Keywords
Field
DocType
performance improvement,distant region,web search,document assignment approach,assigning document,various experiment,appealing solution,master site,search engine,baseline technique,multiple site,distributed architecture,web search engine,indexation
Data mining,Web search query,World Wide Web,Site map,Search engine,Semantic search,Information retrieval,Computer science,Distributed index,Search engine indexing,Search analytics,The Internet
Conference
Citations 
PageRank 
References 
7
0.44
15
Authors
5
Name
Order
Citations
PageRank
Roi Blanco187257.42
B. Barla Cambazoglu273538.87
Flavio P. Junqueira3103749.96
Ivan Kelly4130.93
Vincent Leroy519718.23