Title
Scalability and efficiency challenges in large-scale web search engines
Abstract
The main goals of a web search engine are quality, efficiency, and scalability. In this tutorial, we focus on the last two goals, providing a fairly comprehensive overview of the scalability and efficiency challenges in large-scale web search engines. In particular, the tutorial provides an in-depth architectural overview of a web search engine, mainly focusing on the web crawling, indexing, and query processing components. The scalability and efficiency issues encountered in these components are presented at four different granularities: at the level of a single computer, a cluster of computers, a single data center, and a multi-center search engine. The tutorial also points at open research problems and provides recommendations to researchers who are new to the field.
Year
DOI
Venue
2014
10.1145/2567948.2577271
WWW (Companion Volume)
Keywords
Field
DocType
in-depth architectural overview,comprehensive overview,single computer,large-scale web search engine,multi-center search engine,web crawling,efficiency challenge,web search engine,efficiency issue,single data center,scalability,indexing,crawling,efficiency
Web search engine,Web search query,World Wide Web,Computer science,Search engine indexing,Web query classification,Web modeling,Search analytics,Web crawler,Distributed web crawling
Conference
Citations 
PageRank 
References 
1
0.42
3
Authors
2
Name
Order
Citations
PageRank
Ricardo Baeza-Yates16173635.97
B. Barla Cambazoglu273538.87