Abstract | ||
---|---|---|
The main goals of a web search engine are quality, efficiency, and scalability. In this tutorial, we focus on the last two goals, providing a fairly comprehensive overview of the scalability and efficiency challenges in large-scale web search engines. In particular, the tutorial provides an in-depth architectural overview of a web search engine, mainly focusing on the web crawling, indexing, and query processing components. The scalability and efficiency issues encountered in these components are presented at four different granularities: at the level of a single computer, a cluster of computers, a single data center, and a multi-center search engine. The tutorial also points at open research problems and provides recommendations to researchers who are new to the field. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1145/2567948.2577271 | WWW (Companion Volume) |
Keywords | Field | DocType |
in-depth architectural overview,comprehensive overview,single computer,large-scale web search engine,multi-center search engine,web crawling,efficiency challenge,web search engine,efficiency issue,single data center,scalability,indexing,crawling,efficiency | Web search engine,Web search query,World Wide Web,Computer science,Search engine indexing,Web query classification,Web modeling,Search analytics,Web crawler,Distributed web crawling | Conference |
Citations | PageRank | References |
1 | 0.42 | 3 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ricardo Baeza-Yates | 1 | 6173 | 635.97 |
B. Barla Cambazoglu | 2 | 735 | 38.87 |