Abstract | ||
---|---|---|
Modern web search engines, while indexing billions of web pages, are expected to process queries and return results in a very short time. Many approaches have been proposed for efficiently computing top-k query results, but most of them ignore one key factor in the ranking functions of commercial search engines - term-proximity, which is the metric of the distance between query terms in a document. When term-proximity is included in ranking functions, most of the existing top-k algorithms will become inefficient. To address this problem, in this paper we propose to build a compact phrase index to speed up the search process when incorporating the term-proximity factor. The compact phrase index can help more accurately estimate the score upper bounds of unknown documents. The size of the phrase index is controlled by including a small portion of phrases which are possibly helpful for improving search performance. Phrase index has been used to process phrase queries in existing work. It is, however, to the best of our knowledge, the first time that phrase index is used to improve the performance of generic queries. Experimental results show that, compared with the state-of-the-art top-k computation approaches, our approach can reduce average query processing time to 1/5 for typical setttings. |
Year | DOI | Venue |
---|---|---|
2008 | 10.1145/1458082.1458174 | CIKM |
Keywords | Field | DocType |
ranking function,modern web search engine,phrase query,compact phrase index,average query processing time,non-phrase query,search performance,search process,existing top-k algorithm,phrase indexing help,commercial search engine,phrase index,upper bound,indexing terms,indexation,search engine,web search engine,web pages | Web search query,Data mining,Web page,Ranking,Information retrieval,Phrase search,Computer science,Phrase,Search engine indexing,Web query classification,Index term | Conference |
Citations | PageRank | References |
13 | 0.55 | 21 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Mingjie Zhu | 1 | 89 | 4.32 |
Shuming Shi | 2 | 620 | 58.27 |
Nenghai Yu | 3 | 2238 | 183.33 |
Ji-Rong Wen | 4 | 4431 | 265.98 |