Title
Estimating Size of Search Engines in an Uncooperative Environment
Abstract
The number of documents that are indexed by a search engine is referred to as the size of the search engine. The information about the size of each underlying search engine is essential for any metasearch engine to conduct search engine selection, result merging and a few other processes. Thus, effectively estimating the size of search engines is important for a metasearch engine that incorporates multiple autonomous search engines. In this paper, we propose an algorithm that achieves better accuracy compared to the other existing methods for estimating the size of search engines, without losing efficiency. Compared to the Sample-Resample approach, which is the best-known approach in literature, our technique also shows much better tolerance to unfavorable environments.
Year
Venue
Keywords
2004
Workshop on Web-based Support Systems
search engine,indexation
Field
DocType
Citations 
Data mining,Search aggregator,Metasearch engine,Search engine,Information retrieval,Beam search,Search analytics,Engineering,Merge (version control)
Conference
3
PageRank 
References 
Authors
0.47
5
6
Name
Order
Citations
PageRank
Surendra Karnatapu130.81
Karthik Ramachandran2283.27
Zonghuan Wu349227.08
Biren Shah4403.64
Vijay V. Raghavan52544506.92
Ryan G. Benton64615.78