Title
A prediction model for web search hit counts using word frequencies
Abstract
A search engine user with a well-defined information need is not interested in getting thousands of hits, but a few hits that are all highly relevant to their search. Often search words need to be refined and augmented to narrow results to more relevant pages. However, an overly specific query may lead to no hits at all, while most typical queries lead to thousands or even millions of them, both undesirable outcomes. This paper suggests a query rewriting method for generating alternative query strings and proposes a hit count prediction model for predicting the number of search engine hits for each alternative query string, based on the English language frequencies of the words in the search terms. Using the hit count prediction model, different types of search strategies, such as a lowest hit count query preference, can be utilized to improve users' search experience. We present an evaluation experiment of the hit count prediction model for three major search engines. We also discuss and quantify how far the Google, Yahoo! and Bing search engines diverge from monotonic behaviour, considering negative and positive search terms separately.
Year
DOI
Venue
2011
10.1177/0165551511415183
J. Information Science
Keywords
DocType
Volume
web search hit,search strategy,search engine hit,major search engine,word frequency,alternative query string,search engine user,positive search term,hit count prediction model,search term,search experience,Bing search engine
Journal
37
Issue
ISSN
Citations 
5
0165-5515
1
PageRank 
References 
Authors
0.35
26
3
Name
Order
Citations
PageRank
Tian Tian112017.90
Soon Ae Chun2893100.67
James Geller310.35