Title
Robust classification of rare queries using web knowledge
Abstract
We propose a methodology for building a practical robust query classification system that can identify thousands of query classes with reasonable accuracy, while dealing in real-time with the query volume of a commercial web search engine. We use a blind feedback technique: given a query, we determine its topic by classifying the web search results retrieved by the query. Motivated by the needs of search advertising, we primarily focus on rare queries, which are the hardest from the point of view of machine learning, yet in aggregation account for a considerable fraction of search engine traffic. Empirical evaluation confirms that our methodology yields a considerably higher classification accuracy than previously reported. We believe that the proposed methodology will lead to better matching of online ads to rare queries and overall to a better user experience.
Year
DOI
Venue
2007
10.1145/1277741.1277783
SIGIR
Keywords
Field
DocType
query volume,web search result,search advertising,proposed methodology,practical robust query classification,commercial web search engine,methodology yield,web knowledge,robust classification,rare query,search engine traffic,query class,real time,user experience,search engine,query classification,machine learning
Web search engine,Data mining,Query language,Computer science,Web query classification,Artificial intelligence,Query optimization,Web search query,Information retrieval,Query expansion,Sargable,Online aggregation,Machine learning
Conference
Citations 
PageRank 
References 
113
4.16
19
Authors
6
Search Limit
100113
Name
Order
Citations
PageRank
Andrei Broder17357920.20
Marcus Fontoura2111661.74
Evgeniy Gabrilovich34573224.48
Amruta Joshi41878.67
Vanja Josifovski52265148.84
Zhang, Tong67126611.43