Abstract | ||
---|---|---|
Web query classification is an effective way to understand Web user intents, which can in turn improve the relevance of Web search and online advertising. However, Web queries are usually very short and therefore cannot fully reflect their meanings. Moreover, it is quite hard to obtain enough training data to train accurate classifiers. Previous work on query classification has therefore focused on two issues: how to represent Web queries through query expansion, and how to increase the amount of training data. In this paper, we took product query classification, the task of classifying Web queries into a predefined product taxonomy, as an example and systematically studied the impact of query expansion and of training data size. We proposed two methods of enriching Web queries and three approaches to collecting training data. We then conducted a series of experiments comparing the classification performance of different combinations of training data and query representations on a real data set, which consists of hundreds of thousands of queries collected from a popular commercial search engine. The experiments yielded several observations that had not been discussed before. Finally, we proposed an effective and efficient product query classification method based on these observations. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1145/1645953.1646047 | International Conference on Information and Knowledge Management |
Keywords | Field | DocType |
training data, query enrichment, query classification, efficient product query classification, enough training data, web query, query representation, product query classification, query expansion, online advertising, search engine | Query optimization, Web search query, Data mining, Query language, Query expansion, Information retrieval, Computer science, Sargable, Web query classification, Spatial query, Online aggregation | Conference |
Citations | PageRank | References |
9 | 0.53 | 20 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Dou Shen | 1 | 1224 | 59.46 |
Ying Li | 2 | 265 | 21.64 |
Xiao Li | 3 | 18 | 1.75 |
Dengyong Zhou | 4 | 1709 | 65.63 |