Title
Weakly Supervised Co-Training of Query Rewriting andSemantic Matching for e-Commerce.
Abstract
Relevance is the core problem of a search engine, and one of the main challenges is the vocabulary gap between user queries and documents. This problem is more serious in e-commerce, because language in product titles is more professional. Query rewriting and semantic matching are two key techniques to bridge the semantic gap between them to improve relevance. Recently, deep neural networks have been successfully applied to the two tasks and enhanced the relevance performance. However, such approaches suffer from the sparseness of training data in e-commerce scenario. In this study, we investigate the instinctive connection between query rewriting and semantic matching tasks, and propose a co-training framework to address the data sparseness problem when training deep neural networks. We first build a huge unlabeled dataset from search logs, on which the two tasks can be considered as two different views of the relevance problem. Then we iteratively co-train them via labeled data generated from this unlabeled set to boost their performance simultaneously. We conduct a series of offline and online experiments on a real-world e-commerce search engine, and the results demonstrate that the proposed method improves relevance significantly.
Year
DOI
Venue
2019
10.1145/3289600.3291039
WSDM
Keywords
Field
DocType
co-training, neural networks, query rewriting, semantic matching
Query Rewriting,Search engine,Information retrieval,Computer science,Semantic gap,Co-training,Artificial neural network,Vocabulary,E-commerce,Semantic matching
Conference
ISBN
Citations 
PageRank 
978-1-4503-5940-5
0
0.34
References 
Authors
16
8
Name
Order
Citations
PageRank
Rong Xiao100.68
Jianhui Ji201.01
Baoliang Cui311.72
Hai-Hong Tang4174.76
Wenwu Ou519115.56
Yanghua Xiao648254.90
Jiwei Tan7585.78
Xuan Ju800.34