Title
Beyond Relevance Ranking: A General Graph Matching Framework for Utility-Oriented Learning to Rank
Abstract
AbstractLearning to rank from logged user feedback, such as clicks or purchases, is a central component of many real-world information systems. Different from human-annotated relevance labels, the user feedback is always noisy and biased. Many existing learning to rank methods infer the underlying relevance of query–item pairs based on different assumptions of examination, and still optimize a relevance based objective. Such methods rely heavily on the correct estimation of examination, which is often difficult to achieve in practice. In this work, we propose a general framework U-rank+ for learning to rank with logged user feedback from the perspective of graph matching. We systematically analyze the biases in user feedback, including examination bias and selection bias. Then, we take both biases into consideration for unbiased utility estimation that directly based on user feedback, instead of relevance. In order to maximize the estimated utility in an efficient manner, we design two different solvers based on Sinkhorn and LambdaLoss for U-rank+. The former is based on a standard graph matching algorithm, and the latter is inspired by the traditional method of learning to rank. Both of the algorithms have good theoretical properties to optimize the unbiased utility objective while the latter is proved to be empirically more effective and efficient in practice. Our framework U-rank+ can deal with a general utility function and can be used in a widespread of applications including web search, recommendation, and online advertising. Semi-synthetic experiments on three benchmark learning to rank datasets demonstrate the effectiveness of U-rank+. Furthermore, our proposed framework has been deployed on two different scenarios of a mainstream App store, where the online A/B testing shows that U-rank+ achieves an average improvement of 19.2% on click-through rate and 20.8% improvement on conversion rate in recommendation scenario, and 5.12% on platform revenue in online advertising scenario over the production baselines.
Year
DOI
Venue
2022
10.1145/3464303
ACM Transactions on Information Systems
Keywords
DocType
Volume
Learning to rank, utilitymaximization, graph matching, implicit feedback, position bias, examination bias, selection bias
Journal
40
Issue
ISSN
Citations 
2
1046-8188
0
PageRank 
References 
Authors
0.34
0
9
Name
Order
Citations
PageRank
Xinyi Dai100.68
Yunjia Xi220.70
Weinan Zhang3122897.24
Qing Liu481.54
Ruiming Tang5397.21
Xiuqiang He631239.21
Jiawei Hou720.70
Jun Wang8164.99
Yong Yu97637380.66