Abstract | ||
---|---|---|
Complex machine learning models are now an integral part of modern, large-scale retrieval systems. However, collection size growth continues to outpace advances in efficiency improvements in the learning models which achieve the highest effectiveness. In this paper, we re-examine the importance of tightly integrating feature costs into multi-stage learning-to-rank (LTR) IR systems. We present a novel approach to optimizing cascaded ranking models which can directly leverage a variety of different state-of-the-art LTR rankers such as LambdaMART and Gradient Boosted Decision Trees. Using our cascade model, we conclusively show that feature costs and the number of documents being re-ranked in each stage of the cascade can be balanced to maximize both efficiency and effectiveness. Finally, we also demonstrate that our cascade model can easily be deployed on commonly used collections to achieve state-of-the-art effectiveness results while only using a subset of the features required by the full model. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1145/3077136.3080819 | SIGIR |
Field | DocType | ISBN |
Learning to rank,Data mining,Leverage (finance),Ranking,Ranking SVM,Computer science,Learning models,Cascade,Artificial intelligence,Machine learning,Alternating decision tree | Conference | 978-1-4503-5022-8 |
Citations | PageRank | References |
11 | 0.48 | 26 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ruey-Cheng Chen | 1 | 108 | 11.87 |
Luke Gallagher | 2 | 16 | 2.92 |
Roi Blanco | 3 | 872 | 57.42 |
Shane Culpepper | 4 | 519 | 47.52 |