Title
Posthoc Interpretability of Learning to Rank Models using Secondary Training Data.
Abstract
Predictive models are omnipresent in automated and assisted decision making scenarios. But for the most part they are used as black boxes which output a prediction without understanding partially or even completely how different features influence the model prediction avoiding algorithmic transparency. Rankings are ordering over items encoding implicit comparisons typically learned using a family of features using learning-to-rank models. In this paper we focus on how best we can understand the decisions made by a ranker in a post-hoc model agnostic manner. We operate on the notion of interpretability based on explainability of rankings over an interpretable feature space. Furthermore we train a tree based model (inherently interpretable) using labels from the ranker, called secondary training data to provide explanations. Consequently, we attempt to study how well does a subset of features, potentially interpretable, explain the full model under different training sizes and algorithms. We do experiments on the learning to rank datasets with 30k queries and report results that serve show in certain settings we can learn a faithful interpretable ranker.
Year
Venue
Field
2018
arXiv: Information Retrieval
Training set,Transparency (graphic),Learning to rank,Data mining,Interpretability,Feature vector,Computer science,Black box,Model prediction,Encoding (memory)
DocType
Volume
Citations 
Journal
abs/1806.11330
2
PageRank 
References 
Authors
0.51
0
2
Name
Order
Citations
PageRank
Jaspreet Singh1202.96
Avishek Anand2145.10