Title
Active Comparison of Prediction Models.
Abstract
We address the problem of comparing the risks of two given predictive models - for instance, a baseline model and a challenger - as confidently as possible on a fixed labeling budget. This problem occurs whenever models cannot be compared on held-out training data, possibly because the training data are unavailable or do not reflect the desired test distribution. In this case, new test instances have to be drawn and labeled at a cost. We devise an active comparison method that selects instances according to an instrumental sampling distribution. We derive the sampling distribution that maximizes the power of a statistical test applied to the observed empirical risks, and thereby minimizes the likelihood of choosing the inferior model. Empirically, we investigate model selection problems on several classification and regression tasks and study the accuracy of the resulting p-values.
Year
Venue
DocType
2012
NIPS
Conference
Citations 
PageRank 
References 
3
0.38
8
Authors
3
Name
Order
Citations
PageRank
Christoph Sawade1556.21
Niels Landwehr250631.54
Tobias Scheffer31862139.64