Title
Active Estimation of F-Measures.
Abstract
We address the problem of estimating the F-measure of a given model as accurately as possible on a fixed labeling budget. This problem occurs whenever an estimate cannot be obtained from held-out training data; for instance, when data that have been used to train the model are held back for reasons of privacy or do not reflect the test distribution. In this case, new test instances have to be drawn and labeled at a cost. An active estimation procedure selects instances according to an instrumental sampling distribution. An analysis of the sources of estimation error leads to an optimal sampling distribution that minimizes estimator variance. We explore conditions under which active estimates of F-measures are more accurate than estimates based on instances sampled from the test distribution.
Year
Venue
Field
2010
NIPS
Sampling distribution,Training set,Data mining,Computer science,Estimator
DocType
Citations 
PageRank 
Conference
7
0.56
References 
Authors
6
3
Name
Order
Citations
PageRank
Christoph Sawade1556.21
Niels Landwehr250631.54
Tobias Scheffer31862139.64