Title
Preferences on a Budget: Prioritizing Document Pairs when Crowdsourcing Relevance Judgments
Abstract
ABSTRACT In Information Retrieval (IR) evaluation, preference judgments are collected by presenting to the assessors a pair of documents and asking them to select which of the two, if any, is the most relevant. This is an alternative to the classic relevance judgment approach, in which human assessors judge the relevance of a single document on a scale; such an alternative allows to make relative rather than absolute judgments of relevance. While preference judgments are easier for human assessors to perform, the number of possible document pairs to be judged is usually so high that it makes it unfeasible to judge them all. Thus, following a similar idea to pooling strategies for single document relevance judgments where the goal is to sample the most useful documents to be judged, in this work we focus on analyzing alternative ways to sample document pairs to judge, in order to maximize the value of a fixed number of preference judgments that can feasibly be collected. Such value is defined as how well we can evaluate IR systems given a budget, that is, a fixed number of human preference judgments that may be collected. By relying on several datasets featuring relevance judgments gathered by means of experts and crowdsourcing, we experimentally compare alternative strategies to select document pairs and show how different strategies lead to different IR evaluation result quality levels. Our results show that, by using the appropriate procedure, it is possible to achieve good IR evaluation results with a limited number of preference judgments, thus confirming the feasibility of using preference judgments to create IR evaluation collections.
Year
DOI
Venue
2022
10.1145/3485447.3511960
International World Wide Web Conference
Keywords
DocType
Citations 
Preference Judgments, Relevance Assessment, Crowdsourcing
Conference
1
PageRank 
References 
Authors
0.36
0
4
Name
Order
Citations
PageRank
Kevin Roitero13013.74
Alessandro Checco27812.85
Stefano Mizzaro3348.33
Gianluca Demartini4627.99