On aggregating labels from multiple crowd workers to infer relevance of documents - Citegraph

Paper Info

Title
On aggregating labels from multiple crowd workers to infer relevance of documents

Abstract
We consider the problem of acquiring relevance judgements for information retrieval (IR) test collections through crowdsourcing when no true relevance labels are available. We collect multiple, possibly noisy relevance labels per document from workers of unknown labelling accuracy. We use these labels to infer the document relevance based on two methods. The first method is the commonly used majority voting (MV) which determines the document relevance based on the label that received the most votes, treating all the workers equally. The second is a probabilistic model that concurrently estimates the document relevance and the workers accuracy using expectation maximization (EM). We run simulations and conduct experiments with crowdsourced relevance labels from the INEX 2010 Book Search track to investigate the accuracy and robustness of the relevance assessments to the noisy labels. We observe the effect of the derived relevance judgments on the ranking of the search systems. Our experimental results show that the EM method outperforms the MV method in the accuracy of relevance assessments and IR systems ranking. The performance improvements are especially noticeable when the number of labels per document is small and the labels are of varied quality.

Year	DOI	Venue
2012	10.1007/978-3-642-28997-2_16	ECIR
Keywords	Field	DocType
relevance judgement,multiple crowd worker,aggregating label,em method,true relevance label,crowdsourced relevance label,noisy relevance label,document relevance,unknown labelling accuracy,workers accuracy,relevance assessment,relevance judgment,expectation maximization,majority voting,probabilistic model,information retrieval	Data mining,Information retrieval,Ranking,Expectation–maximization algorithm,Crowdsourcing,Computer science,Book search,Robustness (computer science),Statistical model,Artificial intelligence,Majority rule,Machine learning	Conference
Citations	PageRank	References
30	1.33	15
Authors
5

Authors (5 rows)

Cited by (30 rows)

References (15 rows)

Name	Order	Citations	PageRank
Mehdi Hosseini	1	54	3.77
Ingemar Cox	2	3652	795.60
Nataša Milić-Frayling	3	59	3.26
Gabriella Kazai	4	1151	97.35
Vishwa Vinay	5	245	15.94

1