Title
Uncovering the Latent Structures of Crowd Labeling.
Abstract
Crowdsourcing provides a new way to distribute enormous tasks to a crowd of annotators. The divergent knowledge background and personal preferences of crowd annotators lead to noisy (or even inconsistent) answers to a same question. However, diverse labels provide us information about the underlying structures of tasks and annotators. This paper proposes latent-class assumptions for learning-from-crowds models, that is, items can be separated into several latent classes and workers' annotating behaviors may differ among different classes. We propose a nonparametric model to uncover the latent classes, and also extend the state-of-the-art minimax entropy estimator to learn latent structures. Experimental results on both synthetic data and real data collected from Amazon Mechanical Turk demonstrate our methods can disclose interesting and meaningful latent structures, and incorporating latent class structures can also bring significant improvements on ground truth label recovery for difficult tasks.
Year
DOI
Venue
2015
10.1007/978-3-319-18038-0_31
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART I
Field
DocType
Volume
Dirichlet process,Minimax,Confusion matrix,Computer science,Crowdsourcing,Synthetic data,Ground truth,Artificial intelligence,Majority rule,Machine learning,Estimator
Conference
9077
ISSN
Citations 
PageRank 
0302-9743
7
0.51
References 
Authors
4
2
Name
Order
Citations
PageRank
Tian Tian1784.24
Jun Zhu21926154.82