Title
Crowdclustering Items Into Overlapping Clusters
Abstract
Crowdclustering clusters data items in a crowdsourcing manner, which makes discovered item categories more consistent with human perception. However, due to diversity of crowdsourcing workers and fluctuation of the number of tasks assigned to each worker, inferring stable and reliable clusters is challenging. Moreover, an item may be associated with multiple attributes, and such items should be put into different clusters, which makes inferring accurate clusters more complicated. To address the challenges above, in this paper we present a robust and fast crowdclustering scheme for finding overlapping clusters of items. Distinguished from existing works, we extract reliable and stable cluster information from workers' answers by majority voting. We then formulate an optimization problem to find overlapping clusters, and develop a nonnegative matrix factorization based approach to approximate the optimal solution. Experiments show the robustness, accuracy and efficiency of our approach.
Year
DOI
Venue
2016
10.1109/ICC.2016.7511257
2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC)
Field
DocType
ISSN
Resource management,Data mining,Cluster (physics),Crowdsourcing,Computer science,Robustness (computer science),Redundancy (engineering),Non-negative matrix factorization,Majority rule,Optimization problem
Conference
1550-3607
Citations 
PageRank 
References 
0
0.34
7
Authors
6
Name
Order
Citations
PageRank
You Wu100.34
Xiong Wang2705.15
Zhe Yang3101.27
Xiaoying Gan434448.16
Xiaohua Tian556865.92
Xinbing Wang62642214.43