Abstract | ||
---|---|---|
Many important data management and analytics tasks cannot be completely addressed by automated processes. Crowdsourcing is an effective way to harness human cognitive abilities to process these computer-hard tasks, such as entity resolution, sentiment analysis, and image recognition. Crowdsourced data management has been extensively studied in research and industry recently. In this tutorial, we will survey and synthesize a wide spectrum of existing studies on crowdsourced data management. We first give an overview of crowdsourcing, and then summarize the fundamental techniques, including quality control, cost control, and latency control, which must be considered in crowdsourced data management. Next we review crowdsourced operators, including selection, collection, join, top-k, sort, categorize, aggregation, skyline, planning, schema matching, mining and spatial crowdsourcing. We also discuss crowdsourcing optimization techniques and systems. Finally, we provide the emerging challenges. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1145/3035918.3054776 | SIGMOD Conference |
Keywords | Field | DocType |
Crowdsourcing,Data Management,Crowdsourcing Optimization | Skyline,Data science,Data mining,Sentiment analysis,Computer science,Crowdsourcing,sort,Analytics,Schema matching,Data management,Crowdsourcing software development,Database | Conference |
Citations | PageRank | References |
16 | 0.58 | 90 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Guoliang Li | 1 | 3077 | 154.70 |
Yudian Zheng | 2 | 418 | 16.91 |
Ju Fan | 3 | 406 | 28.58 |
Jiannan Wang | 4 | 1109 | 45.38 |
Reynold Cheng | 5 | 3069 | 154.13 |