Title
CrowdMatcher: crowd-assisted schema matching
Abstract
Schema matching is a central challenge for data integration systems. Due to the inherent uncertainty arose from the inability of schema in fully capturing the semantics of the represented data, automatic tools are often uncertain about suggested matching results. However, human is good at understanding data represented in various forms and crowdsourcing platforms are making the human annotation process more affordable. Thus in this demo, we will show how to utilize the crowd to find the right matching. In order to do that, we need to make the tasks posted on the crowdsouricng platforms extremely simple, to be performed by non-expert people, and reduce the number of tasks as less as possible to save the cost. We demonstrate CrowdMatcher, a hybrid machine-crowd system for schema matching. The machine-generated matchings are verified by correspondence correctness queries (CCQs), which is to ask the crowd to determine whether a given correspondence is correct or not. CrowdMatcher includes several original features: it integrates different matchings generated from classical schema matching tools; in order to minimize the cost of crowdsourcing, it automatically selects the most informative set of CCQs from the possible matchings; it is able to manage inaccurate answers provided by the workers; the crowdsourced answers are used to improve matching results.
Year
DOI
Venue
2014
10.1145/2588555.2594515
SIGMOD Conference
Keywords
Field
DocType
crowdsourcing,schema and subschema,schema matching
Data integration,Data mining,Annotation,Ask price,Crowdsourcing,Computer science,Correctness,Schema matching,Schema (psychology),Database,Semantics
Conference
Citations 
PageRank 
References 
5
0.43
7
Authors
5
Name
Order
Citations
PageRank
Chen Jason Zhang11618.28
Ziyuan Zhao2502.11
Lei Chen36239395.84
H. V. Jagadish4111412495.67
Caleb Chen Cao529212.15