Title
Knowledge Base Semantic Integration Using Crowdsourcing.
Abstract
The semantic web has enabled the creation of a growing number of knowledge bases (KBs), which are designed independently using different techniques. Integration of KBs has attracted much attention as different KBs usually contain overlapping and complementary information. Automatic techniques for KB integration have been improved but far from perfect. Therefore, in this paper, we study the problem of knowledge base semantic integration using crowd intelligence. There are both classes and instances in a KB, in our work, we propose a novel hybrid framework for KB semantic integration considering the semantic heterogeneity of KB class structures. We first perform semantic integration of the class structures via crowdsourcing, then apply the blocking-based instance matching approach according to the integrated class structure. For class structure (taxonomy) semantic integration, the crowd is leveraged to help identifying the semantic relationships between classes to handle the semantic heterogeneity problem. Under the conditions of both large scale KBs and limited monetary budget for crowdsourcing, we formalize the class structure (taxonomy) semantic integration problem as a Local Tree Based Query Selection (LTQS) problem. We show that the LTQS problem is NP-hard and propose two greedy-based algorithms, i.e., static query selection and adaptive query selection. Furthermore, the KBs are usually of large scales and have millions of instances, direct pairwise-based instance matching is inefficient. Therefore, we adopt the blocking-based strategy for instance matching, taking advantage of the class structure (taxonomy) integration result. The experiments on real large scale KBs verify the effectiveness and efficiency of the proposed approaches.
Year
DOI
Venue
2017
10.1109/TKDE.2017.2656086
IEEE Trans. Knowl. Data Eng.
Keywords
Field
DocType
Taxonomy,Semantics,Knowledge based systems,Ontologies,Crowdsourcing,Data integration,Computer science
Data integration,Ontology-based data integration,Data mining,Semantic integration,Crowdsourcing,Computer science,Knowledge-based systems,Semantic Web,Artificial intelligence,Semantic heterogeneity,Semantic computing,Machine learning
Journal
Volume
Issue
ISSN
29
5
1041-4347
Citations 
PageRank 
References 
4
0.40
32
Authors
4
Name
Order
Citations
PageRank
Rui Meng1483.35
Lei Chen26239395.84
Yongxin Tong3109556.54
Chen Jason Zhang41618.28