Title
Noisy Type Assertion Detection in Semantic Datasets
Abstract
Semantic datasets provide support to automate many tasks such as decision-making and question answering. However, their performance is always decreased by the noises in the datasets, among which, noisy type assertions play an important role. This problem has been mainly studied in the domain of data mining but not in the semantic web community. In this paper, we study the problem of noisy type assertion detection in semantic web datasets by making use of concept disjointness relationships hidden in the datasets. We transform noisy type assertion detection into multiclass classification of pairs of type assertions which type an individual to two potential disjoint concepts. The multiclass classification is solved by Adaboost with C4.5 as the base classifier. Furthermore, we propose instance-concept compatability metrics based on instance-instance relationships and instance-concept assertions. We evaluate the approach on both synthetic datasets and DBpedia. Our approach effectively detect noisy type assertions in DBpedia with a high precision of 95%.
Year
DOI
Venue
2014
10.1007/978-3-319-11964-9_24
Semantic Web Conference
Field
DocType
Volume
Data mining,Computer science,Semantic Web,Type inference,SPARQL,Artificial intelligence,Classifier (linguistics),Multiclass classification,Question answering,AdaBoost,Assertion,Database,Machine learning
Conference
8796
ISSN
Citations 
PageRank 
0302-9743
0
0.34
References 
Authors
18
3
Name
Order
Citations
PageRank
Man Zhu100.34
Zhiqiang Gao234939.84
Zhibin Quan3162.65