Abstract | ||
---|---|---|
Semantic datasets provide support to automate many tasks such as decision-making and question answering. However, their performance is always decreased by the noises in the datasets, among which, noisy type assertions play an important role. This problem has been mainly studied in the domain of data mining but not in the semantic web community. In this paper, we study the problem of noisy type assertion detection in semantic web datasets by making use of concept disjointness relationships hidden in the datasets. We transform noisy type assertion detection into multiclass classification of pairs of type assertions which type an individual to two potential disjoint concepts. The multiclass classification is solved by Adaboost with C4.5 as the base classifier. Furthermore, we propose instance-concept compatability metrics based on instance-instance relationships and instance-concept assertions. We evaluate the approach on both synthetic datasets and DBpedia. Our approach effectively detect noisy type assertions in DBpedia with a high precision of 95%. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1007/978-3-319-11964-9_24 | Semantic Web Conference |
Field | DocType | Volume |
Data mining,Computer science,Semantic Web,Type inference,SPARQL,Artificial intelligence,Classifier (linguistics),Multiclass classification,Question answering,AdaBoost,Assertion,Database,Machine learning | Conference | 8796 |
ISSN | Citations | PageRank |
0302-9743 | 0 | 0.34 |
References | Authors | |
18 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Man Zhu | 1 | 0 | 0.34 |
Zhiqiang Gao | 2 | 349 | 39.84 |
Zhibin Quan | 3 | 16 | 2.65 |