Learning Semantic Models of Data Sources Using Probabilistic Graphical Models - Citegraph

Paper Info

Title
Learning Semantic Models of Data Sources Using Probabilistic Graphical Models

Abstract
A semantic model of a data source is a representation of the concepts and relationships contained in the data. Building semantic models is a prerequisite to automatically publishing data to a knowledge graph. However, creating these semantic models is a complex process requiring considerable manual effort and can be error-prone. In this paper, we present a novel approach that efficiently searches over the combinatorial space of possible semantic models, and applies a probabilistic graphical model to identify the most probable semantic model for a data source. Probabilistic graphical models offer many advantages over existing methods: they are robust to noisy inputs and provide a straightforward approach for exploiting relationships within the data. Our solution uses a conditional random field (CRF) to encode structural patterns and enforce conceptual consistency within the semantic model. In an empirical evaluation, our approach outperforms state of the art systems by an average 8.4% of F1 score, even with noisy input data.

Year	DOI	Venue
2019	10.1145/3308558.3313711	WWW '19: The Web Conference on The World Wide Web Conference WWW 2019
Keywords	Field	DocType
Semantic models, knowledge graph, linked data, ontology, probabilistic graphical models, semantic web	Conditional random field,F1 score,Ontology,Data mining,Computer science,Semantic Web,Linked data,Artificial intelligence,Graphical model,Probabilistic logic,Machine learning,Semantic data model	Conference
ISBN	Citations	PageRank
978-1-4503-6674-8	1	0.36
References	Authors
0	3

Authors (3 rows)

Cited by (1 rows)

References (0 rows)

Name	Order	Citations	PageRank
Binh Vu	1	4	4.50
Craig A. Knoblock	2	5229	680.57
Jay Pujara	3	86	14.81

1