Title
Exploring Importance Measures for Summarizing RDF/S KBs.
Abstract
Given the explosive growth in the size and the complexity of the DataWeb, there is now more than ever, an increasing need to develop methods and tools in order to facilitate the understanding and exploration of RDF/S Knowledge Bases (KBs). To this direction, summarization approaches try to produce an abridged version of the original data source, highlighting the most representative concepts. Central questions to summarization are: how to identify the most important nodes and then how to link them in order to produce a valid sub-schema graph. In this paper, we try to answer the first question by revisiting six well-known measures from graph theory and adapting them for RDF/S KBs. Then, we proceed further to model the problem of linking those nodes as a graph Steiner-Tree problem (GSTP) employing approximations and heuristics to speed up the execution of the respective algorithms. The performed experiments show the added value of our approach since (a) our adaptations outperform current state of the art measures for selecting the most important nodes and (b) the constructed summary has a better quality in terms of the additional nodes introduced to the generated summary.
Year
DOI
Venue
2017
10.1007/978-3-319-58068-5_24
Lecture Notes in Computer Science
Keywords
Field
DocType
Semantic summaries,Schema summary,RDF/S Knowledge Bases,Graph theory
Graph theory,Automatic summarization,Graph,Information retrieval,Computer science,Data Web,Added value,Heuristics,RDF,Speedup
Conference
Volume
ISSN
Citations 
10249
0302-9743
4
PageRank 
References 
Authors
0.39
17
5
Name
Order
Citations
PageRank
Alexandros Pappas140.39
Georgia Troullinou2526.72
Giannis Roussakis340.39
Haridimos Kondylakis432536.63
Dimitris Plexousakis52586326.38