Title
High throughput indexing for large-scale semantic web data
Abstract
Distributed RDF data management systems become increasingly important with the growth of the Semantic Web. Currently, several such systems have been proposed, however, their indexing methods meet performance bottlenecks either on data loading or querying when processing large amounts of data. In this work, we propose a high throughout index to enable rapid analysis of large datasets. We adopt a hybrid structure to combine the loading speed of similar-size based methods with the execution speed of graph-based approaches, using dynamic data repartitioning over query workloads. We introduce the design and detailed implementation of our method. Experimental results show that the proposed index can indeed vastly improve loading speeds while remaining competitive in terms of performance. Therefore, the method could be considered as a good choice for RDF analysis in large-scale distributed scenarios.
Year
DOI
Venue
2015
10.1145/2695664.2695920
SAC 2015: Symposium on Applied Computing Salamanca Spain April, 2015
Field
DocType
ISBN
Graph,Data mining,Computer science,Semantic Web,Search engine indexing,Folksonomy,Dynamic data,Throughput,Data management,RDF
Conference
978-1-4503-3196-8
Citations 
PageRank 
References 
2
0.36
21
Authors
4
Name
Order
Citations
PageRank
Long Cheng19116.99
Spyros Kotoulas259046.46
Tomas E. Ward310419.10
Georgios Theodoropoulos433231.39