Title
High-Performance, Distributed Dictionary Encoding of RDF Datasets
Abstract
In this work we propose a novel approach for RDF (Resource Description Framework) dictionary encoding that employs a parallel RDF parser and a distributed dictionary data structure, exploiting RDF-specific optimizations. In contrast with previous solutions, this approach exploits the Partitioned Global Address Space (PGAS) programming model combined with active messages. We evaluate the performance of our dictionary encoder in our RDF database, GEMS (Graph Engine for Multithreaded Systems), and provide an empirical comparison against previous approaches. Our comparison shows that our dictionary encoder scales significantly better and achieves higher performance than the current state of the art, providing a key element for the realization of a more efficient RDF database.
Year
DOI
Venue
2015
10.1109/CLUSTER.2015.44
Cluster Computing
Keywords
Field
DocType
RDF,GEMS,dictionary encoding,multithreading,mapreduce,MPI
Data structure,Programming paradigm,Computer science,Parallel computing,Theoretical computer science,Encoder,Parsing,Partitioned global address space,RDF,Benchmark (computing),Encoding (memory)
Conference
ISSN
Citations 
PageRank 
1552-5244
0
0.34
References 
Authors
7
7
Name
Order
Citations
PageRank
Alessandro Morari1313.46
Jesse Weaver21057.63
Oreste Villa349733.54
David J. Haglin411219.45
Antonino Tumeo535644.70
Vito Giovanni Castellana6329.96
John Feo717721.16