Title
Sparqling Kleene: Fast Property Paths in RDF-3X
Abstract
As Semantic Web efforts continue to gather steam, RDF engines are faced with graphs with millions of nodes and billions of edges. While much recent work addressing the resulting scalability issues in processing queries over these datasets has mainly considered SPARQL 1.0, the next-generation query language recommendations have proposed the addition of regular-expression-restricted navigation queries to SPARQL. We address the problem of supporting efficient processing of property paths in RDF-3X -- a high-performance RDF engine. In this paper, we limit our attention to a restricted definition of property paths that is not only tractable but also most commonly used -- instead of enumerating all paths that satisfy the given query, we focus on regular-expression-based reachability queries. Based on this, we make the following three major technical contributions: first, we present a detailed account of integrating the recently proposed, highly compact reachability index called FERRARI into the RDF-3X engine to support property path evaluation; second, we show how property path queries can be efficiently answered using multiple instances of this index -- one instance for each distinct label in the graph; and finally, we develop a set of queries over real-world RDF data that can serve as a benchmark set for evaluating the efficiency of property path queries. Our experimental results over Yago2, a large RDF-based knowledge base, show that our proposed approach is highly scalable and flexible.
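The idea of answering a path query such as `?x label+ ?y` as a reachability test on the subgraph of a single label can be sketched as follows. This is only an illustration under stated assumptions: it uses a plain breadth-first search in place of the FERRARI index, and the function names (`build_label_graphs`, `reachable`) and the triple layout are hypothetical, not taken from RDF-3X.

```python
from collections import defaultdict, deque


def build_label_graphs(triples):
    """Group edges by predicate, giving one adjacency map per distinct label.

    Assumption: triples are (subject, predicate, object) tuples of hashable values.
    """
    graphs = defaultdict(lambda: defaultdict(set))
    for subject, predicate, obj in triples:
        graphs[predicate][subject].add(obj)
    return graphs


def reachable(graphs, source, label, target):
    """Return True if target is reachable from source via one or more `label` edges.

    Plain BFS stands in for a reachability index here; a real index would
    answer the same question without traversing the graph at query time.
    """
    adjacency = graphs.get(label, {})
    seen = set()
    queue = deque(adjacency.get(source, ()))  # start from direct successors: path length >= 1
    while queue:
        node = queue.popleft()
        if node == target:
            return True
        if node in seen:
            continue
        seen.add(node)
        queue.extend(adjacency.get(node, ()))
    return False


triples = [
    ("a", "locatedIn", "b"),
    ("b", "locatedIn", "c"),
    ("c", "partOf", "d"),
]
graphs = build_label_graphs(triples)
```

With these sample triples, `reachable(graphs, "a", "locatedIn", "c")` holds because two `locatedIn` edges connect `a` to `c`, while `d` is not reachable from `a` over `locatedIn` alone, since the last hop carries a different label.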
Year: 2013
DOI: 10.1145/2484425.2484443
Venue: GRADES
Keywords: regular expression, rdf engine, high-performance rdf engine, property path, benchmark set, fast property path, property path evaluation, rdf-3x engine, sparqling kleene, property path query, real-world rdf data, rdf, reachability
Field: Regular expression, RDF query language, Query language, Computer science, Semantic Web, SPARQL, Reachability, Theoretical computer science, RDF Schema, RDF
DocType: Conference
Citations: 3
PageRank: 0.42
References: 6
Authors: 3
Name: Andrey Gubichev; Order: 1; Citations: 263; PageRank: 11.88
Name: Srikanta Bedathur; Order: 2; Citations: 607; PageRank: 43.23
Name: Stephan Seufert; Order: 3; Citations: 279; PageRank: 10.69