Title
Distribution Policies for Datalog.
Abstract
Modern data management systems extensively use parallelism to speed up query processing over massive volumes of data. This trend has inspired a rich line of research on how to formally reason about the parallel complexity of join computation. In this paper, we go beyond joins and study the parallel evaluation of recursive queries. We introduce a novel framework to reason about multi-round evaluation of Datalog programs, which combines implicit predicate restriction with distribution policies to allow expressing a combination of data-parallel and query-parallel evaluation strategies. Using our framework, we reason about key properties of distributed Datalog evaluation, including parallel-correctness of the evaluation strategy, disjointness of the computation effort, and bounds on the number of communication rounds.
Year
DOI
Venue
2018
10.1007/s00224-019-09959-3
ICDT
Keywords
Field
DocType
Datalog queries, Distributed evaluation, Distribution policies
Evaluation strategy,Joins,Computer science,Theoretical computer science,Predicate (grammar),Datalog,Data management,Recursion,Speedup,Computation
Conference
Volume
Issue
ISSN
64
5
1433-0490
Citations 
PageRank 
References 
0
0.34
0
Authors
3
Name
Order
Citations
PageRank
Bas Ketsman1325.21
Aws Albarghouthi225022.87
Paraschos Koutris334726.63