Title
Generative Datalog with Continuous Distributions
Abstract
Arguing for the need to combine declarative and probabilistic programming, Bárány et al. (TODS 2017) recently introduced a probabilistic extension of Datalog as a "purely declarative probabilistic programming language." We revisit this language and propose a more foundational approach towards defining its semantics. It is based on standard notions from probability theory known as stochastic kernels and Markov processes. This allows us to extend the semantics to continuous probability distributions, thereby settling an open problem posed by Bárány et al. We show that our semantics is fairly robust, allowing both parallel execution and arbitrary chase orders when evaluating a program. We cast our semantics in the framework of infinite probabilistic databases (Grohe and Lindner, ICDT 2020), and we show that the semantics remains meaningful even when the input of a probabilistic Datalog program is an arbitrary probabilistic database.
Year
DOI
Venue
2020
10.1145/3375395.3387659
SIGMOD/PODS '20: International Conference on Management of Data Portland OR USA June, 2020
Keywords
DocType
ISBN
Datalog, Probabilistic Databases, Generative Datalog, Measure Theory, Stochastic Kernels, Probabilistic Programming
Conference
978-1-4503-7108-7
Citations 
PageRank 
References 
1
0.37
7
Authors
4
Name
Order
Citations
PageRank
Martin Grohe12280127.40
Benjamin Lucien Kaminski212610.46
Joost-Pieter Katoen34444289.65
Lindner Peter410.37