Title | ||
---|---|---|
G-CARE: A Framework for Performance Benchmarking of Cardinality Estimation Techniques for Subgraph Matching |
Abstract | ||
---|---|---|
Despite the crucial role of cardinality estimation in query optimization, there has been no systematic and in-depth study of the existing cardinality estimation techniques for subgraph matching queries. In this paper, for the first time, we present a comprehensive study of the existing cardinality estimation techniques for subgraph matching queries, scaling far beyond the original experiments. We first introduce a novel framework called g-care that enables us to realize all existing techniques on top of it and that provides insights on their performance. By using g-care, we then reimplement representative cardinality estimation techniques for graph databases as well as relational databases. We next evaluate these techniques w.r.t accuracy on rdf and non-rdf graphs from different domains with subgraph matching queries of various topologies so far considered. Surprisingly, our results reveal that all existing techniques have serious problems in accuracy for various scenarios and datasets. Intriguingly, a simple sampling method based on an online aggregation technique designed for relational data, consistently outperforms all existing techniques.
|
Year | DOI | Venue |
---|---|---|
2020 | 10.1145/3318464.3389702 | SIGMOD/PODS '20: International Conference on Management of Data
Portland
OR
USA
June, 2020 |
DocType | ISBN | Citations |
Conference | 978-1-4503-6735-6 | 1 |
PageRank | References | Authors |
0.35 | 0 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yeonsu Park | 1 | 1 | 0.35 |
Seongyun Ko | 2 | 24 | 1.66 |
Sourav S. Bhowmick | 3 | 1519 | 272.35 |
Kyoungmin Kim | 4 | 6 | 2.10 |
Kijae Hong | 5 | 1 | 0.35 |
Wook-Shin Han | 6 | 805 | 57.85 |