Adaptive Optimization of Very Large Join Queries. - Citegraph

Paper Info

Title
Adaptive Optimization of Very Large Join Queries.

Abstract
The use of business intelligence tools and other means to generate queries has led to great variety in the size of join queries. While most queries are reasonably small, join queries with up to a hundred relations are not that exotic anymore, and the distribution of query sizes has an incredible long tail. The largest real-world query that we are aware of accesses more than 4,000 relations. This large spread makes query optimization very challenging. Join ordering is known to be NP-hard, which means that we cannot hope to solve such large problems exactly. On the other hand most queries are much smaller, and there is no reason to sacrifice optimality there. This paper introduces an adaptive optimization framework that is able to solve most common join queries exactly, while simultaneously scaling to queries with thousands of joins. A key component there is a novel search space linearization technique that leads to near-optimal execution plans for large classes of queries. In addition, we describe implementation techniques that are necessary to scale join ordering algorithms to these extremely large queries. Extensive experiments with over 10 different approaches show that the new adaptive approach proposed here performs excellent over a huge spectrum of query sizes, and produces optimal or near-optimal solutions for most common queries.

Year	DOI	Venue
2018	10.1145/3183713.3183733	SIGMOD/PODS '18: International Conference on Management of Data Houston TX USA June, 2018
Field	DocType	ISSN
Query optimization,Data mining,Joins,Adaptive optimization,Computer science,Business intelligence,Linearization	Conference	0730-8078
ISBN	Citations	PageRank
978-1-4503-4703-7	2	0.36
References	Authors
16	2

Authors (2 rows)

Cited by (2 rows)

References (16 rows)

Name	Order	Citations	PageRank
Thomas Neumann	1	2523	156.50
Bernhard Radke	2	14	1.86

1