Title
Robust estimation of resource consumption for SQL queries using statistical techniques
Abstract
The ability to estimate resource consumption of SQL queries is crucial for a number of tasks in a database system such as admission control, query scheduling and costing during query optimization. Recent work has explored the use of statistical techniques for resource estimation in place of the manually constructed cost models used in query optimization. Such techniques, which require as training data examples of resource usage in queries, offer the promise of superior estimation accuracy since they can account for factors such as hardware characteristics of the system or bias in cardinality estimates. However, the proposed approaches lack robustness in that they do not generalize well to queries that are different from the training examples, resulting in significant estimation errors. Our approach aims to address this problem by combining knowledge of database query processing with statistical models. We model resource-usage at the level of individual operators, with different models and features for each operator type, and explicitly model the asymptotic behavior of each operator. This results in significantly better estimation accuracy and the ability to estimate resource usage of arbitrary plans, even when they are very different from the training instances. We validate our approach using various large scale real-life and benchmark workloads on Microsoft SQL Server.
Year
DOI
Venue
2012
10.14778/2350229.2350269
PVLDB
Keywords
DocType
Volume
database query processing,query scheduling,query optimization,significant estimation error,resource estimation,superior estimation accuracy,statistical technique,better estimation accuracy,sql query,robust estimation,resource usage,resource consumption,information retrieval,database system,robust estimator,statistical model
Journal
5
Issue
ISSN
Citations 
11
Proceedings of the VLDB Endowment (PVLDB), Vol. 5, No. 11, pp. 1555-1566 (2012)
29
PageRank 
References 
Authors
0.91
13
4
Name
Order
Citations
PageRank
Jiexing Li121110.36
Arnd Christian König2718121.72
Vivek Narasayya31642173.24
Surajit Chaudhuri499711373.71