SPIN: Concurrent Workload Scaling over Data Warehouses - Citegraph

Paper Info

Title
SPIN: Concurrent Workload Scaling over Data Warehouses

Abstract
Increasingly, data warehouse DW analyses are being used not only for strategic business decisions but also as valuable tools in operational daily decisions. As a consequence, a large number of queries are concurrently submitted, stressing the database engine ability to handle such query workloads without severely degrading query response times. The query-at-time model of common database engines, where each query is independently executed and competes for the same resources, is inefficient for handling large DWs and does not provide the expected performance and scalability when processing large numbers of concurrent queries. However, the query workload, which is mainly composed of aggregation star queries, frequently has to process similar data and perform similar computations. While materialized views can help in this matter, their usefulness is limited to queries and query patterns that are known in advance. The reviewed proposals on data and processing sharing suffer from memory limitations, reduced scalability and unpredictable execution times when applied to large DWs, particularly those with large dimensions. We present SPIN, a data and processing sharing model that delivers predictable execution times to aggregated star-queries even in the presence of large concurrent query loads, without the memory and scalability limitations of existing approaches. We used the TPC-H benchmark to experimentally evaluate SPIN.

Year	DOI	Venue
2013	10.1007/978-3-642-40131-2_6	DaWaK
Field	DocType	Citations
Data warehouse,Query optimization,Data mining,Query expansion,Computer science,View,Web query classification,Spatial query,Online aggregation,Materialized view,Database	Conference	1
PageRank	References	Authors
0.38	5	2

Authors (2 rows)

Cited by (1 rows)

References (5 rows)

Name	Order	Citations	PageRank
João Pedro Costa	1	64	11.99
Pedro Furtado	2	204	55.67

1