Title
Enhanced stream processing in a DBMS kernel
Abstract
Continuous query processing has emerged as a promising query processing paradigm with numerous applications. A recent development is the need to handle both streaming queries and typical one-time queries in the same application. For example, data warehousing can greatly benefit from the integration of stream semantics, i.e., online analysis of incoming data and combination with existing data. This is especially useful to provide low latency in data-intensive analysis in big data warehouses that are augmented with new data on a daily basis. However, state-of-the-art database technology cannot handle streams efficiently due to their "continuous" nature. At the same time, state-of-the-art stream technology is purely focused on stream applications. The research efforts are mostly geared towards the creation of specialized stream management systems built with a different philosophy than a DBMS. The drawback of this approach is the limited opportunities to exploit successful past data processing technology, e.g., query optimization techniques. For this new problem we need to combine the best of both worlds. Here we take a completely different route by designing a stream engine on top of an existing relational database kernel. This includes reuse of both its storage/execution engine and its optimizer infrastructure. The major challenge then becomes the efficient support for specialized stream features. This paper focuses on incremental window-based processing, arguably the most crucial streamspecific requirement. In order to maintain and reuse the generic storage and execution model of the DBMS, we elevate the problem at the query plan level. Proper optimizer rules, scheduling and intermediate result caching and reuse, allow us to modify the DBMS query plans for efficient incremental processing. We describe in detail the new approach and we demonstrate efficient performance even against specialized stream engines, especially when scalability becomes a crucial factor.
Year
DOI
Venue
2013
10.1145/2452376.2452435
EDBT
Keywords
Field
DocType
specialized stream feature,enhanced stream processing,specialized stream management system,data warehousing,state-of-the-art stream technology,stream application,stream semantics,incoming data,specialized stream engine,dbms kernel,stream engine,big data warehouse
Data warehouse,Query optimization,Data mining,Relational database,Reuse,Computer science,Sargable,Stream processing,Database,Query plan,Scalability
Conference
Citations 
PageRank 
References 
8
0.46
28
Authors
4
Name
Order
Citations
PageRank
Erietta Liarou118410.81
Stratos Idreos2107963.03
Stefan Manegold31656123.43
Martin L. Kersten43243509.01