Title
MJoin: a metadata-aware stream join operator
Abstract
Join algorithms must be re-designed when processing stream data instead of persistently stored data. Data streams are potentially infinite and the query result is expected to be generated incrementally instead of once only. Data arrival patterns are often unpredictable and the statistics of the data and other relevant metadata often are only known at runtime. In some cases they are supplied interleaved with the actual data in the form of stream markers. Recently, stream join algorithms, like Symmetric Hash Join and XJoin, have been designed to perform in a pipelined fashion to cope with the latent delivery of data. However, none of them to date takes metadata, especially runtime metadata, into consideration. Hence, the join execution logic defined statically before runtime may not be well suited to deal with varying types of dynamic runtime scenarios. Also the potentially unbounded state needs to be maintained by the join operator to guarantee the precision of the result. In this paper, we propose a metadata-aware stream join operator called MJoin which is able to exploit metadata to (1) detect and purge useless materialized data to save computation resources and (2) optimize the execution logic to target diferent optimization goals. We have implemented the MJoin operator. The experimental results validate our metadata-driven join optimization strategies.
Year
DOI
Venue
2003
10.1145/966618.966639
DEBS
Keywords
Field
DocType
actual data,metadata-aware stream,dynamic runtime scenario,data arrival pattern,stream marker,processing stream data,relevant metadata,runtime metadata,data stream,execution logic,constraint,metadata,optimization
Hash join,Metadata,Data stream mining,Recursive join,Computer science,Real-time computing,Sort-merge join,Exploit,Operator (computer programming),Distributed computing,Computation
Conference
ISBN
Citations 
PageRank 
1-58113-843-1
11
0.70
References 
Authors
9
3
Name
Order
Citations
PageRank
Luping Ding124518.86
Elke A. Rundensteiner24076700.65
George T. Heineman347665.84