Title
Analytical and Experimental Evaluation of Stream-based Join
Abstract
Continuous queries over data streams have gained popularity as the breadth of possible applications, ranging from network monitoring to online pattern discovery, have increased. Joining of streams is a fundamental issue that must be resolved to enable complex queries over multiple streams. However, as streams can represent potentially infinite data, it is infeasible to have full join evaluations as is the case with traditional databases. Joins in a stream environment are thus evaluated not over entire streams, but on specific windows defined on the streams. In this paper. we present windowed implementations of the traditional nested loops and hash join algorithms. In our work we analytically and experimentally evaluate the performance of these algorithms for different parameters. We find that, in general, a hash join provides better performance. We also investigate invalidation strategies to remove stale data from the window buffers. and propose an optimal strategy that balances processing time versus buffer size.
Year
DOI
Venue
2005
10.1007/978-1-4020-5347-4_9
ENTERPRISE INFORMATION SYSTEMS VII
Keywords
Field
DocType
data streams,continuous queries,join,main memory joins
Query optimization,Hash join,Data mining,Joins,Data stream mining,Computer science,Ranging,Network monitoring,Distributed computing,Nested loop join,Query plan
Conference
Citations 
PageRank 
References 
0
0.34
7
Authors
2
Name
Order
Citations
PageRank
Henry Kostowski100.34
Kajal T. Claypool258064.35