Title
Comparing Global Optimization and Default Settings of Stream-Based Joins - (Experimental Paper)
Abstract
One problem in real-time data integration is joining a continuously arriving data stream with a disk-based relation. In this paper we investigate a stream-based join algorithm, mesh join (MESHJOIN), and focus on one of its critical components, the disk-buffer. In MESHJOIN the size of the disk-buffer varies with the total memory budget, and tuning is required to obtain the maximum service rate within the limited memory available. Until now there has been little data on how the position of the optimum value depends on the memory size, and no performance comparison has been carried out between the optimum and reasonable default sizes for the disk-buffer. To avoid tuning, we propose a reasonable default value for the disk-buffer size that incurs only a small, acceptable performance loss. The experimental results validate our arguments.
Year: 2009
DOI: 10.1007/978-3-642-14559-9_10
Venue: Lecture Notes in Business Information Processing
Keywords: ETL for real-time data warehouse, ETL optimization, Tuning and management of the real-time data warehouse, Performance and scalability, Stream-based join
Field: Data mining, Joins, Global optimization, Computer science, Database
DocType: Conference
Volume: 41
ISSN: 1865-1348
Citations: 0
PageRank: 0.34
References: 21
Authors: 3
Name | Order | Citations | PageRank
M. Asif Naeem | 1 | 102 | 19.73
Gill Dobbie | 2 | 728 | 77.75
Gerald Weber | 3 | 248 | 30.62