Title
Resource optimization for processing of stream data in data warehouse environment
Abstract
To fulfill the increasing demand of business for the latest information, current data integration approaches are moving towards real-time updates. In the case of real-time data integration the updates occurring on the source systems need to be reflected in the data warehouse immediately. One important element in real-time data integration is the join of a continuous incoming data stream with a disk-based master data. In this context a stream-based algorithm called X-HYBRIDJOIN (Extended Hybrid Join) has been proposed earlier, with a favorable asymptotic runtime behavior. However, the absolute performance was not as good as hoped for. In this paper we present results showing that through properly tuning the algorithm, the resulting Tuned X-HYBRIDJOIN performs significantly better than that of the previous X-HYBRIDJOIN, and better as other applicable join operators found in literature. We present the tuning approach, based on measurement techniques and a revised cost model. To evaluate the algorithm's performance we conduct an experimental study that shows that Tuned X-HYBRIDJOIN exhibits the desired performance characteristics.
Year
DOI
Venue
2012
10.1145/2345396.2345407
ICACCI
Keywords
Field
DocType
resource optimization,data warehouse,continuous incoming data stream,previous x-hybridjoin,stream data,absolute performance,real-time data integration,tuned x-hybridjoin,current data integration approach,performance characteristic,data warehouse environment,real-time updates,disk-based master data,real time,analysis,data management,real time data,data integrity
Data warehouse,Data integration,Data stream mining,Data stream clustering,Data stream,Computer science,Master data,Control engineering,Real-time computing,Operator (computer programming),Computer engineering,Data management
Conference
Citations 
PageRank 
References 
0
0.34
13
Authors
4
Name
Order
Citations
PageRank
M. Asif Naeem110219.73
Gill Dobbie272877.75
Imran Sarwar Bajwa38722.31
Gerald Weber424830.62