Title
Parallelizing Probabilistic Streaming Skyline Operator in Cloud Computing Environments
Abstract
The skyline query processing over uncertain data streams has received considerable attention, due to its importance in helping users make intelligent decisions over complex data. Nevertheless, existing studies only focus on retrieving the skylines over data streams in a centralized environment typically with one processor, which limits the scalability of algorithms and cannot meet the requirement for massive data analysis. The emerging cloud computing environment provides much more reliable and stable environments than the traditional distributed environments, which can be well adapted to the massive data management and complex queries. Unfortunately, existing parallel frameworks in clouds such as MapReduce and its variants are not suitable for the skyline queries over uncertain data streams. In this paper, we propose a general framework for parallelizing the probabilistic streaming skyline operator with the sliding window partitioning. Particularly, we propose four items mapping strategies CMS, AMS, DMS and APS to optimize the queries based on the proposed parallel framework. Extensive experiments with real deployment are conducted to demonstrate the effectiveness and efficiency of the proposals.
Year
DOI
Venue
2013
10.1109/COMPSAC.2013.15
COMPSAC
Keywords
Field
DocType
sliding window partitioning,parallel processing,massive data management,cloud computing environments,ams,uncertain data,skyline operator,skyline query,uncertain data stream,data streams,query optimization,aps,parallelizing probabilistic streaming skyline,centralized environment,skyline query processing,cms,massive data analysis,cloud computing,dms,complex data,probabilistic streaming skyline operator parallelization,uncertain data streams,data stream,cloud computing environment,query processing,mapping strategies,distributed databases,probabilistic logic
Skyline,Data mining,Data stream mining,Computer science,Uncertain data,Probabilistic logic,Distributed database,Data management,Cloud computing,Scalability,Distributed computing
Conference
ISSN
Citations 
PageRank 
0730-3157
0
0.34
References 
Authors
19
5
Name
Order
Citations
PageRank
Xiaoyong Li1193.76
Yijie Wang223942.22
Xiaoling Li3808.02
Yuan Wang4634.31
Rubing Huang510419.73