Title
Probabilistic Skyline Computation on Vertically Distributed Uncertain Data
Abstract
The skyline query is important in database community. Recently, owing to the inherent uncertainty of some applications, skyline query on uncertain data has been widelystudied using probabilistic model, e.g. p-skyline. In the scenario where uncertain data is vertically distributed among multiple servers, the main purpose of p-skyline computation is to minimize the retrieved records from servers to the local client due to the dominance factor of expensive network communication. In this paper, we present three communication-efficient p-skyline algorithms ASR, IASR and FSLR on vertically distributed uncertain data. ASR alternates sorted and random accesses to retrieve the records at servers and performs retrieving-boundingchecking iteration until all the objects can be determined whether they are in the p-skyline result or not. The communication of the instances not retrieved can be saved. IASR is an improved version of ASR. By examining the net gain of retrieving-boundingchecking iteration, IASR early terminates the iteration to further reduce the cost of communication. Compared to ASR and IASR, FSLR performs random accesses only on demand. FSLR first conducts sorted accesses to get loose upper bounds of skyline probabilities of the instances. Then, FSLR uses random accesses to complement a part of retrieved instances to get tighter upper and lower bounds of skyline probabilities until the p-skyline result is computed. Our experimental results demonstrate that our algorithms ASR, IASR and FSLR significantly outperform the intuitive method for p-skyline computation on vertically distributed uncertain data.
Year
DOI
Venue
2019
10.1109/ICDCS.2019.00024
2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS)
Keywords
Field
DocType
probabilistic skyline,vertical distribution,uncertain data
Skyline,Data mining,Upper and lower bounds,Computer science,Server,Uncertain data,Statistical model,Probabilistic logic,Distributed database,Distributed computing,Computation
Conference
ISSN
ISBN
Citations 
1063-6927
978-1-7281-2520-6
0
PageRank 
References 
Authors
0.34
0
4
Name
Order
Citations
PageRank
Kaiqi Zhang100.68
Jinbao Wang214211.58
Muxian Wang300.68
Xixian Han47810.45