Title
Ran king the big s ky: efficient top- k s kyline computation on massive data
Abstract
In many applications, top-k skyline query is an important operation to return k skyline tuples with the highest domination scores in a potentially huge data space. It is analyzed that the existing algorithms cannot process top-k skyline query on massive data efficiently. In this paper, we propose a novel table-scan-based algorithm RSTS to compute top-k skyline results on massive data efficiently. RSTS first builds the presorted table, whose tuples are arranged in the order of round-robin retrieval on sorted column lists. RSTS consists of two phases. In phase 1, the candidate tuples are acquired by the sequential scan on the presorted table. In phase 2, RSTS calculates the domination scores of the candidates and returns query results by another sequential scan. It is proved that RSTS has the characteristic of early termination, along with the theoretical analysis of scan depths. The pruning rule for candidate tuples is devised in this paper. The theoretical pruning effect shows that majority of the skyline results can be discarded directly. The extensive experimental results, conducted on synthetic and real-life data sets, show that RSTS outperforms the existing algorithms significantly.
Year
DOI
Venue
2019
10.1007/s10115-018-1256-0
Knowledge and Information Systems
Keywords
Field
DocType
Massive data,Top-k skyline,RSTS algorithm,Table scan,Pruning operation
Skyline,Data mining,Data set,Data space,Ranking,Tuple,Computer science,Full table scan,Sky,Skyline computation
Journal
Volume
Issue
ISSN
60
1
0219-3116
Citations 
PageRank 
References 
0
0.34
22
Authors
4
Name
Order
Citations
PageRank
Xixian Han17810.45
Bailing Wang256.94
Jianzhong Li36324.23
Hong Gao41086120.07