Title
EPF: A General Framework for Supporting Continuous Top-k Queries Over Streaming Data
Abstract
Continuous top-k query over sliding window is a fundamental problem in the domain of streaming data management, which monitors the query window and retrieves k objects with the highest scores when the window slides. The key of supporting this query is maintaining a subset of objects in the window, and try to retrieve answers from them when the window slides. The state-of-the-art approach called SAP utilizes the partition technique to support top-k searches. Its key idea is using, as few as possible, high-quality candidates to support the query via finding a proper partition. However, it has to waste relatively high computation cost in evaluating whether the partition is proper and re-scanning the widow. In this paper, we propose an ELM-based framework named EPF, which improves SAP via learning the nature of streaming data. If we learn that the distribution of streaming data is predictable, we could construct a suitable prediction model for a more efficient partition of the window. Furthermore, we propose a novel algorithm to reduce the re-scanning cost. We conduct a thorough experimental study of this technique on real and synthetic datasets and show the significant performance improvement when applying the technique in existing algorithms.
Year
DOI
Venue
2020
10.1007/s12559-019-09661-z
Cognitive Computation
Keywords
Field
DocType
ELM stream classification top-k
Data mining,Sliding window protocol,Computer science,Artificial intelligence,Streaming data,Partition (number theory),Machine learning,Computation,Performance improvement
Journal
Volume
Issue
ISSN
12
1
1866-9956
Citations 
PageRank 
References 
0
0.34
0
Authors
3
Name
Order
Citations
PageRank
Hong Jiang189.37
Rui Zhu2213.96
Bin Wang300.34