Title
Approximating Frequent Items in Asynchronous Data Stream over a Sliding Window
Abstract
In an asynchronous data stream, the data items may be out of order with respect to their original timestamps. This paper gives a space-efficient data structure to maintain such a data stream so that it can approximate the frequent item set over a sliding time window with sufficient accuracy. Prior to our work, Cormode et al. [3] have the best solution, with space complexity O(\frac1e</font > logW log(\frace</font >BlogW) min{logW, \frac1e</font >}logU)O(\frac{1}{\varepsilon} \log W \log (\frac{\varepsilon B}{\log W}) \min\{\log W, \frac{1}{\varepsilon}\}\log U), where ε is the given error bound, W and B are parameters of the sliding window, and U is the number of all possible item names. Our solution reduces the space to O(\frac1e</font > logW log(\frace</font >BlogW))O(\frac{1}{\varepsilon} \log W \log (\frac{\varepsilon B}{\log W})). We also unify the study of synchronous and asynchronous data stream by quantifying the delay of the data items. When the delay is zero, our solution matches the space complexity of the best solution to the synchronous data streams [8].
Year
DOI
Venue
2009
10.1007/978-3-642-12450-1_5
Workshop on Approximation and Online Algorithms
Keywords
Field
DocType
space-efficient data structure,frequent item,synchronous data stream,data item,best solution,asynchronous data stream,possible item name,varepsilon b,data stream,space complexity,out of order,data structure,sliding window
Asynchronous communication,Data structure,Combinatorics,Data stream mining,Sliding window protocol,Data stream,Computer science,Out-of-order execution
Conference
Volume
ISSN
ISBN
5893
0302-9743
3-642-12449-6
Citations 
PageRank 
References 
1
0.35
15
Authors
4
Name
Order
Citations
PageRank
Ho-Leung Chan147125.23
Tak-Wah Lam21860164.96
Lap-Kei Lee340621.59
Hing-Fung Ting41078.17