Title
Estimation of the Number of Distinct Values over Data Stream Based on Compound Sliding Window.
Abstract
Estimating the number of distinct values in a data stream is a vital problem with many applications such as complex join query over multiple data streams. In this paper, we focus on the continuous and periodic distinct values estimation over sliding windows. We propose a compound sliding window model to compute the distinct values over basic sliding windows in an incremental way. LDV, HDV and AHDV are the three algorithms that are based on compound sliding windows. The basic idea behind the compound sliding windows is to organize the basic windows into a Hash table according to distinct values. Whenever a new data arrives at the data stream, it is inserted into a basic window. Once the basic window is full, a scan using distinct values is executed and the distinct values number is updated incrementally. Theoretical analysis and experiment results show that the distinct values estimation algorithms based on compound sliding windows have a great performance benefits. © 2013 ACADEMY PUBLISHER.
Year
DOI
Venue
2013
10.4304/jsw.8.1.19-24
JSW
Keywords
Field
DocType
basic window,compound sliding window,data stream,distinct values estimation
Multiple data,Sliding window protocol,Data stream,Computer science,Algorithm,Periodic graph (geometry),Hash table
Journal
Volume
Issue
Citations 
8
1
1
PageRank 
References 
Authors
0.37
2
4
Name
Order
Citations
PageRank
Yingli Zhong131.75
Jinghua Zhu22311.29
Meirui Ren3217.30
Yan Yang41911.40