Title
Neighbor-based pattern detection for windows over streaming data
Abstract
The discovery of complex patterns such as clusters, outliers, and associations from huge volumes of streaming data has been recognized as critical for many domains. However, pattern detection with sliding window semantics, as required by applications ranging from stock market analysis to moving object tracking remains largely unexplored. Applying static pattern detection algorithms from scratch to every window is prohibitively expensive due to their high algorithmic complexity. This work tackles this problem by developing the first solution for incremental detection of neighbor-based patterns specific to sliding window scenarios. The specific pattern types covered in this work include density-based clusters and distance-based outliers. Incremental pattern computation in highly dynamic streaming environments is challenging, because purging a large amount of to-be-expired data from previously formed patterns may cause complex pattern changes including migration, splitting, merging and termination of these patterns. Previous incremental neighbor-based pattern detection algorithms, which were typically not designed to handle sliding windows, such as incremental DBSCAN, are not able to solve this problem efficiently in terms of both CPU and memory consumption. To overcome this, we exploit the "predictability" property of sliding windows to elegantly discount the effect of expiring objects on the remaining pattern structures. Our solution achieves minimal CPU utilization, while still keeping the memory utilization linear in the number of objects in the window. Our comprehensive experimental study, using both synthetic as well as real data from domains of stock trades and moving object monitoring, demonstrates superiority of our proposed strategies over alternate methods in both CPU and memory utilization.
Year
DOI
Venue
2009
10.1145/1516360.1516422
EDBT
Keywords
Field
DocType
specific pattern type,static pattern detection algorithm,memory utilization,remaining pattern structure,incremental pattern computation,neighbor-based pattern,previous incremental neighbor-based pattern,complex pattern change,complex pattern,neighbor-based pattern detection,pattern detection,sliding window
Data mining,Sliding window protocol,Computer science,CPU time,Outlier,Exploit,Video tracking,Ranging,DBSCAN,Database,Computation
Conference
Citations 
PageRank 
References 
37
1.46
19
Authors
3
Name
Order
Citations
PageRank
Di Yang11529.72
Elke A. Rundensteiner24076700.65
Matthew O. Ward31757189.48