Title
A general streaming algorithm for pattern discovery.
Abstract
Abstract Discovering frequent patterns over event sequences is an important data mining problem. Existing methods typically require multiple passes over the data, rendering them unsuitable for streaming contexts. We present the first streaming algorithm for mining frequent patterns over a window of recent events in the stream. We derive approximation guarantees for our algorithm in terms of: (i) the separation of frequent patterns from the infrequent ones, and (ii) the rate of change of stream characteristics. Our parameterization of the problem provides a new sweet spot in the tradeoff between making distributional assumptions over the stream and algorithmic efficiencies of mining. We illustrate how this yields significant benefits when mining practical streams from neuroscience and telecommunications logs.
Year
DOI
Venue
2013
10.1007/s10115-013-0669-z
Knowl. Inf. Syst.
Keywords
Field
DocType
Event sequences,Data streams,Frequent patterns,Pattern discovery,Streaming algorithms,Approximation algorithms
Approximation algorithm,Data mining,Data stream mining,Streaming algorithm,Computer science,Theoretical computer science,Rendering (computer graphics)
Journal
Volume
Issue
ISSN
37
3
0219-3116
Citations 
PageRank 
References 
4
0.41
23
Authors
4
Name
Order
Citations
PageRank
Debprakash Patnaik119114.89
Srivatsan Laxman242121.65
Badrish Chandramouli352242.85
Naren Ramakrishnan41913176.25