Title
SPAMS: A Novel Incremental Approach for Sequential Pattern Mining in Data Streams
Abstract
Mining sequential patterns in data streams is a new challenging problem for the datamining community since data arrives sequentially in the form of continuous rapid and infinite streams. In this paper, we propose a new on-line algorithm, SPAMS, to deal with the sequential patterns mining problem in data streams. This algorithm uses an automaton-based structure to maintain the set of frequent sequential patterns, i.e. SPA (Sequential Pattern Automaton). The sequential pattern automaton can be smaller than the set of frequent sequential patterns by two or more orders of magnitude, which allows us to overcome the problem of combinatorial explosion of sequential patterns. Current results can be output constantly on any user's specified thresholds. In addition, taking into account the characteristics of data streams, we propose a well-suited method said to be approximate since we can provide near optimal results with a high probability. Experimental studies show the relevance of the SPA data structure and the efficiency of the SPAMS algorithm on various datasets. Our contribution opens a promising gateway, by using an automaton as a data structure for mining frequent sequential patterns in data streams.
Year
DOI
Venue
2009
10.1007/978-3-642-00580-0_12
ADVANCES IN KNOWLEDGE DISCOVERY AND MANAGEMENT
Keywords
Field
DocType
Algorithm,Data Stream,Sequential Pattern,Automata
Data mining,Data structure,Data stream mining,Pattern recognition,Data stream,Computer science,Automaton,Default gateway,Artificial intelligence,STREAMS,Sequential Pattern Mining,Combinatorial explosion
Conference
Volume
ISSN
Citations 
292
1860-949X
3
PageRank 
References 
Authors
0.44
14
4
Name
Order
Citations
PageRank
Lionel Vinceslas151.86
Jean-Emile Symphor2235.08
Alban Mancheron3143.81
Pascal Poncelet4768126.47