Abstract | ||
---|---|---|
Conventional sequential pattern mining methods may meet inherent difficulties in mining databases with long sequences and noise. They may generate a huge number of short and trivial patterns but fail to find interesting patterns approximately shared by many sequences. In this paper, we propose the theme of approximate sequential pattern mining roughly defined as identifying patterns approximately shared by many sequences. We present an efficient and effective algorithm, ApproxMAP, to mine consensus patterns from large sequence databases in two steps. First, sequences are clustered by similarity. Then, consensus patterns are mined directly from each cluster through multiple alignment. We use a real case study to illustrate the effectiveness of ApproxMAP. |
Year | Venue | Keywords |
---|---|---|
2003 | SIAM Proceedings Series | multiple alignment,sequential pattern mining |
Field | DocType | Citations |
Pattern recognition,Computer science,Artificial intelligence,Multiple sequence alignment,Sequential Pattern Mining | Conference | 46 |
PageRank | References | Authors |
2.06 | 19 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Hye-Chung Kum | 1 | 114 | 12.99 |
Jian Pei | 2 | 19002 | 995.54 |
Wei Wang | 3 | 7122 | 746.33 |
Dean Duncan | 4 | 57 | 4.97 |