Title
Noise Sensitivity of an Information Granules Filtering Procedure by Genetic Optimization for Inexact Sequential Pattern Mining
Abstract
One of the most essential challenges in Data Mining and Knowledge Discovery is the development of effective tools able to find regularities in data. In order to highlight and to extract interesting knowledge from the data at hand, a key problem is frequent pattern mining, i.e. to discover frequent substructures hidden in the available data. In many interesting application fields, data are often represented and stored as sequences over time or space of generic objects. Due to the presence of noise and uncertainties in data, searching for frequent subsequences must employ approximate matching techniques, such as edit distances. A common procedure to identify recurrent patterns in noisy data is based on clustering algorithms relying on some edit distance between subsequences. However, this plain approach can produce many spurious patterns due to multiple pattern matchings on close positions in the same sequence excerpt. In this paper, we present a method to overcome this drawback by applying an optimization-based step lter that identifies the most descriptive patterns among those found by the clustering process, and allows to return more compact and easily interpretable clusters. We evaluate the mining systems performances on synthetic data in two separate cases, corresponding respectively to two different (simulated) sources of noise. In both cases, our method performs well in retrieving the original patterns with acceptable information loss.
Year
DOI
Venue
2014
10.1007/978-3-319-26393-9_9
Studies in Computational Intelligence
Keywords
DocType
Volume
Granular modeling,Sequence data mining,Inexact sequence matching,Frequent subsequences extraction,Evolutionary computation
Conference
620
ISSN
Citations 
PageRank 
1860-949X
0
0.34
References 
Authors
0
4
Name
Order
Citations
PageRank
Enrico Maiorino1212.91
Francesca Possemato200.34
Valerio Modugno383.22
Antonello Rizzi436341.68