Title
Analyzing Similarities of Datasets Using a Pattern Set Kernel.
Abstract
In the area of pattern discovery, there is much interest in discovering small sets of patterns that characterize the data well. In such scenarios, when data is represented by a small set of characterizing patterns, an interesting problem is the comparison of datasets, by comparing the respective representative sets of patterns. In this paper, we propose a novel kernel function for measuring similarities between two sets of patterns, which is based on evaluating the structural similarities between the patterns in the two sets, weighted using their relative frequencies in the data. We define the kernel for injective serial episodes and itemsets. We also present an efficient algorithm for computing this kernel. We demonstrate the effectiveness of our kernel on classification scenarios and for change detection using sequential datasets and transaction databases.
Year
DOI
Venue
2016
10.1007/978-3-319-31753-3_22
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT I
Field
DocType
Volume
Kernel (linear algebra),Data mining,Change detection,Injective function,Pattern recognition,Kernel embedding of distributions,Computer science,Support vector machine classifier,Artificial intelligence,Small set,Machine learning,Kernel (statistics)
Conference
9651
ISSN
Citations 
PageRank 
0302-9743
0
0.34
References 
Authors
0
3
Name
Order
Citations
PageRank
A. Ibrahim120.69
P. Sastry223512.27
Shivakumar Sastry37913.63