Analyzing Similarities of Datasets Using a Pattern Set Kernel. - Citegraph

Paper Info

Title
Analyzing Similarities of Datasets Using a Pattern Set Kernel.

Abstract
In the area of pattern discovery, there is much interest in discovering small sets of patterns that characterize the data well. In such scenarios, when data is represented by a small set of characterizing patterns, an interesting problem is the comparison of datasets, by comparing the respective representative sets of patterns. In this paper, we propose a novel kernel function for measuring similarities between two sets of patterns, which is based on evaluating the structural similarities between the patterns in the two sets, weighted using their relative frequencies in the data. We define the kernel for injective serial episodes and itemsets. We also present an efficient algorithm for computing this kernel. We demonstrate the effectiveness of our kernel on classification scenarios and for change detection using sequential datasets and transaction databases.

Year	DOI	Venue
2016	10.1007/978-3-319-31753-3_22	ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT I
Field	DocType	Volume
Kernel (linear algebra),Data mining,Change detection,Injective function,Pattern recognition,Kernel embedding of distributions,Computer science,Support vector machine classifier,Artificial intelligence,Small set,Machine learning,Kernel (statistics)	Conference	9651
ISSN	Citations	PageRank
0302-9743	0	0.34
References	Authors
0	3

Authors (3 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
A. Ibrahim	1	2	0.69
P. Sastry	2	235	12.27
Shivakumar Sastry	3	79	13.63

1