Title
A feasible k-means kernel trick under non-Euclidean feature space
Abstract
AbstractAbstractThis paper poses the question of whether or not the usage of the kernel trick is justified. We investigate it for the special case of its usage in the kernel k-means algorithm. Kernel-k-means is a clustering algorithm, allowing clustering data in a similar way to k-means when an embedding of data points into Euclidean space is not provided and instead a matrix of “distances” (dissimilarities) or similarities is available. The kernel trick allows us to by-pass the need of finding an embedding into Euclidean space. We show that the algorithm returns wrong results if the embedding actually does not exist. This means that the embedding must be found prior to the usage of the algorithm. If it is found, then the kernel trick is pointless. If it is not found, the distance matrix needs to be repaired. But the reparation methods require the construction of an embedding, which first makes the kernel trick pointless, because it is not needed, and second, the kernel-k-means may return different clusterings prior to repairing and after repairing so that the value of the clustering is questioned. In the paper, we identify a distance repairing method that produces the same clustering prior to its application and afterwards and does not need to be performed explicitly, so that the embedding does not need to be constructed explicitly. This renders the kernel trick applicable for kernel-k-means.
Year
DOI
Venue
2020
10.34768/amcs-2020-0052
Periodicals
Keywords
DocType
Volume
kernel methods, k-means, clustering, non-Euclidean feature space, Gower/Legendre theorem
Journal
30
Issue
ISSN
Citations 
4
1641-876X
0
PageRank 
References 
Authors
0.34
0
3
Name
Order
Citations
PageRank
Robert Kłopotek100.34
Mieczyslaw A. Klopotek236678.58
Sławomir Wierzchoń300.34