Title
Outlier Robust Geodesic K-means Algorithm for High Dimensional Data.
Abstract
This paper proposes an outlier robust geodesic K-mean algorithm for high dimensional data. The proposed algorithm features three novel contributions. First, it employs a shared nearest neighbour (SNN) based distance metric to construct the nearest neighbour data model. Second, it combines the notion of geodesic distance to the well-known local outlier factor (LOF) model to distinguish outliers from inlier data. Third, it introduces a new ad-hoc strategy to integrate outlier scores into geodesic distances. Numerical experiments with synthetic and real world remote sensing spectral data show the efficiency of the proposed algorithm in clustering of high-dimensional data in terms of the overall clustering accuracy and the average precision.
Year
DOI
Venue
2016
10.1007/978-3-319-49055-7_23
Lecture Notes in Computer Science
Keywords
Field
DocType
Clustering,K-means,High-dimensional data,Geodesic distance,Shared nearest neighbour,Local outlier factor
Local outlier factor,k-means clustering,Clustering high-dimensional data,Pattern recognition,Computer science,Outlier,Metric (mathematics),Artificial intelligence,Cluster analysis,Data model,Geodesic
Conference
Volume
ISSN
Citations 
10029
0302-9743
0
PageRank 
References 
Authors
0.34
8
3
Name
Order
Citations
PageRank
Aidin Hassanzadeh121.06
Arto Kaarna217427.50
Tuomo Kauranne3429.71