Title
Clustering spatial data by the neighbors intersection and the density difference.
Abstract
Clustering is a classical unsupervised learning task, which is aimed to divide a data set into several groups with similar objects. Clustering problem has been studied for many years, and many excellent clustering algorithms have been proposed. In this paper, we propose a novel clustering method based on density, which is simple but effective. The primary idea of the proposed method is given as follows. Firstly, the point with the largest local density in a cluster is considered as the cluster center. The local density of each point is estimated based on the distance (called radius) between the point and its k-th nearest neighbor. The point with a smaller radius indicates a larger local density. Secondly, the difference of the local densities between each two internal points should be small, while the difference between the density of a border point and the density of an internal point should be relatively large. Thirdly, if the intersection of k nearest neighbors of two points is small, they should be assigned to different clusters. The proposed algorithm has been compared with a typical clustering algorithm named FDPCluster, and the experimental results show that our algorithm has better clustering quality.
Year
DOI
Venue
2016
10.1145/3006299.3006332
Proceedings of the 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies
Keywords
Field
DocType
data mining, density-based custering, spatial data, clustering
Data mining,Fuzzy clustering,CURE data clustering algorithm,Computer science,Artificial intelligence,Nearest-neighbor chain algorithm,Cluster analysis,Single-linkage clustering,k-medians clustering,Pattern recognition,Correlation clustering,DBSCAN,Machine learning
Conference
ISBN
Citations 
PageRank 
978-1-5090-4468-9
1
0.36
References 
Authors
15
4
Name
Order
Citations
PageRank
Zhenglong Yan150.79
Wenjian Luo271.48
Chenyang Bu3479.18
Li Ni431.22